Site Reliability Engineer focused on developing and improving Kubernetes configurations for F5's infrastructure. Collaborating with product teams and ensuring operational delivery processes are efficient and reliable.
Responsibilities
Develop high-quality services, lead design discussions, and execute development against the agreed design so that development teams can use the services in a self-service model.
Coordinate with product and platform teams on regular maintenance, and improve the availability, scalability, and performance of the CI/CD environment.
Collaborate with product teams and work cross-functionally with the F5 IT department and vendors to implement the services and automation required to support application use cases.
Actively engage with internal teams to develop tooling and frameworks that drive full observability and automation of the environment.
Ensure adherence to architecture standards and roadmaps.
Drive digital innovation by using new technologies and approaches to renovate, extend, and transform the existing core technology base.
Ensure that post-production operational processes and deliverables are well designed and implemented before the project moves into the solution support phase.
Define and create development procedures, processes, and scripts to drive a standard software development lifecycle.
Assist in the evaluation, selection, and implementation of new technologies with product teams to ensure adherence to architecture guidelines for new technology introduction.
Provide technical leadership on establishing standards and guidelines.
Facilitate collaboration between development and operations teams throughout the application lifecycle.
Partner with Corporate Information Security to ensure all security policies and audit inquiries are addressed.
Coordinate and align all other technology teams to ensure operational delivery processes are governed and monitored to expedite issue remediation.
Requirements
2 to 3 years of experience developing and implementing CI/CD automation, performance tuning, and scaling applications.
Direct experience with automation to deploy, manage, and maintain complex Kubernetes installations.
1 to 2 years of experience with open-source technologies and cloud services, preferably Azure.
Experience with microservice architecture and development.
Hands-on development experience with one or more general-purpose programming languages, including but not limited to Python, JavaScript, and Go.
Infrastructure deployment experience using technologies such as Terraform and Ansible.
Excellent working knowledge of system environments – operating systems, networking, applications, platforms, and databases.