Site Reliability Engineer at HPE designing, building, and optimizing cloud infrastructure and deployment systems. Enhancing operational efficiency and security across platforms with cross-team collaboration.
Responsibilities
Enhance Infrastructure as Code (IAC) and enforce best practices.
Optimize cloud infrastructure for scalability, security, and cost-effectiveness.
Develop internal tools to support and streamline cloud platform operations.
Improve CI/CD pipelines and deployment workflows using FluxCD and Jenkins.
Address container image vulnerabilities and standardize remediation processes.
Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks.
Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools.
Troubleshoot complex production issues to ensure system reliability and customer satisfaction.
Fine-tune distributed systems such as Apache Kafka and Cassandra.
Collaborate with development, security, and operations teams to align infrastructure with application needs.
Requirements
Minimum of 10 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE).
Proficiency with Linux systems, especially Debian-based distributions.
Strong experience with cloud platforms such as AWS and GCP.
Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
Solid programming skills in Python and/or Golang.
Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE).
Experience with GitOps workflows.
Proven track record in implementing and maintaining CI/CD pipelines.
Strong background in security and familiarity with security programs.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
Knowledge of both relational (SQL) and non-relational databases.
Excellent problem-solving and debugging skills with a strong sense of ownership.
Experience managing distributed systems like Apache Kafka and Cassandra.
Effective communicator and collaborative team player.
Benefits
Health & Wellbeing: A comprehensive suite of benefits that supports physical, financial and emotional wellbeing.
Personal & Professional Development: Specific programs catered to helping you reach any career goals you have.
Unconditional Inclusion: An inclusive work environment that celebrates individual uniqueness.
Senior Manager leading DevSecOps & SRE practices for transforming pharmacy prior authorization solutions at CVS Health. Overseeing agile teams and enhancing security and reliability in hybrid environments.
DevSecOps Engineer at Nelnet provisioning and monitoring AWS cloud infrastructure. Supporting security standards and cross - functional teams while automating deployment processes.
Backend Developer focusing on .NET and observability at Beyond Soluções. Collaborating on high - impact technology projects in a hybrid work environment.
Senior DevOps Engineer supporting Navy customer in architecture and development using Agile methodologies. Focused on CI/CD, Software CM, and system evaluations.
Site Reliability Engineer responsible for enhancing cloud infrastructure and deployment systems. Key role in scalability and operational efficiency at Hewlett Packard Enterprise.
Senior Software Engineer developing monitoring and observability tools for transportation technology company Waabi. Leading architecture and collaboration while optimizing performance across cloud and on - prem environments.
Senior DevOps Engineer leading GitLab migration projects for telecommunications at Capgemini Engineering. Involvement in digital transformation with cutting - edge technologies.
DevOps Engineer at Welldoc enhancing software infrastructure and managing CI/CD pipelines in Bangalore. Collaborating with development teams and implementing cloud solutions.
DevOps Engineering Intern at ASSA ABLOY working on cloud technologies and automation. Building infrastructure on AWS and contributing to CI/CD pipelines in a hybrid work environment.
Senior DevOps Engineer designing and maintaining CI/CD pipelines for Solace Cloud. Collaborating with teams on AWS and Kubernetes to enhance developer experiences.