Senior DevOps Engineer managing AWS and GCP infrastructure in a mission-driven healthcare company. Focus on automation, stability, and performance across cloud environments.
Responsibilities
Design, deploy, and manage highly available, scalable infrastructure using Kubernetes and Docker across public cloud (AWS, GCP)
Develop and maintain robust Configuration Management solutions (Terraform) for consistent environment provisioning and management.
Implement and manage CI/CD pipelines to facilitate rapid, reliable, and automated software releases.
Administer and troubleshoot operating systems, encompassing Linux
Implement and optimize observability practices using monitoring tools like Prometheus for logging, tracing, and alerting.
Automate repetitive tasks and system operations using scripting languages, primarily Bash and Python.
Collaborate closely with development, data, and security teams to ensure infrastructure supports product requirements and compliance standards.
Participate in an on-call rotation to ensure service reliability and responsiveness to incidents.
Requirements
5+ years of professional experience in a DevOps, SRE, or infrastructure engineering role.
Deep expertise in containerization and orchestration, specifically Kubernetes (design, deployment, and troubleshooting) and Docker.
Strong proficiency in managing Cloud infrastructure Cloud (AWS, GCP).
Extensive experience with monitoring and observability platforms (tools like Prometheus/Grafana, New Relic).
2-3 years of experience in with cloud platforms like AWS/GCP/Azure.
Solid understanding of Web Servers, Networking, Load Balancers, Nginx, etc.
Proficiency in scripting languages like Bash, Python or equivalent.
Lead DevOps Engineer focused on AWS and Azure data platform solutions. Collaborating with teams to deliver scalable, secure, and highly available solutions.
DevOps Engineer working at GRÜN Software Group to automate and maintain stable infrastructures. Collaborating with teams to improve deployments and processes for better performance.
Linux System Administrator managing IT infrastructures for educational institutions and research. Collaborating on DevOps and HPC projects while ensuring system security and performance.
Azure SRE Engineer responsible for designing and maintaining secure, scalable Azure cloud infrastructure. Driving automation and operational excellence for leading organizations in technology transformation.
Senior Manager of Site Reliability Engineering overseeing Workday Kubernetes based platform. Leading teams while ensuring high availability and collaborating with federal agencies.
Site Reliability Engineer focusing on AWS cloud environments, SRE practices, and system reliability within GFT's team. Collaborating on cloud migrations and observability initiatives.
Senior DevOps Analyst enhancing infrastructure automation in a transformative technology firm. Collaborating on innovative projects in sectors like healthcare, finance, and utilities in Brazil.
Consultant at Minsait supporting technical decisions in infrastructure automation and developing solutions. Collaborating with teams for maintaining and evolving automation platforms.
Practical Trainee focusing on hardware reliability engineering at Sonova. Support reliability improvement initiatives and work closely with experienced engineers on real - life product challenges.