Staff SRE at Insulet managing teams to ensure system reliability and scalability. Driving best practices in Site Reliability Engineering with a focus on automation and modern technologies.
Responsibilities
Provide technical guidance and mentorship to the SRE team.
Drive the implementation of best practices in reliability, scalability, and performance.
Lead by example, demonstrating excellence in technical skills and problem-solving.
Collaborate with cross-functional teams to design scalable, resilient, and efficient systems.
Architect and implement infrastructure solutions that meet the requirements of high availability and performance.
Drive the adoption of modern technologies and tools to improve system reliability and efficiency.
Develop and maintain automation tools for provisioning, deployment, and monitoring.
Automate routine tasks to improve operational efficiency and reduce manual intervention.
Design and implement monitoring solutions to proactively identify issues and prevent service disruptions.
Lead incident response efforts, conducting post-mortem analysis, and implementing measures to prevent recurrence.
Develop & Automate runbooks and playbooks to streamline incident resolution processes.
Conduct capacity planning exercises to ensure systems can handle current and future loads.
Identify performance bottlenecks and optimize system performance through tuning and optimization efforts.
Collaborate with development teams to design and implement scalable architectures.
Document system architectures, configurations, and procedures.
Promote knowledge sharing within the team through technical presentations, workshops, and documentation.
Requirements
Bachelor’s in computer science, Engineering, or a related field.
9+ years of experience in the field including 5+ Site Reliability Engineering, DevOps, or a similar role.
Proven experience architecting and managing highly available, scalable, and fault-tolerant systems.
Strong understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes).
In-Depth knowledge of AWS services including VPC, Lambda, IAM, ELB, EC2, ECS, CloudWatch, API Gateway, S3, SQS, SNS, WAF, X-Ray, and Route53 or GCP services including VPC, Cloud Functions, IAM, Cloud Load Balancing, Compute Engine, Google Kubernetes Engine (GKE), Stackdriver, API Gateway, Cloud Storage, Pub/Sub, Firebase Cloud Messaging, Cloud Armor, Cloud Trace, Cloud DNS
Experience with infrastructure as code tools such as Terraform, Ansible, or similar.
Excellent troubleshooting and problem-solving skills.
Strong communication and leadership skills, with the ability to collaborate effectively with cross-functional teams.
Experience leading and mentoring engineering teams is highly desirable.
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
SRE Lead developing scalable cloud - native solutions for mission - critical systems supporting USAF. Managing teams, collaborating with cross - functional units, and ensuring high service reliability standards.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI - powered learning platform. Collaborating with cross - functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI - powered learning platform. Involves managing multi - tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for cybersecurity solutions provider. Collaborating with cross - functional teams for reliable and scalable cloud solutions.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes at NordLayer. Collaborating with Senior Engineers to implement best practices in a dynamic cybersecurity environment.