Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross-functional teams for incident management and performance tuning.
Responsibilities
Partner with engineering teams to improve system reliability and deployment practices
Engage with Openloop teams on SRE guidelines and best practices about automation and infrastructure
Work with security teams to implement secure, compliant infrastructure
Ensure 24/7 system availability and rapid incident response
Implement and maintain disaster recovery and business continuity plans
Skilled at performance tuning — identifying bottlenecks at infra, app, and database layers.
Advocate for blameless culture and continuous improvement.
Collaborate closely with product and engineering to make reliability a shared responsibility.
Requirements
2 - 3 years of experience in infrastructure, DevOps, or Site Reliability Engineering
Good background in AWS, particularly with serverless architectures
Understanding of observability and incident management
Strong knowledge in at least one programming language (Typescript, Python, Go, etc.). Previous experience as a Developer is a plus
Knowledge of Linux/Unix systems and networking
Experience with Infrastructure as Code (AWS CDK, Cloudformation)
Experience managing monitoring and observability tools (Prometheus, Grafana, ELK, etc.)
Knowledge of CI/CD pipelines and deployment automation (Github Actions, GitLab CI, etc)
Understanding of database systems and performance optimization
Leadership & Communication
English (C1) fluency
Excellent verbal and written communication skills
Ability to translate technical concepts to non-technical audiences
Good problem-solving and decision-making capabilities
Experience with agile methodologies
Benefits
Formal employment (“Planilla”) under a Peruvian entity — all legal benefits in soles (CTS, Gratificaciones, etc.).
Full-time schedule: Monday–Friday, 9am–6pm.
Unlimited vacation days 🏖️ — yes, we mean it!
EPS healthcare (Rimac) covered 100%.
Oncology insurance (Rimac) covered 100%.
AFP retirement plan.
Coworking access in Miraflores, Lima — with free beverages, talks, bicycle parking, and amazing city views.
Senior Fleet Reliability Engineer maintaining high fleet uptime for autonomous vehicle technology. Collaborating with technical teams to ensure peak operational performance in data collection efforts.
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
SRE Lead developing scalable cloud - native solutions for mission - critical systems supporting USAF. Managing teams, collaborating with cross - functional units, and ensuring high service reliability standards.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI - powered learning platform. Collaborating with cross - functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI - powered learning platform. Involves managing multi - tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for cybersecurity solutions provider. Collaborating with cross - functional teams for reliable and scalable cloud solutions.