Senior Site Reliability Engineer responsible for designing scalable systems at Euna Solutions. Collaborating with developers and mentoring juniors while driving automation and reliability.
Responsibilities
Design & implement highly available, scalable, and fault-tolerant systems with a programming-driven approach to problem-solving.
Partner closely with software developers, applying your multi-language programming skills (e.g., Python, Go, Java, or others) to build tools, services, and automation that improve reliability.
Drive adoption of Infrastructure as Code (IaC) using Terraform and other technologies, ensuring repeatable, version-controlled deployments.
Design, build, and maintain CI/CD pipelines — integrating automated testing, linting, and deployment strategies informed by software development best practices.
Implement and manage observability solutions (monitoring, logging, tracing) that provide actionable insights into application performance and infrastructure health.
Participate in code reviews for infrastructure-related services, promoting high-quality, maintainable, and secure code.
Mentor junior engineers on both SRE principles and coding standards across languages.
Participate in incident response activities, perform root cause analysis, and implement long-term preventative measures — often via code-driven solutions.
Evaluate and integrate new tools, frameworks, and programming techniques to improve operational efficiency and team productivity.
Contribute to the technical direction of the SRE team, shaping priorities with a developer’s mindset.
Requirements
Bachelor’s degree in Computer Science, Software Engineering, or equivalent practical experience.
6+ years of combined experience in SRE, DevOps, or software engineering roles.
Proven expertise in designing and supporting distributed systems at scale.
Solid professional experience in multiple programming languages (e.g., Python, Go, Java, C#, or JavaScript/TypeScript) with strong debugging and code optimization skills.
Hands-on experience with IaC tools — especially Terraform.
Mechanical/Reliability Engineer responsible for mechanical installations in Bergen op Zoom. Analyzing maintenance strategies and leading projects to enhance reliability.
Senior DevOps Engineer responsible for cloud infrastructure and deployments. Optimizing AWS services and ensuring system security and reliability for Verizon.
Senior DevOps Engineer responsible for automating infrastructure and building CI/CD pipelines for collaborative robotics company. Collaborating with global engineering teams from the Bangalore office.
Site Reliability Engineer Intern at Tencent working on gaming services and cloud native solutions. Collaborating with global teams to eliminate toil and enhance reliability.
Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.
Cloud/Devops Specialist responsible for designing a hybrid architecture combining cloud and on - premises infrastructure for energy trading systems. Collaborating with a multidisciplinary team in a dynamic environment.
Reliability Engineering Specialist utilizing reliability tools and models to improve asset performance at Enbridge. Collaborating across teams to guide investment decisions for safe operations.
DevOps Engineer responsible for structuring and supporting cloud DevOps architecture in Brazil. Working strategically on automation and CI/CD practices with development teams in Pernambuco.
DevSecOps Software Engineer developing secure CI/CD pipelines for Boeing's military software systems. Collaborate with cross - functional teams and implement automation and security best practices.
DevOps Manager responsible for managing a team for multi - cloud solutions supporting the USAF Cloud One project. Focus on scalable cloud - native solutions and CI/CD practices.