Site Reliability Engineer improving system reliability and performance in production environments with a focus on automation and operational efficiency. Collaborating with engineering and infrastructure teams on deliverable-focused projects.
Responsibilities
Design, develop, test, and deploy automation tools, scripts, and engineering solutions to improve the stability, performance, and efficiency of production systems.
Identify opportunities to automate manual operational processes and reduce operational overhead.
Support and improve the release and deployment lifecycle of applications, ensuring reliable and controlled production rollouts.
Collaborate with software engineers and infrastructure teams to troubleshoot and resolve system issues.
Contribute to system design discussions, platform management, and capacity planning.
Create and maintain clear technical documentation for automation tools, operational procedures, and reliability improvements.
Provide regular updates on progress and deliverables to engineering stakeholders.
Requirements
At least 1 year of professional software development or reliability engineering experience
Proficiency in one or more programming languages such as Python, C++, Java, or shell scripting
Strong understanding of Linux operating system internals
Solid knowledge of networking concepts and troubleshooting
Experience with modern version control systems such as Git
Familiarity with monitoring, logging, and CI/CD tools (e.g., Prometheus, Grafana, Splunk, Jenkins, GitLab CI) is highly beneficial.
Ability to work independently, manage priorities effectively, and deliver results with minimal supervision.
Excellent written and verbal communication skills, with the ability to clearly communicate technical topics to engineering stakeholders.
Ability to quickly learn new technologies and tools and work across multiple programming languages and frameworks.
SRE Team Lead in charge of reliability strategy and operational maturity for a cybersecurity SaaS platform. Leading a specialized team to enhance system performance and incident management.
Junior DevOps Engineer implementing continuous integration and deployment architecture for the Defense Logistics Agency. Debugging cluster - based computing while using various configuration management tools.
Mobile DevOps Engineer developing hybrid applications with Ionic for a global organization. Collaborate across teams to optimize development practices and maintain mobile environment.
Lead Virtualisation Engineer at Mastercard focused on service quality and performance of platform virtualisation technologies. Collaborate with teams to ensure availability, scalability, and resilience across the network in Singapore.
Senior DevOps Engineer in a technology consulting firm connecting tech talents to impactful projects. Involves working in healthy environments with growth opportunities.
DevOps Engineer at LRQA, optimizing deployments and driving process improvements in a global assurance provider. Focusing on CI/CD pipelines, security best practices, and team collaboration.
DevOps Engineer at Booz Allen enhancing critical systems for space operations. Modernizing architectures and collaborating with teams to solve complex challenges.
DevOps Developer managing cloud infrastructure and CI/CD pipelines for Volkswagen Group Services. Collaborating with teams to ensure stable and efficient software deployments in a hybrid work environment.
Analista Devops Pleno at Finnet managing cloud and infrastructure projects for client solutions. Involves architecture design, systems management, and team collaboration.