Site Reliability Engineer working on cloudification of backup services at Expleo. Contributing to infrastructure evolution with a team of skilled engineers.
Responsibilities
Join our dynamic Backup Services team as a Site Reliability Engineer working on the exciting cloudification of our backup platform using Commvault technology.
You'll be part of a growing team of 5-10 highly skilled engineers within the Cloud Services division, contributing to the evolution of our critical backup infrastructure.
Design and implement reliable, scalable backup infrastructure solutions using Commvault in cloud environments
Lead the new backup platform deployment project
Monitor, troubleshoot, and optimize backup platform performance and availability
Collaborate with cross-functional teams to ensure backup service reliability and disaster recovery capabilities
Implement automation and best practices to enhance operational efficiency and system resilience
Participate in on-call rotations and incident response to maintain 24/7 service availability
Requirements
Academic or Bachelor level education or equivalent experience
Minimum 3-5 years of Linux experience in a high-availability (HA) environment
Self-starter, autonomous
Comfortable with software development and programming
Ability to engage with both technical and non-technical staff at all levels in the organization
Quick learner
Ability to make your way through a complex automation stack autonomously
Automation is your answer to (almost) everything
Open source enthusiast
Believe in Infrastructure as Code
Aim for zero-support solutions and self-healing systems
Security minded
Ability to balance trade-offs: stability vs. agility, operational work vs. software engineering, proactive vs. reactive work
Technical knowledge of Commvault or another major backup solution
Excellent knowledge of Red Hat / CentOS Linux
Experience with version control and related tooling (Git, GitLab, Bitbucket), Git workflows, and CI/CD
Experience with Puppet
Experience with Terraform
Comfortable with scripting and automation (Bash, Python, Ruby, Puppet, CI/CD pipelines)
Understanding of network protocols (IP, DHCP, DNS, BGP, load balancing)
Experience with programming languages (Java, Golang, Python)
Experience with multi-DC setup in different countries
Experience with hybrid infrastructure (on premises + cloud)
Experience with security standards (PCI-DSS)
Excellent communication skills in English
Proficiency in at least one programming language (Python, Go, Ruby, Perl)
Excellent problem-solving and analytical skills
Strong communication and collaboration skills
Benefits
Holiday Voucher
Private medical insurance
Performance bonus
Easter and Christmas bonus
Employee referral bonus
Bookster subscription
7card
Work from home options depending on project
Job title
Senior Site Reliability Engineer – Backup Services