Senior Site Reliability Engineer at SPOTIO managing reliability, scalability, and operational excellence of cloud-native infrastructure on Microsoft Azure. Collaborating with development teams in a hybrid work environment.
Responsibilities
Ensure the availability, performance, and scalability of SPOTIO’s production systems and infrastructure.
Define, implement and monitor SLIs/SLOs, error budgets, capacity planning, incident response and root-cause workflows.
Build and evolve automation tooling: provisioning, deployments (CI/CD), monitoring, alerting, self-healing mechanisms, infrastructure as code.
Partner with software engineering teams to help design Highly Available and scalable systems
Manage and optimize our cloud stack on Azure: compute, networking, storage, identity, security, cost-optimization, disaster recovery, high-availability.
Work with tools such as Azure, Cloudflare, Elasticsearch, Pulumi, Kubernetes, and GitHub
Build and maintain robust CI/CD pipelines (automation of builds, tests, releases, rollbacks) to accelerate safe feature delivery.
Participate in on-call rotations, perform incident triage, drive post-mortem analyses and remediation.
Advocate for reliability culture: mentor engineers, evangelize best practices, create documentation, run disaster recovery exercises.
Requirements
5+ years of experience in a Site Reliability, Platform, or DevOps role at scale (cloud-native environment).
3+ years of experience with containerization and orchestration (Kubernetes, AKS).
3+ years of experience with Microsoft Azure (IaaS, PaaS, networking, security, monitoring).
Hands-on experience with Cloudflare (CDN, DNS, WAF or similar), Elasticsearch (deployment/management/observability) and Pulumi (or similar IaC: Terraform, CDK).
Strong automation mindset and tooling experience: build/operate CI/CD pipelines, scripting (Python, .NET, JavaScript, or similar).
Experience with version control (Git / GitHub), branching strategies, release management.
Excellent troubleshooting skills. E.G. Strong ability to dig into production issues, latency, MTTR, and drive remediation.
Strong communication and collaboration skills w/ experience working across teams.
Proactive, self-driven, and comfortable in a rapidly-evolving startup/scale-up environment.
Benefits
**What we can offer you:**
Interesting work and real impact on the product, organization and technology selection
Cloud DevOps Engineer position at AIS supporting federal clients with cloud - based solutions. Responsibilities include CI/CD pipeline implementation and systems functionality oversight.
Product Software DRE overseeing software quality and feature tracking for GM's Body Control Module. Collaborating across Agile Release Trains to ensure effective software development and release integration.
Infrastructure & DevOps Engineer at Intel focusing on innovative infrastructure solutions and optimal CI/CD pipelines. Collaborating on product development and maintenance of system performance.
Senior Site Reliability Engineer managing the reliability and operational health of the Loan Origination System for a fintech company. Collaborating with engineering teams in Brazil and the US to improve system reliability.
Cloud Engineer working with Azure DevOps and digital transformation in a global team at EY. Collaborating on cloud engineering projects and supporting CI/CD pipeline development.
DevOps Engineer creating better conditions for developers in Saab's defence technology. Collaborating with developer teams for effective continuous development and delivery of software.
Ingénieur Infrastructure DevOps chez Bull, renforçant l'équipe AdminLab Echirolles. Travailler sur des infrastructures Linux et des pratiques d'automatisation dans un environnement HPC.
Product Quality & Reliability Engineer developing quality/reliability standards for Applied Materials. Design methods for testing products and analyze operational data in a supportive team environment.
DevOps System Engineer creating and managing infrastructure for ESET's global SaaS service. Collaborating with tech teams to maintain secure and stable operations.
Provides expertise in business applications design and functionality. Supports users and validates technical designs for alignment with business needs.