Senior SRE Engineer managing cloud infrastructure and driving Infrastructure-as-Code adoption for Resideo. Designing resilient systems while ensuring the health of cloud platforms.
Responsibilities
Maintain public cloud infrastructure by using at least one of the Cloud technology Azure or AWS or Google Cloud (GCP).
Build and Maintain cloud infrastructure automation (IaC) by using Terraform, ARM Templates or similar.
Build and Maintain IT automation using tools like Ansible, Chef or managing complex container-based applications like Helm for Kubernetes.
Build, delivery and deployment by using modern technologies like Git, Git Action, Jenkins, Octopus, Ansible, Docker, Kubernetes or similar.
Build and maintain observability and monitoring across different IT platforms by using Grafana, Prometheus, Elastic, DataDog or similar.
Be part of a L2 team that provides 24/7 support in troubleshooting IT platforms issues, when required (less than 20% of the working time).
Oversee all planned outages, assess RCA and assist with major upgrades to ensure minimum downtime.
Requirements
Minimum 3 years of working experience with at least one of the public cloud platforms. (Azure preferred but not required).
Minimum of 5 years Windows / Linux experience.
Minimum of 2 years Terraform or other IaC platforms experience.
Strong knowledge of Elastic, Grafana, Prometheus or other observability platforms (Datadog, Dynatrace, etc.).
Proven experience with running and/or managing large IT platform services with multiple availability regions.
Experience with container orchestration platform Docker or Kubernetes, or similar.
Strong English communication (written and oral) skills are required.
Benefits
Employment in a strong, well known international company and part of a global team.
Unlimited access to online training.
Flexible hybrid working arrangement to support work-life balance.
Meal ticket for each day worked.
Medical coverage to support your health and wellbeing.
Senior Site Reliability Engineer at Broadridge managing infrastructure design and operational support. Collaborating with teams to improve automation, performance, and reliability of services in a hybrid environment.
DevSecOps Engineer building and maintaining Azure DevOps cloud applications with API backend. Roles include developing CI/CD pipeline and automating backend tasks.
Reliability Engineer II at Cargill applying technical expertise to enhance process and asset reliability. Collaborating with teams to execute engineering strategies for equipment optimization in a salt mine setting.
Reliability Engineer applying technical knowledge to enhance process and asset reliability. Partnering with teams to implement reliability excellence activities and predictive maintenance programs.
Cloud & DevOps Engineer designing and maintaining infrastructure as code in cloud environments. Collaborating on application development interacting with APIs and AI solutions.
Senior Business Systems Analyst assisting in PLM Dev Ops at Arthrex. Involves supporting automation in deployment, testing, and monitoring of PLM systems.
Principal Software Engineer leading DevSecOps strategies for automated delivery and security across product engineering. Innovating CI/CD pipelines and embedding security practices in software delivery.
DevSecOps Engineer responsible for embedding security controls in CI/CD at Keyloop. Collaborate with engineering teams to integrate security in build and deployment workflows.
DevOps Engineer modernizing infrastructure for a fintech company focused on empowering e - commerce businesses. Engaging in hands - on work with GCP and Kubernetes to establish reliable, efficient deployment pipelines.
DevSecOps Engineer supporting AI - enabled financial compliance initiative for the Department of War. Responsible for designing secure infrastructure and collaborating with cross - disciplinary teams.