Senior DevOps Engineer operating AWS infrastructure and Kubernetes for BlueCat Cloud SaaS platform. Focused on automation and operational stability while collaborating with cross-functional teams.
Responsibilities
Own the day-to-day operation, reliability, and performance of production services running on AWS.
Operate and support containerized workloads across ECS and Kubernetes (EKS) environments.
Maintain and evolve an EKS-based platform, including cluster upgrades, add-ons, and operational tooling.
Manage Kubernetes workloads using Helm and standard deployment and release practices.
Build, maintain, and improve CI/CD pipelines to support safe, repeatable, and efficient deployments.
Automate infrastructure and operational workflows using Infrastructure as Code (Terraform preferred).
Participate in an on-call rotation, respond to customer-impacting production incidents, and lead troubleshooting efforts.
Drive incidents through resolution, perform root cause analysis (RCA), and implement preventative improvements.
Troubleshoot Kubernetes networking, ingress, service discovery, and workload-level issues.
Implement and maintain monitoring, alerting, and logging solutions (CloudWatch, Prometheus, Grafana, InfluxDB, etc.).
Partner with application teams to ensure services are production-ready and operationally supportable.
Work closely with engineers across Toronto and Serbia teams to support production systems.
Provide technical guidance and informal mentorship to junior DevOps and SRE engineers.
Requirements
5–8+ years of experience in DevOps, cloud infrastructure, or production operations roles.
Graduate Reliability Engineer at GKN Aerospace enhancing operational excellence through data analysis and project participation within large structural assemblies.
Site Reliability Engineer at WRITER, ensuring 24/7 availability and performance of AI - powered workflows. Collaborating on scalable infrastructure solutions while impacting enterprise customer trust.
Engineer at Trading Technologies improving platform stability through coding and automation. Focus on building advanced monitoring tools for global trading operations.
Senior ML Ops/DevOps developing MLOps platform components at Capco Poland for financial digital transformation. Responsibilities include CI/CD, model deployment, monitoring, and team collaboration.
Senior DevOps Engineer at Verisk, focusing on AWS infrastructure and CI/CD pipeline automation. Ensuring high availability and security through collaboration with development and QA teams.
Senior DevOps & Infrastructure Engineer at IMAGO focusing on automation and infrastructure improvements. Building reliable infrastructure and leading CI/CD optimization in a dynamic environment.
DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting - edge technologies in a dynamic environment.
Software Quality and Release Engineer developing and maintaining C++/Python software solutions for aerospace and defense industry. Collaborating on CI/CD automation and feedback documentation.
Senior DevOps Engineer building and managing big data platforms for clients in telecommunications and finance industries. Ensuring stability, scalability, and performance across cloud and on - premise environments.