Hands-on Manager of Site Reliability Engineering leading a team and ensuring availability of production infrastructure at Avalon Healthcare Solutions. Collaboration with security and product teams to enhance operational excellence.
Responsibilities
Lead, mentor, and grow a high-performing team of SREs through coaching, training, goal-setting, and performance feedback.
Collaborate closely with developer, security, and product teams to balance reliability with feature delivery velocity.
Own the end-to-end incident response process, including on-call management (PagerDuty), escalation handling, and RCA facilitation.
Establish and enforce SLIs, SLOs, and error budgets in alignment with business priorities.
Implement and maintain end-to-end monitoring, alerting, and observability using tools such as CloudWatch, Prometheus, and Grafana.
Machine Learning Engineer responsible for designing and maintaining ML infrastructure on AWS at Roche. Key role in revolutionizing drug discovery using machine learning techniques with a close - knit team.
Senior Site Reliability Engineer operating scalable services in Azure and Kubernetes environments with a focus on reliability and performance improvements.
HPC Architect designing and optimizing high - performance computing solutions for semiconductor equipment. Collaborating with cross - functional teams to enhance compute workload capabilities.
Senior Site Reliability Engineer ensuring reliability, automation, and observability across cloud infrastructure. Focused on building self - service tools and improving performance in fast - paced environments.
Maintenance and Reliability Engineer optimizing preventive maintenance at VistaPrint's automated production facility in Venlo. Collaborating with cross - functional teams to drive continuous improvement in maintenance practices.
Senior Site Reliability Engineering Program & Compliance Manager leading process governance and operational maturity for infrastructure services at cloud contact center provider Five9.
Senior Site Reliability Engineer at Five9 designing Kubernetes on bare metal and hypervisor platforms within private cloud environments. Responsible for architecture, design, and standardization in infrastructure and automation.
DevOps engineer supporting Jenkins - based CI/CD platform in Luxembourg. Managing cloud infrastructure and providing core banking systems support within a collaborative team.
Software Engineer - DevSecOps designing modern software systems for aerospace programs at Northrop Grumman. Collaborating with multi - disciplinary teams in an Agile environment to implement DevSecOps lifecycle.
Principal Software Engineer focused on DevSecOps software factory at Northrop Grumman. Working with multi - disciplinary teams to implement DevSecOps practices for aerospace programs across various locations.