Site Reliability Engineer managing Workday's Kubernetes platform for U.S. Federal Government. Ensuring high availability and security while collaborating with engineering teams.
Responsibilities
Ensuring the Workday Kubernetes based platform is maintained, healthy, and ensures high availability for our customers through, infrastructure automation, CI/CD pipelines, reporting, incident handling and response, and observability tools.
Maintain core platform components, ensuring high availability, scalability, and security.
Automate infrastructure provisioning, configuration management, and application deployments using tools like Terraform and Argo CD.
Provide support and solve for platform-related issues, working closely with development teams to resolve problems.
Implement and maintain security standard methodologies for the platform, ensuring compliance with industry standards.
Build and maintain comprehensive documentation for platform components and processes.
Actively participate in knowledge sharing within the team.
Collaborate effectively with other engineers and development teams across multiple locations and time zones.
Stay up-to-date with the latest technologies and trends in the platform engineering space.
Requirements
A minimum of 5 years of hands-on experience working with large scale cloud infrastructure, automation, and overall DevOps methodologies.
Bachelor's degree in a computer related field or equivalent work experience.
Proficiency in infrastructure automation tools like Terraform.
Experience with building, maintaining, and consuming CI/CD pipelines and tools like Argo CD.
Strong analytical and problem-solving skills.
Excellent communication and collaboration skills.
Benefits
Workday Bonus Plan or role-specific commission/bonus
Java Full Stack and AWS DevOps Developer for Boeing's Manufacturing Quality Information Technology Team, maintaining and enhancing software systems and DevOps environments while ensuring compliance.
Senior DevOps Engineer at One Pass redefining health engagement, managing scalable cloud infrastructure and enhancing automation. Collaborate across teams to ensure system reliability and performance.
DevOps Engineer at One Pass building and improving cloud infrastructure in AWS. Collaborating with engineers on deployments, reliability, and automation in a fast - paced environment.
Senior Release Engineer designing CI/CD pipelines for Kaseware’s mission - critical software. Collaborating with engineering, security, and operations teams to ensure fast and reliable deployments.
Site Reliability Engineer maintaining cloud infrastructure reliability for Tecsys solutions. Collaborating across teams to support services and implement automation, observability, and frameworks.
DevOps Engineer managing Kubernetes and cloud infrastructure for innovative legal software startup. Collaborating with development teams and ensuring smooth deployment processes.
DevOps Architect defining and evolving AgencyBloc’s cloud and DevOps strategy. Leading design of infrastructure and CI/CD frameworks for secure and scalable SaaS platforms.
DevOps Engineer at VERBI Software GmbH managing AWS - centric infrastructure and driving reliability, scalability, and modernization. Hands - on role applying SRE principles to evolve towards cloud - native best practices.
Sr. DevSecOps Engineer I at MetroStar ensuring integration of security best practices in development and operations lifecycle. Collaborating in delivering high - quality solutions for federal government applications.