DevOps Engineer supporting automation and cloud platform technologies with team collaboration at Workday. Developing and managing CI/CD pipelines while enhancing infrastructure efficiency in a SaaS environment.
Responsibilities
Design infrastructure and automated systems to support our distributed architecture
Build and manage CI/CD pipelines, continually improving their reliability and speed and reducing lead time for changes
Forecast and plan for the infrastructure needs of a fast-growing SaaS company and find ways to improve efficiency and reduce cost
Trace performance bottlenecks and identify optimizations and improvements at both the infrastructure and application level
Collaborate with our engineering team to meet high SLO and SLA requirements from customers
Maintain highly available web and backend systems that serve millions of users and thousands of requests per second
Collaborate closely with developers to set up, configure, and plan the cloud services on AWS needed to support new feature development
Analyze system performance and capacity and plan for future growth
Secure our infrastructure at both the cloud layer (IAM) and application layer (PKI)
Build and expand monitoring and alerting systems for both infrastructure and business operations, using internal tools and integrating with established third-party SaaS tools
Establish comprehensive infrastructure-as-code coverage to support our entire platform
Develop tools to enhance and support Developer Productivity
Champion automation of manual processes and reduce operational overhead
Requirements
3+ years of DevOps or software development experience
3+ years of experience building, maintaining, and scaling database technologies such as PostgreSQL, MySQL, Redis, and DynamoDB
2+ years of experience orchestrating large-scale distributed microservice deployments on Kubernetes and EC2
2+ years of experience building and managing EKS clusters, with strong knowledge of the K8s ecosystem
2+ years of experience with Prometheus/Grafana/CloudWatch for metrics monitoring, the ELK/OpenSearch stack for logging, and PagerDuty for alerting
DevOps Platform Intern managing cloud infrastructure and deployment pipelines for AI-native software delivery. Partnering with a Product Development Intern to set up and manage containerized applications on Azure Kubernetes Service.
UNIX DevOps Engineer managing AIX and Solaris server operations for a Swiss telecom company. Focusing on automation, optimization, and 24/7 monitoring responsibilities across multiple locations.
Staff Site Reliability Engineer designing and building backend services for NordVPN. High-ownership role focusing on system architecture and operational excellence.
Senior Site Reliability Engineer managing VPN and DNS services to ensure performance and reliability. Collaborating with application teams to maintain security and quality across global infrastructure operations.
Senior Site Reliability Engineer managing globally distributed VPN and DNS services. Optimizing service performance and handling security posture in a hybrid work environment.
Senior Site Reliability Engineer focused on observability for NordVPN. Designing monitoring systems and collaborating with data teams on anomaly detection.
Senior Site Reliability Engineer ensuring content accessibility across global edge infrastructure for NordVPN. Designing and troubleshooting systems critical to internet traffic management.
Staff Site Reliability Engineer designing tools for Threat Protection Pro and NordLynx protocol. Working on globally distributed backend services for NordVPN with a focus on security and privacy.
Senior Site Reliability Engineer focused on observability for cybersecurity tools at NordVPN. Designing monitoring systems and collaborating on anomaly detection within distributed systems.
Senior Site Reliability Engineer focused on traffic engineering at NordVPN. Working to enhance the world's most advanced VPN and online security solutions.