Develop infrastructure through reusable code with automated tooling and tests.
Ensure services are monitored effectively using application and system metrics.
Understand the complexities of working with multi-tier applications and scaling of distributed systems across data centres and cloud.
Design architecture with security in mind.
Lead the delivery of new features and tooling for both business and technical stakeholders.
Provide tailored communications to different stakeholders.
Support engineers in accomplishing their daily work, including building CI/CD pipelines (we use Buildkite), writing Terraform modules or Helm chart, providing advice in application design.
You focus on operational excellence, constructively identifying problems and proposing solutions.
On-call rotation where required.
Contribute to team recruitment.
Mentor and help others grow.
Requirements
AWS - everything from IAM, Lambda, Cloudfront, RDS and DynamoDB.
Docker experience is required.
Kubernetes and Helm experience represents a must.
Terraform Code in one or more programming languages, such as Python , Node.JS , Java.
Use of a monitoring platform (we use Datadog) for application observability
Familiar with Git and GitHub
Knowing how an app should be designed and built for the cloud
Being comfortable authoring documentation and sharing with your peers
Network & Datacenter Deployment Engineer at Cloudflare focused on building and expanding their global network infrastructure with collaboration across multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.
Platform Engineer focusing on supporting CI/CD pipelines and Kubernetes at PCCW. Responsible for ensuring platform services' reliability and performance, with night - time support as needed.
Site Reliability Engineer at Bumble optimizing large - scale Linux environments and ensuring system stability. Focusing on troubleshooting, incident recovery, and performance tuning in complex infrastructures.
Senior DevOps Manager overseeing CI/CD processes for NVIDIA Networking products. Leading a team and collaborating with global teams to enhance R&D efficiency and infrastructure.
DevOps Manager overseeing engineering team developing scalable CI/CD processes for NVIDIA Networking products. Enhancing global R&D efficiency in a technology - focused company.
Join Operations Team as Senior Site Reliability Engineer driving operational excellence for cybersecurity solutions. Collaborate across teams to manage production platforms and optimize infrastructure.
Software Developer - DevOps System Administrator working within the SCMT team to enhance software application efficiency. Collaborating on tools and scripts for application lifecycle management.