DevOps Engineer improving reliability and stability of cloud services at Madhive. Responsibilities include CI/CD tooling, monitoring, and cloud infrastructure management.
Responsibilities
Improve the reliability and stability of Madhive’s cloud services, operating primarily in a mix of AWS and Google Cloud, with more of the latter.
Design, build, and maintain CI/CD tooling for Infrastructure as Code and internal services (GitHub Workflows, CloudBuild).
Develop and support monitoring, alerting, and observability systems to ensure platform health.
Automate deployment and management of cloud infrastructure using Terraform, Helm, and other IaC tooling.
Administer, monitor, and optimize databases to ensure performance, reliability, and availability.
Implement database backup, recovery, and scaling strategies to support large-scale distributed systems.
Enforce cloud security best practices (IAM, permissions, policies).
Identify opportunities to optimize cloud services and databases for efficiency and cost control.
Collaborate with cross-functional guilds to establish operational standards and reduce risk.
Stay current on emerging cloud and database technologies, evaluating for potential adoption.
Requirements
Strong understanding of cloud infrastructure, networking, containerization, and distributed systems (GCP preferred, AWS/Azure a plus).
Hands-on experience with Infrastructure as Code (Terraform or similar).
Proficiency with Bash and command-line utilities; Golang experience required (PHP/Python/JavaScript nice to have).
Experience with containerization and orchestration (Docker, Kubernetes).
Solid background in Database Administration: provisioning, scaling, tuning, monitoring, backup/recovery, and troubleshooting.
Familiarity with database performance optimization and observability tools.
Experience with monitoring systems (Google Cloud Monitoring Suite, Datadog, Cloudwatch, etc.).
Strong troubleshooting and problem-solving skills, with a systematic approach.
Excellent written and verbal communication skills; able to document and share best practices.
Comfortable in a fast-paced environment with a growth mindset and eagerness to learn.
Benefits
We embrace our differences and believe they fuel our creativity.
We come from varied backgrounds and think that’s important.
We are all trail-blazing team players who think big and want to make an impact.
We are committed to cultivating a culture of inclusion and collaboration.
We welcome diversity in education, culture, opinions, race, ethnicity, gender identity, veteran status, religion, disability, sexual orientation, and beliefs.
Mechanical/Reliability Engineer responsible for mechanical installations in Bergen op Zoom. Analyzing maintenance strategies and leading projects to enhance reliability.
Senior DevOps Engineer responsible for cloud infrastructure and deployments. Optimizing AWS services and ensuring system security and reliability for Verizon.
Senior DevOps Engineer responsible for automating infrastructure and building CI/CD pipelines for collaborative robotics company. Collaborating with global engineering teams from the Bangalore office.
Site Reliability Engineer Intern at Tencent working on gaming services and cloud native solutions. Collaborating with global teams to eliminate toil and enhance reliability.
Cloud/DevOps Specialist at N5X managing and optimizing critical cloud infrastructures for Brazilian energy trading. Collaborating with a multidisciplinary team to ensure high availability and performance.
Cloud/Devops Specialist responsible for designing a hybrid architecture combining cloud and on - premises infrastructure for energy trading systems. Collaborating with a multidisciplinary team in a dynamic environment.
Reliability Engineering Specialist utilizing reliability tools and models to improve asset performance at Enbridge. Collaborating across teams to guide investment decisions for safe operations.
DevOps Engineer responsible for structuring and supporting cloud DevOps architecture in Brazil. Working strategically on automation and CI/CD practices with development teams in Pernambuco.
DevSecOps Software Engineer developing secure CI/CD pipelines for Boeing's military software systems. Collaborate with cross - functional teams and implement automation and security best practices.
DevOps Manager responsible for managing a team for multi - cloud solutions supporting the USAF Cloud One project. Focus on scalable cloud - native solutions and CI/CD practices.