Senior DevOps Engineer building and operating developer platforms for reliable production shipping at Demandbase in Hyderabad. Focused on improving developer experience and cloud infrastructure.
Responsibilities
Build and operate the platforms, tooling, and workflows that enable engineers to ship reliably to production.
Partner with software, data, and security engineering teams to identify friction across the software delivery lifecycle and address it through automation, platform abstractions, and improved workflows.
Design and evolve developer-facing platforms and tooling that standardize how services and pipelines are built, deployed, and operated.
Enable self-service workflows with opinionated defaults that improve reliability, security, and consistency without slowing teams down.
Use developer feedback, operational data, and production signals to prioritize and drive the DevEx roadmap.
Design, build, and maintain CI/CD orchestration that supports high release velocity, strong security guardrails, and local-to-production parity, preferably using GitLab CI/CD.
Standardize build, test, and deployment patterns across application and data workloads.
Support modern deployment strategies and GitOps-based workflows.
Build, operate, and evolve Kubernetes-based platforms across AWS and GCP, including EKS and GKE.
Enable teams to run workloads on Kubernetes by providing clear operational guardrails, platform defaults, and documented best practices.
Manage multi-account cloud environments with a focus on security, scalability, and ease of use.
Design and maintain infrastructure using Infrastructure as Code, including Terraform and Crossplane.
Build and operate internal platform components such as GitOps tooling, secret management systems, and service mesh infrastructure.
Operate and evolve observability platforms (e.g., Prometheus, Mimir, Thanos, Grafana, Datadog) to provide actionable signals for platform and application teams.
Define and apply SLIs, SLOs, alerting strategies, and incident response practices.
Lead and participate in blameless post-mortems, translating learnings into platform improvements and reduced operational toil.
Support engineering teams running data pipelines and batch workloads on platforms such as Airflow, EMR, and Dataproc.
Standardize deployment, observability, and operational patterns for data workloads.
Improve reliability and operability of data platforms through shared tooling and best practices.
Serve as a technical leader within DevEx, promoting best practices in platform engineering, reliability, and secure software delivery.
Mentor engineers and influence teams through strong technical design, documentation, and collaboration.
Drive adoption of internal platforms through strong defaults, clear documentation, and self-service tooling.
Requirements
8+ years of overall engineering experience, including hands-on software development and cloud infrastructure ownership.
Strong software engineering fundamentals with experience in at least one general-purpose programming language (e.g., Go, Python, Java).
5+ years of experience building and operating cloud infrastructure on AWS and/or GCP at scale.
Proven experience managing multi-account cloud environments, including IAM, networking, and security best practices.
Strong proficiency with Infrastructure as Code, preferably Terraform and Crossplane.
Extensive experience operating Kubernetes platforms in production, including EKS and/or GKE.
Experience managing multiple Kubernetes clusters, including upgrades, networking, and security.
Hands-on experience with service mesh technologies such as Istio in multi-cluster environments.
Deep experience designing and operating CI/CD systems that support high release velocity, preferably GitLab CI/CD.
Experience building developer-facing tooling that improves local-to-production parity and reduces cognitive load.
Familiarity with GitOps practices and modern deployment strategies.
Experience supporting data platforms such as Airflow, EMR, and Dataproc.
Strong experience building and operating observability platforms including Prometheus, Mimir, Thanos, Grafana, and Datadog.
Solid understanding of SLIs, SLOs, alerting, and incident response.
Demonstrated ability to partner with engineering teams to identify pain points and improve developer experience.
Strong communication skills, including experience participating in or leading blameless post-mortems.
Benefits
Group Medical
Personal Accident
Term Life Insurance
Preventive healthcare including dental, vision, and OPD needs
DevOps Engineer responsible for maintaining and optimizing infrastructure at Tenet3. Focused on security, automation, and technical operations within a collaborative team environment.
Site Reliability Engineer II at LexisNexis Risk Solutions building Terraform modules and CI/CD pipelines. Responsible for developing cloud infrastructure and ensuring reliability, security, and observability.
Journeyman Cloud Operations Engineer maintaining cloud infrastructure across DoD organizations. Supporting DevSecOps and ensuring compliance with security requirements in a high - visibility program.
DevOps Engineer supporting cloud modernization for the Department of the Air Force on the Cloud One contract. Involved in systems analysis, security practices, and collaboration with engineering teams.
DevOps Engineer managing cloud - native platforms for Capgemini. Collaborating with development, data/ML, and security teams to deliver scalable solutions on Azure.
Head of IT & DevSecOps at JamLoop, managing internal technology and security improvements. Leading strategy and implementation of cloud infrastructure for efficiency and reliability.
I&E Maintenance and Reliability Engineer at LyondellBasell focused on asset maintenance strategies in a multidisciplinary environment. Collaborating for operational excellence and safety performance at the Pasadena facility.
Manager, DevOps & Cloud Infrastructure overseeing security and operational efficiency in a hybrid environment at Thomson Reuters. Leading teams to deliver secure solutions in on - premises and cloud setups.
DevOps Engineer responsible for building and maintaining the infrastructure of IONOS' AI platform. Collaborating on CI/CD pipelines and ensuring system optimization across various locations.
DevOps Engineer building and supporting cloud infrastructure at PointClickCare. Collaborate with senior engineers and software teams to enhance AI - enabled workloads and improve system reliability.