Senior Manager, DevOps responsible for scaling and owning platform operations at FloQast. Collaborating with cross-functional teams and managing DevOps Engineers in a hybrid setting.
Responsibilities
Lead, mentor, and scale a DevOps organization; build career paths and leadership bench
Define and execute the DevOps, reliability, and observability strategy aligned with business goals
Own platform reliability, availability, and performance for a production SaaS platform
Establish and mature observability practices (metrics, logs, traces, alerts, dashboards)
Drive infrastructure initiatives across AWS focused on scalability, resilience, and modernization
Own and mature incident management including on-call, response, executive communication, and postmortems
Oversee day-to-day operational excellence including CI/CD, deployments, and environment health
Set and manage cloud cost strategy, forecasting, and optimization in partnership with Finance
Partner with Security and Compliance on SOC2, SOX, and audit readiness
Support AI/ML and data platform workloads as part of the broader infrastructure strategy
Requirements
10+ years of DevOps / SRE / Infrastructure experience
4+ years managing DevOps or Platform teams
Deep expertise with AWS at scale (multi-account, networking, IAM)
Strong hands-on background with Terraform, Kubernetes, and CI/CD
Proven ownership of incident management and operational maturity
Experience building and operating observability platforms for SaaS systems
Experience with AI/ML or data-intensive platforms
Observability tools such as Datadog, Grafana, Prometheus, OpenTelemetry
Sr. Site Reliability Engineer designing and automating robust technical infrastructure at Broadridge. Collaborating across teams for successful deployment and operational support of services.
Senior Fleet Reliability Engineer maintaining high fleet uptime for autonomous vehicle technology. Collaborating with technical teams to ensure peak operational performance in data collection efforts.
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
SRE Lead developing scalable cloud - native solutions for mission - critical systems supporting USAF. Managing teams, collaborating with cross - functional units, and ensuring high service reliability standards.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI - powered learning platform. Collaborating with cross - functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI - powered learning platform. Involves managing multi - tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.