Senior DevOps Engineer managing AWS and Azure cloud infrastructure for a startup SaaS company. Focused on CI/CD, system reliability, and security best practices.
Responsibilities
Design, implement, and own CI/CD pipelines across multiple services to streamline software development and deployment.
Maintain, optimize, and architect cloud infrastructure (AWS, Azure) to ensure scalability, security, reliability, and cost-effectiveness.
Automate infrastructure provisioning, monitoring, and management using Infrastructure as Code (Terraform, Ansible, etc.) with modular, reusable patterns.
Monitor and improve system performance, troubleshoot production issues, and ensure high availability and reliability across environments.
Collaborate with software engineers to enhance deployment strategies and build internal tooling that improves development workflows.
Implement security best practices across infrastructure, networking, identity, and access.
Own or support security and compliance requirements, including SOC 2 controls, documentation, and evidence collection.
Manage and enhance containerization and orchestration tools (experience with any modern orchestration platform; ECS, Kubernetes, or similar).
Optimize logging, monitoring, and alerting systems (ELK stack, Datadog, etc.) to improve visibility and accelerate incident response.
Build and maintain observability tooling, including metrics, logging, and tracing (OpenTelemetry experience is a strong plus).
Optimize cloud resource usage and implement cost-efficient infrastructure practices.
Stay current with the latest DevOps best practices, tools, and industry standards.
Requirements
6–8+ years of experience in DevOps, Site Reliability Engineering (SRE), or Infrastructure Engineering.
Strong proficiency in cloud platforms (AWS required; Azure/GCP a plus) and cloud-native architectures.
Experience designing CI/CD pipelines and deployment workflows end-to-end.
Proficiency with Infrastructure as Code tools (Terraform preferred; CloudFormation or Ansible also welcome).
Strong software development skills, with the ability to write clean, maintainable code in a modern programming language.
Hands-on experience with containerization and orchestration (experience with any major platform; ECS, Kubernetes, or similar).
Strong understanding of security best practices, IAM, networking, system administration, and distributed systems.
Experience supporting or contributing to security and compliance programs, ideally SOC 2 or similar.
Proficiency with monitoring and observability tooling (ELK, Datadog, Prometheus, OpenTelemetry, etc.).
Strong understanding of cloud cost optimization strategies.
Comfortable operating autonomously in an agile, fast-paced startup environment with a high degree of ownership.
Benefits
Medical, Dental, Vision, STD and Life insurance (100% Company-paid for the Employee)
Senior DevOps Engineer responsible for leading CI/CD pipeline design and optimization. Collaborating with teams to drive DevOps maturity across the enterprise while managing infrastructure automation.
Cloud Operations Engineer ensuring reliable performance of cloud systems at 2Innovate. Focused on automation, incident management, cloud security, and infrastructure monitoring in cloud environments.
AWS DevOps Engineer responsible for delivering scalable digital experiences for EXL's MarTech ecosystem. Engaging in development, maintenance, and collaboration across stakeholders and services.
Senior Site Reliability Engineer managing critical infrastructure at Hornetsecurity. Collaborating with product teams to ensure performance and reliability across services.
Site Reliability Engineer enhancing platform reliability for AI workflows at WRITER. Overseeing automated solutions and cloud infrastructure supporting high - trafficked AI systems.
Site reliability engineer ensuring 24/7 availability of AI - powered workflows at WRITER. Developing and automating robust platforms for high - traffic AI demands.
Site Reliability Engineer maintaining cloud infrastructure for Tricentis SaaS Products. Collaborating closely with engineers, focusing on observability and performance.