Senior DevOps Engineer developing core infrastructure supporting Shelf products. Focused on building reliable, secure, and scalable systems in hybrid work environment.
Responsibilities
Develop reusable components, improve system performance, and create scalable abstractions.
Maintain high standards for reliability and security in your work and in the systems used by other teams.
Manage everything from Terraform/OpenTofu modules and CI/CD pipelines to SSO permissions and observability tools.
Requirements
Write and maintain infrastructure as code in OpenTofu, making modules more reusable and robust.
Write clear runbooks and playbooks that explain how things work and what to do when they break.
Care deeply about the health of our infrastructure by keeping databases, LLMs, and third-party self-hosted services on current versions.
Participate in on call rotations and incident response, and write clear postmortems with concrete action items.
Treat CI/CD pipelines as a critical product. Own and improve hundreds of pipelines.
Become a Datadog and observability expert, tuning logging, metrics, tracing, dashboards, and alerts.
Make thoughtful build vs buy decisions and work directly with vendors to solve infrastructure problems.
Implement and enforce SOC 2 aligned policies for infrastructure and deployments.
Benefits
B2B contract
Company Stock Options
Hardware: MacBook Pro
Modern technical stack. Develop open-source software
Premier AI development environment: GitHub Copilot, Claude Code, OpenAI, TypingMind, v0, MCP Servers, plus credits to experiment with emerging AI tools
Cloud/Kubernetes Engineer supporting hybrid infrastructure across AWS and on - premise Kubernetes environments. Automating tasks and managing production reliability, security, and scalability.
AWS Infrastructure DevOps Engineer at Growth Acceleration Partners supporting AWS environments and infrastructure automation. Focused on reliability, security, and operational efficiency across production environments.
Site Reliability Engineer driving innovation and automation for Banking Solutions and Payments. Collaborating with teams to ensure application performance and reliability in a dynamic environment.
Mainframe SRE working on critical payment systems for fintech, ensuring stability and security. Collaborating with teams to perform root cause analysis and automate processes.
DevOps Engineer responsible for cloud product delivery, platform reliability, and using AI tools in DevOps workflows. Building CI/CD pipelines and optimizing container workloads for security and performance.
Senior DevOps Engineer for Paysafe, designing and deploying AWS applications and infrastructure. Collaborating on cloud environments and improving processes for scalable solutions.
Senior Site Reliability Engineer at Broadridge managing infrastructure design and operational support. Collaborating with teams to improve automation, performance, and reliability of services in a hybrid environment.
DevSecOps Engineer building and maintaining Azure DevOps cloud applications with API backend. Roles include developing CI/CD pipeline and automating backend tasks.
Reliability Engineer applying technical knowledge to enhance process and asset reliability. Partnering with teams to implement reliability excellence activities and predictive maintenance programs.
Reliability Engineer II at Cargill applying technical expertise to enhance process and asset reliability. Collaborating with teams to execute engineering strategies for equipment optimization in a salt mine setting.