Site Reliability Engineer ensuring platform stability and managing an AWS migration. Focused on hands-on maintenance work and engineering automation for a healthcare staffing platform.
Responsibilities
Split your time 50/50 between keeping the lights on (maintenance/support) and active development (improving automation and tooling).
Contribute to major roadmap projects, such as the migration of our infrastructure stack to a different AWS region to improve cost and availability.
Participate in our "Sheriff" rotation (approx. 1 week per month during standard office hours), acting as the primary point of contact for ad-hoc infrastructure support so the rest of the team can work without interruptions.
Investigate logs, debug crashing pods, and troubleshoot complex distributed system issues using observability tools, going deeper than just deploying charts.
Use modern AI tools (Claude Code, Cursor) to automate toil and speed up workflows, maintaining a realistic balance between using new tech and relying on proven engineering principles.
Requirements
Hands-on production experience with Terraform (or Pulumi/CloudFormation), managing version-controlled, modular infrastructure through an infrastructure-as-code–first approach.
Ability to spin up pods, debug broken deployments, and manage resources via YAML. We don't need you to architect a cluster from scratch on Day 1, but you must be comfortable operating inside one.
Strong understanding of web application fundamentals (HTTP/S, SSL, GET/POST) to troubleshoot connectivity issues between services effectively.
A programmatic mindset. Whether it’s Python, Go, or Bash, you write code to fix problems rather than fixing them manually every time.
An open but skeptical attitude toward AI. You are willing to use LLMs to be more efficient, but you verify the output and don't use them as a crutch.
Benefits
Additional vacation days for better work-life balance.
Modern office in Warsaw’s Powiśle district with Vistula River views, recreational facilities, and great nearby restaurants.
Thoughtfully designed private medical package to take care of what matters most.