Site Reliability Engineer at ShiftKey, ensuring the stability and performance of a healthcare management platform. The role combines maintenance and development initiatives, with a proactive approach to preventing incidents.
Responsibilities
Split your time 50/50 between keeping the lights on (maintenance/support) and active development (improving automation and tooling).
Contribute to major roadmap projects, such as the migration of our infrastructure stack to a different AWS region to improve cost and availability.
Participate in our "Sheriff" rotation (approx. 1 week per month during standard office hours), acting as the primary point of contact for ad-hoc infrastructure support so the rest of the team can work without interruptions.
Investigate logs, debug crashing pods, and troubleshoot complex distributed system issues using observability tools, going deeper than just deploying charts.
Use modern AI tools (Claude Code, Cursor) to automate toil and speed up workflows, maintaining a realistic balance between using new tech and relying on proven engineering principles.
Requirements
Hands-on production experience with Terraform (or Pulumi/CloudFormation), managing version-controlled, modular infrastructure through an infrastructure-as-code–first approach.
Ability to spin up Kubernetes pods, debug broken deployments, and manage resources via YAML.
Strong understanding of web application fundamentals (HTTP/S, SSL, GET/POST) to troubleshoot connectivity issues between services effectively.
A programmatic mindset. Whether it’s Python, Go, or Bash, you write code to fix problems rather than fixing them manually every time.
An open but skeptical attitude toward AI. You are willing to use LLMs to be more efficient, but you verify the output and don't use them as a crutch.
Benefits
Additional vacation days for better work-life balance.
Modern office in Warsaw’s Powiśle district with Vistula River views, recreational facilities, and great nearby restaurants.
Thoughtfully designed private medical package to take care of what matters most.
SME DevOps Engineer delivering enhancements for enterprise data and analytics products across DoD organizations. Collaborating with government and industry partners to translate strategic requirements into scalable solutions.
DevOps Engineer designing CI/CD pipelines and managing Azure cloud infrastructure for leading organizations. Collaborating with global teams and automating deployment processes across projects.
Senior DevOps professional at iugu managing system reliability and performance in a dynamic environment. Collaborating with development teams and automating processes for efficiency.
Site Reliability Engineer maintaining the ShiftKey Marketplace platform while ensuring its stability and availability. Collaborating on infrastructure projects and support with a remote-first approach.
Site Reliability Engineer ensuring platform stability and managing AWS migration. Focused on hands-on maintenance work and engineering automation for a healthcare staffing platform.
Site Reliability Engineer maintaining stability and availability of a healthcare staffing platform while collaborating with engineering teams on AWS migration projects.
DevOps Team Lead managing deployment and operations of FedRAMP authorized products at Semperis. Leading a team in a regulated environment, focusing on security and process improvement.
Senior DevOps Engineer responsible for deployment and secure operations of FedRAMP products at Semperis. Focusing on compliance, automation, and collaborating with security teams.
DevOps/IT Apprentice supporting cloud infrastructure and CI/CD pipelines at a tech startup. Involves learning, taking ownership, and growing within the engineering team.