Senior Site Reliability Engineer ensuring reliability and performance of customer-facing systems at Progress. The role involves automation, cloud infrastructure management, and incident response.
Responsibilities
Safeguard systems against breaches and ensure compliance with PCI-DSS, HIPAA, SOC2, and other standards
Build and maintain scalable, secure services in Azure, AWS, or GCP using cloud-native tools
Automate routine tasks, manage backups, and configure servers for high availability and disaster recovery
Implement observability tools, optimize system performance, and proactively address issues
Participate in on-call rotations, troubleshoot service incidents, and lead postmortem reviews
Work closely with developers, QA, and support teams in agile environments
Handle complex customer account setups in coordination with Sales and Professional Services
Champion IaC practices using tools like Terraform, Ansible, Chef, or Puppet
Requirements
Proven experience in a Senior SRE or similar role in production environments
Strong skills in Windows/Linux, scripting (Shell, Python, PowerShell), and cloud services (ECS, EKS, S3, etc.)
Hands-on experience with Kubernetes, Docker, Rancher, and container orchestration
Familiarity with security protocols, network concepts (TCP/IP, HTTP, TLS), and compliance frameworks
Ability to debug, optimize, and automate to reduce operational toil
Excellent communication skills and a proactive, problem-solving mindset
Bachelor’s degree in Computer Science, Information Systems, or related field
Experience with PCI, HIPAA, and SOC2 compliance
Experience with cloud-hosted apps/services (Azure/AWS)
Benefits
Generous remuneration package
Employee Stock Purchase Plan Enrollment
Vacation, Family, and Health
30 days paid annual vacation
An extra day off for your birthday
2 additional days off for volunteering
Premium healthcare and dental care coverage
Additional pension insurance
Well-equipped gym on-site
Co-funded Multisport card
On-site daycare center for your little ones
Flexible working hours and work-from-home allowance
Free underground parking with a designated space for bikes and electric scooters
Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross-functional teams for incident management and performance tuning.
Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
DevOps Engineer contributing to tooling changes and leading a community of practice at Totara. Focused on collaboration, development, and support for internal teams.
Site Reliability Engineer responsible for infrastructure supporting an AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.
Senior Infrastructure Engineer managing Azure platform for a SaaS product at Rillion. Focused on automation, security, reliability, and scalability in a hybrid work environment.
Statistician/Reliability Engineer applying statistical analysis for satellite systems at Aerospace Corporation. Leading projects on system reliability and working closely with interdisciplinary teams in a full-time on-site role.
DevOps Engineer designing and implementing solutions to optimize operations in media technology at Mediagenix. Collaborating with cross-functional teams to enhance user experiences.
DevOps Senior Software Engineer at SimCorp developing high-quality software solutions for financial technology. Responsible for mentoring junior engineers and solving complex technical challenges.
Senior DevOps Engineer at SimCorp managing cloud environments and automating builds using Azure. Collaborating with cross-functional teams to ensure high service availability and compliance.
DevOps Engineer designing, building, and operating software development infrastructure for CodeMettle. Leading automation and best practices to enhance value delivery across teams.