Site Reliability Engineer building and scaling cloud infrastructure at fintech startup Rainforest. Owning systems from infrastructure design to production reliability in fast-moving environments.
Responsibilities
Owning and scaling Rainforest’s Amazon Web Services (AWS)-based cloud infrastructure using Terraform and infrastructure-as-code (IaC) orchestration
Building, operating, and continuously improving Elastic Kubernetes Service (EKS) and serverless environments that support our core payments services
Designing and maintaining modern CI/CD pipelines with GitLab to enable fast, safe deployments
Implementing and evolving monitoring, alerting, and observability with tools like OpenTelemetry, Prometheus, and New Relic to ensure high uptime and quick incident resolution
Automating infrastructure and operational processes to eliminate manual work and accelerate delivery
Working side-by-side with application engineers to improve system performance, reliability, and scalability
Leading incident response efforts, conducting postmortems, and driving continuous improvement
Helping to define and roll out SRE best practices, including SLIs, SLOs, and error budgets as the company scales
Optimizing for cost, security, and compliance in a regulated fintech environment
Supporting and scaling PostgreSQL database infrastructure using Amazon RDS offerings (Aurora Global Database)
Requirements
3+ years of experience in SRE, DevOps, or cloud infrastructure roles (startup or high-growth experience a plus)
Passion for building reliable systems that scale with the business
Strong hands-on experience with cloud infrastructure (AWS, Google Cloud, Azure)
Deep experience with IaC using tools such as Terraform, OpenTofu, Terragrunt, and CloudFormation
Solid production experience with container orchestration (Kubernetes, ECS)
Experience building CI/CD pipelines using tools like GitLab and GitHub Actions
Strong understanding of monitoring and observability principles, with experience designing and delivering dashboards, visualizations, and alerts
Proficiency in at least one modern programming language (e.g., Python, Java, Go, or Ruby)
Bachelor’s degree in Information Science, Computer Science, or a related discipline, or equivalent work experience, is preferred
DevOps Lead at Leidos managing platform engineering, SRE, and application security functions. Driving operational excellence and ensuring scalability for federal government applications.
SRE Lead developing scalable cloud-native solutions for mission-critical systems supporting USAF. Managing teams, collaborating with cross-functional units, and ensuring high service reliability standards.
Junior DevOps / Platform Engineer at DieEnergiekoppler GmbH managing AWS/EKS platform operations. Collaborating with team members to improve platform functionalities and security compliance.
DevOps Engineer responsible for AWS infrastructures and backend development at Allguth GmbH. Engaging in greenfield projects with modern solutions in a collaborative team.
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI-powered learning platform. Collaborating with cross-functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI-powered learning platform. Involves managing multi-tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for a cybersecurity solutions provider. Collaborating with cross-functional teams for reliable and scalable cloud solutions.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes at NordLayer. Collaborating with Senior Engineers to implement best practices in a dynamic cybersecurity environment.