Senior Site Reliability Engineer contributing to AWS cloud initiatives and enhancing the Kubernetes developer platform at Patreon. Collaborating within a high-performing team to ensure reliability and scalability.
Responsibilities
Contribute to AWS cloud infrastructure initiatives to improve performance, reliability, and cost efficiency
Participate in production readiness reviews covering scalability, resiliency, and operability
Advocate and implement Site Reliability Engineering practices across the organization
Enhance the feature set of the new Kubernetes developer platform and assist with workload migration
Develop tooling and automation to facilitate self-service for constituent teams
Support and maintain critical infrastructure components, including the infrastructure-as-code project and the observability stack
Requirements
Experience in DevOps, Site Reliability, or backend/infrastructure engineering for a company experiencing fast-paced growth
Proficiency in a programming language such as Python, and in shell scripting
Hands-on experience implementing Site Reliability Engineering practices (SLIs, SLOs, SLAs) and using metrics to make data-driven decisions
Knowledgeable in configuration management with a framework such as Terraform, Ansible, Chef, or Puppet
Experience with continuous integration and deployment systems, with ideas for building and improving them
Excellent documentation and verbal communication skills, with the ability to collaborate and rally support among team members
Productive habits, healthy process awareness, and good teamwork skills and instincts
Bachelor’s degree in Computer Science, Computer Engineering, or related field, or equivalent work experience
Benefits
Competitive benefits package including salary, equity plans, healthcare, flexible time off, company holidays and recharge days, commuter benefits, lifestyle stipends, learning and development stipends, patronage, parental leave, and a 401(k) plan with matching