Senior Site Reliability Engineer contributing to AWS cloud initiatives and enhancing the Kubernetes developer platform at Patreon. Collaborating within a high-performing team to ensure reliability and scalability.
Responsibilities
Contribute to AWS cloud infrastructure initiatives to improve performance, reliability, and cost efficiency
Participate in operability and production readiness reviews for scalability, resiliency, and operability
Advocate and implement Site Reliability Engineering practices across the organization
Enhance the feature set of the new Kubernetes developer platform and assist with workload migration
Develop tooling and automation to facilitate self-service for constituent teams
Support and maintain critical infrastructure components, including the infrastructure-as-code project and the observability stack
Requirements
Experience in DevOps, Site Reliability, or backend/infrastructure engineering for a company experiencing fast-paced growth
Proficiency with a programming language like Python and shell scripting
Hands-on experience implementing Site Reliability Engineering practices (SLIs, SLOs, SLAs) and using metrics to make data-driven decisions
Knowledge of configuration management with a framework such as Terraform, Ansible, Chef, or Puppet
Experience with continuous integration and deployment systems, and ideas for building and improving them
Excellent documentation and verbal communication skills, with the ability to collaborate and rally support among team members
Productive habits, healthy process awareness, and good teamwork skills and instincts
Bachelor’s degree in Computer Science, Computer Engineering, or related field, or equivalent work experience
Benefits
Competitive benefits package including salary, equity plans, healthcare, flexible time off, company holidays and recharge days, commuter benefits, lifestyle stipends, learning and development stipends, patronage, parental leave, and a 401(k) plan with matching
Build and scale cloud infrastructure that powers Heidi's healthcare AI platform. Work with AWS and Azure while enhancing automation and reliability in an innovative healthtech startup.
Infrastructure-as-Code DevOps Engineer designing and managing cloud-native platforms at Vodafone. Collaborating with agile teams for digital transformation and business success.
Director of Data Engineering leading a strategic DevOps team within Enterprise AI. Balancing leadership with hands-on expertise to enable AI technology adoption.
Join a Data Engineering Team as a Senior DevOps to support multiple Data & AI initiatives. Utilize cloud technologies and enhance data pipelines in a collaborative environment.
Principal Site Reliability Engineer at Early Warning designing performance and resiliency patterns for applications and infrastructure. Collaborating with development teams to improve systems and data integrity.
DevOps Engineer contributing to CI/CD setup and Azure services management. Collaborates with teams to ensure efficient project delivery in a hybrid environment.
IT DevOps Specialist at BMW responsible for analyzing requirements and implementing software solutions in AWS cloud environments. Collaborating internationally within agile teams for digital transformation projects.
DevOps Engineer at Vistra designing, implementing, and maintaining robust CI/CD pipelines and cloud infrastructure. Enabling software delivery across multiple technology stacks with a focus on AWS.
Manage complex customer rollouts and initial system deployments at Talex.ai. Bridging technical development with real-world application in robotics and AI systems.
Cloud Operations Engineer designing and implementing highly reliable cloud solutions. Leading cloud infrastructure initiatives for production operations and customer success in a growing team.