Senior Site Reliability Engineer developing IT infrastructure and automation solutions for Coinbase. Collaborating with Infrastructure, security, and compliance teams to enhance operational efficiency.
Responsibilities
Partner with the Coinbase Infrastructure team to support and extend existing ci/cd frameworks to support IT services, including enterprise network platforms
Partner with security and compliance to build surveillance tooling into deployment pipelines
Design and implement automation to streamline overall operational IT support workflows
Action Kubernetes deployment, implementation, and support
Build a technological roadmap based on product requirements
Participate in on-call to support the AWS service deployment pipeline
Promote DevSecOps mentality and establish best practices to ensure top-tier cloud security
Set and maintain a standard of excellence for technical documentation across IT engineering
Participate in an operational environment with strict SLAs and managed incident response and disaster recovery strategies
Facilitate incident response, conduct root cause analysis and blameless retrospectives
Define metrics and design/implement automation opportunities based on monitoring/observability
Developing and maintaining integrations with other systems, such as source control and build systems
Troubleshooting and resolving technical issues with internal toolings
Requirements
At least 8 years experience supporting network infrastructure
At least 8 years experience automating cloud infrastructure
Proficient in at least one scripting languages (Bash, python, Ruby, Go, etc)
Proficiency with version control using CI/CD (Git)
Strong experience supporting AWS services and CI/CD workflows using terraform or equivalent framework
Strong experience with configuration management systems like Terraform, Ansible, Chef, Puppet, or Salt
Strong experience with containers and containers orchestration like Docker and Kubernetes
Benefits
medical
dental
vision
401(k)
Job title
Senior Site Reliability Engineer, IT Infrastructure
Site Reliability Engineer responsible for system reliability and performance at a leading financial services technology company. Collaborating with infrastructure, engineering, and security teams to build robust systems.
Principal Release Engineer leading and orchestrating end - to - end release management at F5. Driving cross - platform coordination and ensuring seamless releases across enterprise transformation programs.
Site Reliability Engineer focused on developing and improving Kubernetes configurations for F5's infrastructure. Collaborating with product teams and ensuring operational delivery processes are efficient and reliable.
Sr DevOps Manager leading the way in Cloud infrastructure, DevOps, and SRE practices at F5. Empowering engineers and fostering a culture of collaboration and improvement.
DevOps Engineer joining AI and Innovation team to ensure scalable, secure, and resilient systems at global media agency. Collaborating with UX and AI engineers for next - generation media experiences.
Site Reliability Engineer at HPE ensuring high availability and performance of cloud infrastructure across AWS and GCP environments. Managing incidents, monitoring systems, and supporting multi - cloud production.
Senior SRE/DevOps managing cloud architecture, driving automation, and ensuring operational reliability at Extensiv. Collaborating with teams to design scalable systems on AWS.
Site Reliability Engineer supporting Vista Global’s production environments and cloud infrastructure. Delivering solutions using AWS, Terraform, Ansible, Docker, and Kubernetes in a hybrid model.
Site Reliability Engineer responsible for architecting cloud infrastructure and containerized platforms at Vista Global. Implementing CI/CD pipelines and mentoring teams on best practices for production environments.
Senior DevOps Engineer focused on network automation and cloud infrastructure at Tiger Analytics. Building scalable solutions for multiple Fortune 500 companies and ensuring high availability and performance.