Senior SRE/DevOps managing cloud architecture, driving automation, and ensuring operational reliability at Extensiv. Collaborating with teams to design scalable systems on AWS.
Responsibilities
Architect and manage cloud infrastructure and application services on AWS, ensuring scalability, performance, and security.
Implement and maintain containerization strategies using Docker and AWS Fargate.
Develop and maintain automation scripts in PowerShell and Python to streamline operations.
Utilize IaC tools such as Packer and Terraform for efficient infrastructure provisioning and management.
Optimize and maintain SQL servers and databases for performance and reliability.
Collaborate with development teams to incorporate DevOps practices into the software development lifecycle.
Monitor system performance, troubleshoot issues, and implement solutions to ensure high availability and disaster recovery.
Conduct system audits, improve security postures, and ensure compliance with industry standards.
Requirements
Bachelor's degree or equivalent experience
5+ years of experience in an SRE/DevOps role with a strong background in AWS cloud services
Proficient in scripting with Python
Proficiency in one or more shells like Bash or PowerShell
Extensive experience with IaC tools, specifically Terraform and Serverless Framework
Understanding of SQL and experience managing relational databases
Must understand how statistics and indexes relate to performance troubleshooting
Knowledge (not all required)
Strong analytical and troubleshooting skills, with a proactive approach to problem-solving