Site Reliability Engineer developing and supporting systems for diagnosing issues in Comcast's network. Collaborating with software developers and managing the product lifecycle from development to deployment.
Responsibilities
Develop solutions for a wide range of difficult applications, problems or procedures.
Interpret internal/external business issues and recommend complete solutions based on best practices and proven technologies.
Work with members of cross-functional teams, third party vendors, company product managers, and marketing teams to deliver quality products in a timely fashion that meet defined requirements.
Provide technical leadership and mentorship.
Diligent about recording/documenting development and production support activities and tasks in our ticketing tool.
Ensure that project requests are properly accepted into the SRE engineering team, are worked in a timely and efficient manner, are of high quality, and smoothly follow the DevOps life cycle – continuous innovation, feedback, and improvement.
Deploy new systems and software and conduct appropriate testing to ensure successful deployment.
Requirements
Experience with Cloud Providers and configuring Infrastructure AWS
Kubernetes
Experience with CM Tools, such as Terraform and Ansible
Docker
Monitoring systems (Prometheus/AlertManager/Grafana)
Git
Experience with CI/CD Tools ECS/ECR
Scripting experience with bash and python 3
Experience troubleshooting applications and networking (Java, Angular, VPC’s Firewalls etc)
Understanding distributed systems and how the pieces fit together.
Benefits
Medical & Dental
401(k) Savings Plan
Generous paid time off
Life Milestones - from adoption assistance, childcare resources, pet insurance, and more, Comcast supports you at all life stages.
Courtesy Services - We offer all of our full-time employees in serviceable areas free digital TV and internet.
Discount tickets for Universal Resorts, including theme park tickets and onsite hotel rooms.
Senior Site Reliability Engineer developing IT infrastructure and automation solutions for Coinbase. Collaborating with Infrastructure, security, and compliance teams to enhance operational efficiency.
DevOps Engineer joining AI and Innovation team to ensure scalable, secure, and resilient systems at global media agency. Collaborating with UX and AI engineers for next - generation media experiences.
Site Reliability Engineer at HPE ensuring high availability and performance of cloud infrastructure across AWS and GCP environments. Managing incidents, monitoring systems, and supporting multi - cloud production.
Senior SRE/DevOps managing cloud architecture, driving automation, and ensuring operational reliability at Extensiv. Collaborating with teams to design scalable systems on AWS.
Site Reliability Engineer responsible for architecting cloud infrastructure and containerized platforms at Vista Global. Implementing CI/CD pipelines and mentoring teams on best practices for production environments.
Site Reliability Engineer supporting Vista Global’s production environments and cloud infrastructure. Delivering solutions using AWS, Terraform, Ansible, Docker, and Kubernetes in a hybrid model.
Senior DevOps Engineer focused on network automation and cloud infrastructure at Tiger Analytics. Building scalable solutions for multiple Fortune 500 companies and ensuring high availability and performance.
Software Developer developing software solutions for BASF's digitalization of gas treatment services. Design and implement new features while managing infrastructure using IaC in a global team setting.
DevOps Engineer developing global monitoring and observability platform for Organon, enhancing API integrations and ensuring system compliance. Collaborating across teams to optimize performance and security.
DevOps Engineer working in Iasi at Ness Digital Engineering, managing cloud environments and deploying software releases in a dynamic team. Responsibilities include monitoring, testing, and debugging systems.