Site Reliability Engineer developing and supporting systems for diagnosing issues in Comcast's network. Collaborating with software developers and managing the product lifecycle from development to deployment.
Responsibilities
Develop solutions for a wide range of difficult applications, problems or procedures.
Interpret internal/external business issues and recommend complete solutions based on best practices and proven technologies.
Work with members of cross-functional teams, third-party vendors, company product managers, and marketing teams to deliver quality products that meet defined requirements in a timely fashion.
Provide technical leadership and mentorship.
Diligently record and document development and production support activities and tasks in our ticketing tool.
Ensure that project requests are properly accepted into the SRE engineering team, are worked in a timely and efficient manner, are of high quality, and smoothly follow the DevOps life cycle – continuous innovation, feedback, and improvement.
Deploy new systems and software and conduct appropriate testing to ensure successful deployment.
Requirements
Experience with cloud providers and configuring infrastructure (AWS)
Kubernetes
Experience with configuration management (CM) tools such as Terraform and Ansible
Docker
Monitoring systems (Prometheus/AlertManager/Grafana)
Git
Experience with CI/CD tools (ECS/ECR)
Scripting experience with Bash and Python 3
Experience troubleshooting applications and networking (Java, Angular, VPCs, firewalls, etc.)
Understanding of distributed systems and how the pieces fit together.
Benefits
Medical & Dental
401(k) Savings Plan
Generous paid time off
Life Milestones - With adoption assistance, childcare resources, pet insurance, and more, Comcast supports you at all life stages.
Courtesy Services - We offer all of our full-time employees in serviceable areas free digital TV and internet.
Discount tickets for Universal Resorts, including theme park tickets and onsite hotel rooms.