Site Reliability Engineer maintaining Taco Bell's Smarthub technology platform. Troubleshooting store issues and enhancing customer experience through innovative solutions.
Responsibilities
Troubleshoot and analyze store level issues.
Conduct production validation test for deployments.
Document processes, tools, and known solutions.
Participate in problem records troubleshooting bridges.
Communicate findings clearly during issue investigation.
Analyze ingested metrics to identify store or platform level issues.
Implement monitoring and alerting.
Participate in sprint planning, design, operations and deployment meetings.
Serve as SRE liaison for Platform, Service Desk and Proactive teams.
Support vendor NextGen projects and platform upgrades.
Maintain vendors build servers for smarthub in Taco Bell lab.
Validate and coordinate resolutions across teams.
Support existing tools.
Apply technical knowledge and learning to improve the tooling.
Initiate and work on projects that provide value to Engineering, SRE, or SD teams.
Requirements
Bachelor's degree in Computer Science, Engineering, or a related field.
1–3 years of experience in IT, systems engineering, DevOps, or technical support.
Experience with containerized platforms, API/Microservices and software development life cycle.
Practical knowledge working with Linux systems.
Familiarity with observability platforms such as Datadog.
Experience with automation and basic scripting using Bash or Python
Solid understanding of system monitoring principles
Strong analytical and problem-solving abilities
Demonstrated ability to learn rapidly and adapt within fast-paced environments
Strong attention to detail
Demonstrates curiosity and initiative in learning
Communicate effectively with peers and cross-functional teams
Shows ownership and follow-through on assigned tasks+
Benefits
Hybrid work schedule and year-round flex day Friday
Onsite childcare through Bright Horizons
Onsite dining center and game room (yes, there is a Taco Bell inside the building)
Onsite dry cleaning, laundry services, carwash
Onsite gym with fitness classes and personal trainer sessions
Up to 4 weeks of vacation per year plus holidays and time off for volunteering
Tuition reimbursement and education benefits
Generous parental leave for all new parents and adoption assistance program
401(k) with a 6% matching contribution from Yum! Brands with immediate vesting
Comprehensive medical & dental including prescription drug benefits and 100% preventive care
Discounts, free food, swag and… honestly, too many good benefits to name
DevOps Cloud Engineer managing AWS infrastructure for ventx GmbH in a hybrid environment. Support and collaborate with a great team in an innovative consulting firm.
DevOps Engineer supporting IT platform development at K - tronik GmbH. Join a team focused on container - based customer platforms and CI/CD pipeline automation in a hybrid role.
Site Reliability Engineering Specialist ensuring service performance and reliability at BT Group. Driving automation and cloud solutions while mentoring a diverse team of engineers.
DevOps Engineer ensuring stability, reliability, and smooth operation of Kubernetes platforms across UK, Ireland, and Australia. Collaborating with global teams to deliver technical support and improvements.
DevSecOps Architect joining Multiverse's Information Security team to build automation for secure code delivery. Advocate for security practices by collaborating with engineering teams to ensure secure product development.
Mid - senior AWS/DevOps engineer at ispace, inc. responsible for cloud infrastructure design and operations in Tokyo. Requires 5+ years of AWS experience with bilingual Japanese and English skills.
Junior DevOps/SRE responsible for designing Pigment's infrastructure for real - time data management. Working on scalability, performance, and incident response in a collaborative team environment.
DevOps Engineer designing and optimizing modern cloud - native infrastructures for AI/ML workflows. Collaborating with cross - functional teams on AWS, Azure, and GCP platforms.
Cloud/Azure DevOps Developer supporting workflow automation platform at an innovative company in Austria. Working with Azure and DevOps methodologies to enhance and scale the platform.
Senior Site Reliability Engineer focused on building reliable, scalable infrastructure at a tech company. Driving best practices in observability, incident response, and engineering collaboration.