Production Support Engineer II providing support for business-critical systems while ensuring operational stability. Resolving incidents, maintaining system health, and collaborating with engineering teams.
Responsibilities
Provide day-to-day support for business-critical systems, ensuring operational stability.
Resolve lower to medium-priority incidents and maintain system health.
Support the improvement of production environments through collaboration with senior engineers and cross-functional teams.
Identify, troubleshoot, and resolve lower to medium-priority technical issues with guidance from senior engineers.
Support day-to-day monitoring of system performance and use monitoring tools to detect anomalies and take corrective actions.
Collaborate with cross-functional teams to resolve technical incidents and escalate higher-complexity issues to senior engineers as needed.
Assist in automating routine production support tasks by developing or modifying scripts and tools.
Maintain documentation for production issues, troubleshooting steps, and system configurations, contributing to the shared knowledge base.
Participate in incident, problem, and change management processes, following ITIL best practices.
Perform root cause analysis for recurring issues and assist senior engineers in implementing permanent fixes to improve system stability.
Support the implementation of process improvements to enhance system performance and minimize downtime.
Assist with mentoring and supporting junior-level engineers, providing guidance as needed.
Requirements
Bachelor’s degree in Computer Science, Information Systems, Engineering, or a related field.
Four to eight years of experience in production support, systems engineering, database engineering or related technical roles.
Experience with IT Service Management (ITSM) tools such as ServiceNow with solid understanding of incident, problem, and change management processes.
Proficiency in using monitoring tools like Splunk, Dynatrace, or CloudWatch to detect and resolve system performance issues.
Strong analytical and problem-solving skills, with the ability to assist in root cause analysis and incident resolution.
Ability to work independently on lower-to-medium priority incidents and escalate complex issues when necessary.
Sr. Staff Production Engineer at Zscaler implementing scalable multi - cloud infrastructure. Leading automation efforts and managing incident responses within a global platform.
Production Engineer ensuring availability of applications in distributed environments for Consort Group. Collaborating on technical projects and maintaining operational quality across services.
Site Reliability Engineer ensuring stability and security for ShiftKey’s Marketplace platform while executing AWS migration. Blends maintenance with engineering in a collaborative environment.
Production Engineer designing customer - oriented manufacturing concepts at Festo. Responsibilities include process development, documentation review, and collaboration with international teams.
Experienced Production Engineer supporting quality - critical processes and collaborating with teams to ensure high - quality pen needles. Engaging in stable operations and improvements within a 2 - year temporary contract.
Production Support Engineer ensuring system stability and reliability for Manulife's critical services. Collaborative role bridging development and infrastructure, providing seamless service for customers.
Senior Production Engineer (SRE) at Legion building and operating a secure AWS/Kubernetes platform. Focused on automation, reliability, and infrastructure as code.
Production Engineer managing database operations at Palantir, ensuring reliability and availability of data systems. Involved in architecture, design, and maintenance of production databases in various environments.
Production Engineer PCB managing first - line technical support for PCB assembly processes. Assisting with product introduction and implementing process improvements in a leading transport solutions company.