Production Support Engineer monitoring and resolving issues in production systems for a market data leader. Collaborating with engineering teams and participating in on-call support.
Responsibilities
Monitor production systems and infrastructure, ensuring uptime and performance metrics are met
Troubleshoot, diagnose, and resolve production issues in real time, minimizing service impact
Manage incident response, including escalation, root cause analysis, and post-mortem reporting
Collaborate with engineering teams to develop and implement monitoring tools, alert systems, and automated recovery processes
Analyze system logs, metrics, and trends to proactively identify potential risks or issues
Execute software deployments, configuration changes, and system upgrades with minimal disruption
Maintain and refine operational runbooks, escalation procedures, and best practices
Drive continuous improvement by identifying areas for process optimization and operational efficiency
Participate in an on-call rotation to provide 24/7 support for production systems
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent work experience
2+ years of experience in a production support, system administration, or monitoring role
Strong technical skills in Linux/Unix environments, with experience in troubleshooting and debugging
Hands-on experience with monitoring tools (e.g., ITRS, Prometheus, Grafana, Splunk) and incident management platforms
Scripting experience (e.g., Python, Bash) to automate monitoring and reporting tasks
Excellent problem-solving and analytical skills, with the ability to work under pressure in a fast-paced environment
Solid understanding of networking, system performance, and application monitoring concepts
Exceptional communication and collaboration skills to coordinate with cross-functional teams effectively