Maintenance Engineer ensuring reliability of production AI systems at Luminai. Monitoring, diagnosing, and improving AI workflows for critical organizational processes.
Responsibilities
Monitor, maintain, and improve the reliability of production AI systems and workflow infrastructure
Proactively identify, diagnose, and resolve system issues across application, integration, and cloud infrastructure layers
Own incident response processes, including root cause analysis and long-term remediation
Implement monitoring, alerting, and observability tooling to ensure system health and uptime
Collaborate with Engineering to harden deployments and improve system architecture for resilience and scalability
Support customer-facing teams by troubleshooting and resolving technical issues in live environments
Document system configurations, operational procedures, and recovery protocols
Continuously improve reliability standards, deployment practices, and operational safeguards
Requirements
3+ years of experience in support engineering, site reliability engineering, or infrastructure maintenance
Strong proficiency in Python or scripting languages
Experience managing cloud infrastructure (AWS, GCP, or Azure)
Strong problem-solving skills and a proactive, preventative mindset
Clear communication skills and ability to collaborate across engineering and customer-facing teams
High ownership and accountability in high-reliability environments
Reverse Engineer at Teller building APIs for connecting apps to users' financial accounts. Help crack mobile banking applications for seamless bank integrations.
Project Engineer supporting construction project teams at Fessler & Bowman. Assisting with project planning, scheduling, and management across multiple construction sites.
Lead Engineer developing AI - powered features for FIS’s cloud - based financial platform, collaborating with teams and mentoring junior engineers for architectural excellence.
Controls Engineer designing and maintaining control systems for manufacturing equipment. Involved in troubleshooting and onsite servicing for optimal operations.
Tier III VTC Engineer providing technical expertise for AT&T at customer site in Virginia. Responsible for video teleconferencing troubleshooting, installation, and design at various locations.
Lead Knowledge Engineer at S&P Global driving data transformation initiatives. Collaborating with technology teams to implement next - generation data architecture and knowledge management solutions.
Part 21 Electrical / Avionics Engineer at Boeing responsible for compliance with regulatory requirements. Supporting certification of modifications for global airline partners and collaborating with engineering teams.
Engineer designing, developing, and testing nuclear equipment and systems for Navy ships at Newport News Shipbuilding. Collaborating on safety, efficiency, and performance improvements while conducting relevant research and analysis.
Senior Forward Deployed Engineer embedding in strategic aviation operations to drive measurable impact. Working with airlines and MROs while ensuring successful adoption of AI - driven solutions and product enhancements.
Project Water Engineer at Arcadis delivering design solutions for water, wastewater, and reuse clients in California. Evaluate, plan, and design projects while supporting management and collaborating with teams.