Maintenance Engineer ensuring reliability of production AI systems at Luminai. Monitoring, diagnosing, and improving AI workflows for critical organizational processes.
Responsibilities
Monitor, maintain, and improve the reliability of production AI systems and workflow infrastructure
Proactively identify, diagnose, and resolve system issues across application, integration, and cloud infrastructure layers
Own incident response processes, including root cause analysis and long-term remediation
Implement monitoring, alerting, and observability tooling to ensure system health and uptime
Collaborate with Engineering to harden deployments and improve system architecture for resilience and scalability
Support customer-facing teams by troubleshooting and resolving technical issues in live environments
Document system configurations, operational procedures, and recovery protocols
Continuously improve reliability standards, deployment practices, and operational safeguards
Requirements
3+ years of experience in support engineering, site reliability engineering, or infrastructure maintenance
Strong proficiency in Python or scripting languages
Experience managing cloud infrastructure (AWS, GCP, or Azure)
Strong problem-solving skills and a proactive, preventative mindset
Clear communication skills and ability to collaborate across engineering and customer-facing teams
High ownership and accountability in high-reliability environments
Regional Plant Engineer managing network designs and ensuring compliance with engineering standards. Collaborating across teams for effective construction and maintenance of fiber and broadband networks.
Senior Process Engineer managing commercialization and capital projects in food and beverage manufacturing. Utilizing engineering expertise to optimize processes and mentor junior team members.
Senior Manufacturing Engineer at Emerald Technologies, a growing PCBA manufacturer. Leading process improvements and technical excellence across manufacturing operations in Brea, CA.
Supplier Quality Management Engineer overseeing supplier production for micromobility vehicles. Collaborating with European teams to ensure quality parts and seamless operations.
Senior Release Train Engineer leading large and complex Agile Release Trains at Cox Automotive. Ensuring successful delivery across diverse environments while driving continuous improvement.
Compiler Engineer developing syntax and semantic processing for Intel's Fortran compiler. Collaborating with a team to enhance compiler features and addressing customer inquiries.
Cloud & HPC engineer optimizing benchmark and application codes for SiPearl processors. Contributing to collaborative projects in performance analysis and R&D with various partners.
Journey - level protection engineer providing technical support for electric system protection at PG&E. Responsibilities include guidance to peers and handling complex protection projects in hybrid setting.
Engineer 1st Line at Vodafone involved in connectivity and sustainability efforts. Joining a diverse team to innovate and impact communities positively.
Senior Traffic Signals Design Engineer providing effective support in traffic signals schemes. Leading a team and delivering engineering solutions for transport infrastructure across the UK.