Lead Systems Engineer managing AI platform operations at emerging AI infrastructure start-up. Oversee vendor collaboration, technical troubleshooting, and customer engagement for optimal service delivery.
Responsibilities
Coordinate resolution of complex issues (L3) to (vendor) product/engineering teams and manage vendor responses
Monitor system health, alerts, and customer usage patterns
Document solutions/workarounds, create and maintain knowledge, document support procedures
Automate common tasks and fixes
Configure and integrate tooling to support optimal operation of the platform, and support tool selection
Assist customers with platform configuration, onboarding, and usage best practices
Collaborate with platform and infrastructure support/engineering teams to resolve platform integration issues
Ensure SLAs and customer satisfaction targets are met
L1 support for customer-reported issues and requests
L2 support by diagnosing, replicating, and troubleshooting issues across platform and infrastructure
Work with customers and multiple stakeholders to understand requirements and challenges, provide reporting on usage, workflow and billing
Requirements
Extensive experience in technical support, system engineering, or platform operations
Solid understanding of L1 and L2 support processes (ticketing, escalation, troubleshooting)
Familiarity with cloud-based platforms, APIs, and distributed systems
Understanding of AI/ML concepts and tooling (model training, inference, data pipelines basics)
Experience with monitoring/logging tools (e.g., Grafana, Kibana, Splunk)
Excellent communication skills to interface with both customers and internal / vendor teams
Good understanding of tools requirements for ML engineers and data scientists, and how to optimize the experience
System administration experience with OS's like RHEL/CentOS, Ubuntu, tuning Linux kernel
Proficiency with Ansible, Nvidia and CUDA toolkits, Kubernetes and container orchestration
Understanding of automation, monitoring and security with GPU as a service.
Senior Business Systems Analyst focusing on Oracle Revenue Management Cloud Service. Ensure reliability and scalability of critical systems while collaborating with technical teams.
System Engineer implementing technical customer solutions for Ascom in South of England. Collaborate on - site, manage installations, and ensure customer satisfaction with assigned tasks.
System Engineer Customer Services at Somnitec handling diverse IT support for Swiss clients. Engaging in troubleshooting, monitoring, and enhancing customer satisfaction through excellent service.
Senior Software Architect responsible for designing cloud - native solutions for a global aviation leader. Collaborating with development teams to modernize messaging platforms.
Systems Engineer role at Xcelerate Solutions focusing on software development for DIA - NMEC Technology Platform. Engaging in Agile methodologies and mentoring junior engineers within a collaborative team.
System Engineer specializing in vehicle features and technical specifications. Engage with stakeholders and manage system integrations for automotive solutions in the UK.
Systems Engineer responsible for mechanical requirements management and interface control in complex aerospace projects. Leading technical deliverables and stakeholder engagement for missile systems.
Lead Systems Engineering Value Stream and manage a team of systems engineers at Northrop Grumman. Drive value stream integration for major defense systems development milestones.
Mid - Level Systems Analyst focusing on PABX virtualized support for Enghouse in Brazil. Analyzing incidents and ensuring system stability while collaborating with manufacturers.
Systems Engineer developing cutting edge ISR and aviation solutions at SNC. Researching, modeling, and testing advanced aerospace systems with a cross - functional engineering approach.