Technical Customer Success Engineer at Baseten ensuring smooth deployment and performance of ML workloads for key enterprise accounts. Collaborating with product and engineering teams to enhance customer success.
Responsibilities
Diagnose and resolve runtime issues related to latency, memory behavior, GPU utilization, concurrency, and model lifecycle management.
Debug infrastructure issues across Kubernetes (pods, controllers), networking, observability, and alerting systems.
Lead incident response during outages or escalations, managing coordination between Product, FDE, Sales, and Engineering.
Serve as the technical owner for top enterprise accounts with strict SLAs and high responsiveness expectations.
Identify common failure modes and translate user feedback into roadmap signals, product improvements, our internal runbooks, knowledge bases, and diagnostic best practices.
Own project coordination end-to-end: scoping, execution, communication, and stakeholder alignment across technical and non-technical teams ranging from feature requests, new deployments, and operational debugging issues.
Requirements
Deep Kubernetes troubleshooting expertise, including advanced resource debugging, pod/runtime analysis, and log-based diagnostics using observability tooling such as Grafana, Loki, and Prometheus.
Strong infrastructure debugging ability across container orchestration, networking, and service dependencies, with hands-on experience supporting production-grade clusters.
Experience managing high-severity incidents with major customers, including SLAs, post-incident reviews, and clear communication throughout escalations.
Proven project management and organizational skills with an ownership mindset, able to manage multiple complex, multi-stakeholder initiatives in parallel — including issue resolution, root-cause analysis, and feature delivery.
Ability to translate recurring technical pain points into roadmap-level insights, documentation improvements, or product enhancements.
Strong communication skills and executive presence during high-visibility situations, ensuring technical clarity and customer confidence.
3+ years of experience in a fast-paced, high-growth, or customer-facing engineering environment.
Benefits
Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Customer Success Engineer providing technical troubleshooting and insights for Perplexity AI's enterprise accounts. Collaborating with multiple teams to enhance customer experience and product reliability.
Technical Customer Success Specialist supporting customers based in the United States and Switzerland. Provide training, troubleshoot inquiries, and partner with teams for seamless experiences.
Technical Customer Success Manager establishing client relationships around software solutions in the energy sector. Providing expertise and support while identifying upsell opportunities through a customer - focused approach.
Technical Customer Success Manager at Armis, guiding customers to realize value from the platform and aligning it with their business goals. Facilitating communication between technical teams and customers.
Technical Customer Success Specialist in a team developing web data gathering solutions. Providing exceptional customer experience and support for B2C/B2B clients.
Customer Success Engineer optimizing the Momentum platform for healthcare sites in Switzerland. Collaborating with clients and internal teams to ensure successful software implementation and customer satisfaction.
Customer Success Engineer managing technical relationships with customers at dLocal. Assisting in resolving operational difficulties while ensuring customer satisfaction in a fintech environment.
Customer Success Engineer integrating HUMAN’s products for clients while managing technical inquiries and documentation. Collaborating with Sales and Product teams to troubleshoot and improve services for clients.
Customer Success Manager serving technical stakeholders at Cloudinary, driving customer retention and revenue growth through expert guidance and relationship management.
Technical Implementation Specialist at Fiserv supporting deployment and configuration of Digital Banking applications using Microsoft technologies. Collaborating with clients to ensure operational readiness and secure system operations.