Engineering Manager leading a team at Baseten, scaling and optimizing LLM inference workloads. Focused on AI application performance, reliability, and cost efficiency in cloud environments.
Responsibilities
Lead, mentor, and grow a team of Forward Deployed Engineers, providing guidance on technical direction, project execution, and professional development.
Set clear goals and ensure timely, high-quality delivery across multiple customer-facing projects involving LLM deployment and inference optimization.
Collaborate with leadership to align team priorities with company and customer goals, balancing short-term delivery, widely varying customer priorities, and long-term technical initiatives.
Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects.
Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey including: sales, implementation, and expansion.
Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers
Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.
Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution.
Requirements
Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field.
4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity.
Strong programming skills in Python, with production experience in building or optimizing ML inference systems.
Proven experience with LLMs, inference optimization, or serving frameworks (e.g., vLLM, TensorRT, Triton, Hugging Face, Ray Serve).
Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
Excellent communication and collaboration skills—able to lead cross-functional efforts and drive outcomes in ambiguous, fast-paced environments.
Benefits
Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employee and dependents
Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Engineering Director responsible for scaling GEICO's AI - first Enterprise Platform. Delivering enterprise capabilities on schedule and ensuring high performance and reliability standards.
Engineering Manager leading AI - Native Engineering initiatives in Bangalore for healthcare technology. Responsible for data pipelines and ML - driven growth strategies in a hybrid work environment.
ML Research Engineering Manager leading research engineering team for TwelveLabs' multimodal models. Managing technical roadmap and team growth while ensuring high engineering standards.
Manager, Software Engineering role overseeing a team of engineers at Milliman IntelliScript. Responsible for software delivery, technical direction, and team growth within a SAFe Agile environment.
Senior Manager leading Data Acquisition team for WEX's DaaS platform. Focus on scalable, reliable data ingestion from diverse sources with strong technical oversight.
Data Engineering Manager leading a talented team to build reliable data pipelines for adm - Indicia. Partnering with stakeholders to ensure best practices in data quality and platform reliability.
Engineering Manager leading engineering teams at Foundation Health to deliver AI - powered healthcare solutions. Collaborating with leadership to drive innovation in a hybrid work environment.
Head of Engineering at Rabobank overseeing engineering strategy, technology stack governance, and talent management across Australia and New Zealand for the financial sector.
Lead the Software Engineering team at Henry Schein, driving AI and modern approaches for product delivery. Collaborate across teams to shape technical direction and ensure high - quality engineering outcomes.
Software Engineering Manager for Wells Fargo's Payments and Fraud Product Services. Leading application design, development, and team management in an agile environment.