Engineering Manager leading a team at Baseten, scaling and optimizing LLM inference workloads. Focused on AI application performance, reliability, and cost efficiency in cloud environments.
Responsibilities
Lead, mentor, and grow a team of Forward Deployed Engineers, providing guidance on technical direction, project execution, and professional development.
Set clear goals and ensure timely, high-quality delivery across multiple customer-facing projects involving LLM deployment and inference optimization.
Collaborate with leadership to align team priorities with company and customer goals, balancing short-term delivery, widely varying customer priorities, and long-term technical initiatives.
Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects.
Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey, including sales, implementation, and expansion.
Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers.
Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.
Own products and customer projects end-to-end, functioning as engineer, project manager, and product manager, with a focus on user empathy, project specification, and execution.
Requirements
Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or a related field.
4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity.
Strong programming skills in Python, with production experience in building or optimizing ML inference systems.
Proven experience with LLMs, inference optimization, or serving frameworks (e.g., vLLM, TensorRT, Triton, Hugging Face, Ray Serve).
Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems.
Excellent communication and collaboration skills, with the ability to lead cross-functional efforts and drive outcomes in ambiguous, fast-paced environments.
Benefits
Competitive compensation, including meaningful equity.
100% coverage of medical, dental, and vision insurance for employees and dependents.
Generous PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
Paid parental leave.
Company-facilitated 401(k).
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.