Software Engineer focusing on ML performance at Baseten, driving optimizations for large language models. Join a dynamic team contributing to advanced AI applications.
Responsibilities
Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, KV cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
Dive deep into the underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues.
Apply and scale optimization techniques across a wide range of ML models, particularly large language models.
Collaborate with a diverse team to design and implement innovative solutions.
Own projects from idea to production.
Requirements
Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or a related field.
Experience with one or more general-purpose programming languages, such as Python or C++.
Tech Lead at Doxallia managing technical solutions and leading a development team. Requires strong coding skills and at least 5 years of relevant experience in agile environments.
Lead Full-stack Engineer developing Anti-Money Laundering enterprise fraud applications. Seeking expertise in Java Spring Boot and API development for high-stakes financial software.
Embedded Linux Development Engineer at Hirsch France working on security solutions in a collaborative team. Focusing on innovation and optimization within the R&D department in Aix-en-Provence.
Technical Lead overseeing AI product development and engineering teams at Ironclad. Building innovative solutions for legal professionals with extensive experience in software engineering.
Staff Software Engineer at Ironclad developing features for a leading AI contracting platform. Collaborating with cross-functional teams to empower legal and business functions through modern technology.
Senior Software Engineer at Ironclad developing scalable web applications for an AI contracting platform. Working collaboratively with teams to build high-quality systems and features in a hybrid environment.
Senior Software Engineer joining Docker's AI team to build containerized AI agents. Collaborating on cutting-edge technologies for scalable and intelligent agent deployment.
Senior Software Engineer within PNC's Technology organization owning and evolving Workday integrations. Leading ETL workflows and ensuring reliable, secure integrations across HR/Payroll applications.
Embedded Software Engineer I at Gentex creating software for embedded platforms. Collaborating with cross-functional teams to develop and test products meeting customer needs.
Software Developer Intern at BECU contributing to enterprise-class software development and collaborating with experienced professionals. Engaging in coding, testing, and debugging efforts to support business needs.