ML Engineer responsible for optimizing model inference and managing deployment for AdTech solutions. Collaborating with product team to enhance LLM models in a hybrid environment.
Responsibilities
Optimize model inference for performance and cost, using techniques like quantization and LoRA
Manage the end-to-end deployment lifecycle of models into a production environment
Take projects from a prototype (e.g., a Jupyter notebook) to a fully integrated, production-grade service
Collaborate with software developers and product managers to define requirements and implement solutions
Requirements
Proven experience in Python and its core ML ecosystems
Hands-on experience in deploying, fine-tuning, and inferencing LLMs or other deep learning models in a production setting
Experience with cloud platforms (AWS, GCP, or Azure) and containerization technologies (Docker, Kubernetes)
Ability to work independently and manage the full lifecycle of a machine learning service
Benefits
health insurance
fully covered 10 working days of sick leave per year
Senior Software Developer working on ML Infrastructure and Deployment at Verafin. Helping develop cutting - edge fraud detection tools alongside analytics teams using AWS and Terraform.
Machine Learning Engineer developing advanced SLAM systems for autonomous trucking environments at Bot Auto. Collaborating with cross - functional teams to optimize mapping solutions and ensure operational stability.
Graduate Deep Learning Algorithm Developer developing perception technologies for autonomous driving. Tackling challenges in object detection and 3D perception using state - of - the - art deep learning models.
Principal AI/ML Engineer leading the AI/ML infrastructure development for WEX's risk service needs. Focused on innovative engineering and technology solutions within a high - stakes environment.
AI/ML Engineer developing solutions in artificial intelligence for HPE. Responsible for conducting research, designing AI solutions, and mentoring team members.
Machine Learning Engineer focusing on modeling cancer cells and developing related tools. Collaborating with researchers and scientists to advance cancer treatment through ML.
Machine Learning Engineer II developing production - grade ML models for fraud detection at GEICO. Collaborating on system architecture and ensuring optimal performance of fraud assessment systems.
AI/ML Engineer III designing and architecting AI solutions at Hewlett Packard Enterprise. Collaborating with teams to drive innovation and tackle complex problems.
AI/ML Engineer deploying state - of - the - art AI models to solve real - world problems at Brain Co. Working in healthcare, government, and energy sectors for impactful results.
Trainer at WeAndTheMany facilitating learning by sharing experiences and creating interactive sessions. Engaging with students to enhance their skills and knowledge through dynamic teaching methods.