Build and release the Red Hat AI Inference runtimes and continuously improve processes and tooling used by the DevOps team
Work closely with product and research teams to scale SOTA deep learning products and software for enterprise deployments
Create and manage model training and deployment pipelines
Create DevOps and CI/CD infrastructure and scale the current technology stack
Actively contribute to managing and releasing upstream and midstream product builds
Test to ensure correctness, responsiveness, and efficiency
Troubleshoot, debug and upgrade Dev & Test pipelines
Identify and deploy cybersecurity measures via continuous vulnerability assessment and risk management
Collaborate with cross-functional teams on market requirements and best practices
Keep abreast of the latest technologies and standards in the field
Requirements
2+ years of experience in MLOps, DevOps, Automation and modern Software Deployment practices
Strong experience with Git, Github Actions including self-hosted runners, Terraform, Jenkins, Ansible, and common technologies for automation and monitoring
Highly experienced with administering Kubernetes/Openshift
Familiar with Agile development methodology
Experience with Cloud Computing using at least one of the following: AWS, GCP, Azure, or IBM Cloud
Solid programming skills especially in Python
Solid troubleshooting skills
Experience maintaining an infrastructure and ensuring stability
Ability to interact comfortably with members of a large, geographically dispersed team
Familiarity with contributing to the vLLM CI community is a big plus
While a Bachelor’s degree or higher in computer science, mathematics, or a related discipline is valued, technical prowess, initiative, problem solving, and practical experience are prioritized
Benefits
Comprehensive medical, dental, and vision coverage
Flexible Spending Account - healthcare and dependent care
Health Savings Account - high deductible medical plan
Retirement 401(k) with employer match
Paid time off and holidays
Paid parental leave plans for all new parents
Leave benefits including disability, paid family medical leave, and paid military leave
Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more
Possibility of bonus, commission, and/or equity
Flexible work arrangements (in-office, office-flex, or fully remote depending on role)
Machine Learning Engineer responsible for optimizing AI pipelines at Easy2Parts. Join a growing team to revolutionize component sourcing with AI technology.
AI/ML Engineer developing and deploying machine learning solutions for Nokia's network optimization projects. Collaborating with cross - functional teams to enhance network planning capabilities.
Machine Learning Platform Engineer for Coinbase, building foundational components for ML at scale. Collaborating on fraud combat, personalizing user experiences, and blockchain analysis.
Machine Learning Engineer focused on building sophisticated models to protect Coinbase users from fraud. Engaging in hands - on technical role with modern AI/ML methodologies.
Senior ML Platform Engineer developing and maintaining scalable ML infrastructure at GEICO. Focused on Large Language Models and collaborating with data science and engineering teams.
Staff ML Engineer developing GenAI infrastructure at Zendesk. Leading design and optimization of ML platforms while fostering technical excellence and collaboration.
Senior Deep Learning Engineer developing deep learning models for wireless communications. Working on next - gen signal processing and radio access technologies at NVIDIA's Vietnam R&D center.
Leading a team of ML Engineers to design and deploy AI - driven solutions at Welldoc. Overseeing critical ML projects while collaborating with international teams.
Senior ML Platform Engineer building and scaling machine learning infrastructure for AI applications. Responsible for LLM deployment, Kubernetes management, and mentoring engineering teams.
Internship in AI and machine learning for internal process optimization at Dräger. Collaborate with cross - functional teams and develop predictive models in Lübeck.