Machine Learning Systems Research Intern at Red Hat working on AI inference and model optimization techniques. Collaborating with experts in the field while gaining hands-on experience in applied ML research.
Responsibilities
Research and implement techniques for LLM inference and LLM optimizations.
Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput.
Collaborate with researchers and engineers to integrate optimizations into real-world machine learning workflows.
Document findings and contribute to technical reports, blog posts, or research publications.
Requirements
Currently pursuing a Ph.D. degree in Computer Science, Electrical Engineering, Machine Learning, or a related field
Strong programming skills in C++, CUDA, and Python
Experience with tensor math libraries such as PyTorch
Familiarity with AI model optimization techniques such as quantization (e.g., INT4, FP8), pruning, and knowledge distillation
Deep understanding and experience in GPU performance optimizations
Excellent knowledge of large language model architectures
Strong analytical and problem-solving skills
Excellent communication skills and ability to work in a team-oriented research environment
Background in efficient inference techniques for large-scale language models or computer vision models
Prior experience contributing to open-source ML frameworks or research publications
1 or more co-authored papers at a top tier conference like NeurIPS, ICLR, ACL, CVPR, MLSys is a big plus.
Benefits
Competitive stipend
Mentorship from leading experts in machine learning and model efficiency
Opportunity to contribute to research papers, patents, or open-source projects
Machine Learning Engineer at Capital One responsible for productionizing ML applications in Agile teams. Focused on ML architectural design, coding, and maintaining high availability of models.
Lead Machine Learning Engineer at Capital One focused on machine learning applications and systems. Collaborate with Agile teams to develop scalable solutions for business problems.
Senior Machine Learning Engineer focused on Machine Learning Ops for Autodesk software products. Building production infrastructures, ensuring AI - powered experiences, and collaborating with cross - functional teams.
Senior Machine Learning Engineer developing and deploying machine learning models for autonomous trucks. Collaborating with various teams to enhance safe and efficient decision - making in freight environments.
AI Engineer at PayPal designing, building, and deploying autonomous AI systems powered by LLMs. Collaborating across teams on AI engineering, distributed systems, and product development.
Machine Learning Engineer designing and deploying advanced training capabilities to support U.S. Navy operational readiness. Collaborate on machine - learning models to enhance combat system training environments.
Cloud MLOps Engineer supporting Data Science and Engineering teams by automating CI/CD pipelines and managing multi - cloud infrastructure for ML production.
Lead development of Agentic AI capabilities and LLM applications for multiple mission management applications. Mentor teams to implement ML algorithms addressing customer challenges.
Staff AI/ML Engineer at CACI responsible for developing AI/ML algorithms and analyzing datasets. Join a high - performing team supporting national safety missions.
AI/ML Engineer at CACI developing machine learning algorithms for multiple applications. Collaborating with a research team to implement cutting - edge AI/ML solutions for customer missions.