Software Engineer focusing on ML infrastructure for drug discovery at Genesis AI. Leading engineering efforts to enhance scalable platforms for generative modeling and large-scale simulations.
Responsibilities
Lead engineering efforts focused on continuous improvement of the AI platform, focused on rapid build out and iteration on scalable and robust distributed infrastructure for ML training, inference, and evaluation.
Support model training and deployment across multiple clusters and multiple clouds, optimizing for throughput and cost.
Optimizing efficiency of ML models and other workloads in terms of latency, throughput, memory consumption, etc. (e.g., via GPU performance engineering), pushing the limits of what’s possible with the current hardware.
Contribute to the long-term vision for Genesis’ infra platform.
Requirements
Strong engineer who constantly strives for technical excellence. You can write clean code and have a deep understanding of the codebases you work in.
Deeply experienced with distributed training and inference of large models on GPU clusters and some of the core libraries and frameworks we use: Pytorch, Pytorch Lightning, Pytorch Geometric, and Ray.
Independent thinker with a strong sense of ownership and capability of engineering robust systems from first-principles-based conceptualization to state-of-the-art realization.
Curious, problem-oriented thinker who is excited to dive deep into the emerging field at the intersection of AI, physics, chemistry, and biology and make foundational contributions and discoveries (no previous experience in anything but ML necessary).
Experienced with building, maintaining and debugging low-level cluster infrastructure running on multiple clouds using Kubernetes and Terraform.
Experienced GPU engineer who can quickly figure out performance bottlenecks and architect highly performant code for large scale ML workloads.
Experience with XLA, Triton, CUDA, or similar accelerator programming languages and/or deep learning compiler stacks.
Experience working with some of the following: molecular systems (protein sequences and 3D structures, small molecules, etc.), ML force fields or other physics-informed models and methods, or point cloud data in other application domains, such as 3D graphics.
Benefits
Competitive compensation package that includes salary and equity.
Comprehensive health benefits: Medical, Dental, and Vision (covered 100% for the employees).
401(k) plan.
Open (unlimited) PTO policy.
Free lunches and dinners at our offices.
Paid family leave (maternity and paternity).
Life and long- and short-term disability insurance.
Machine Learning Engineer designing and implementing AI systems focused on Japanese language challenges at Woven by Toyota. Involves technical R&D, system design, and collaboration with cross - functional teams.
Principal Software Engineer leading MLOps within Analytics Platform at Sun Life. Focused on AWS and machine learning operations, collaborating across technical and business teams.
Machine Learning Engineer designing and optimizing deep learning models for safety - critical environments at Destinus. Shaping the future of high - speed, autonomous flight technologies.
Machine Learning Engineer optimizing personalization systems for Spotify's audio streaming service. Collaborating with cross - functional teams to enhance user experience and deliver recommendations.
Principal Machine Learning Engineer developing ML and GenAI solutions in a cloud - native environment at Flexera. Leading a high - impact team and driving operational excellence for ML infrastructure.
Senior ML Platform/Ops Engineer building AI - powered ML pipelines for a dynamic Ed - Tech company. Collaborating with ML scientists and engineers to ensure reliable deployment and observability.
Senior ML Platform/Ops Engineer building ML systems for AI - powered learning at Preply. Productionizing machine learning with high reliability, performance, and observability in a hybrid environment.
Machine Learning Engineer developing advanced Deep Learning models for autonomous driving technology at Mobileye. Collaborating in a high - end algorithmic engineering team on critical computer vision challenges.
Machine Learning Engineer focusing on vulnerabilities and security of AI systems at Carnegie Mellon University. Collaborating with a team to build robust prototypes and provide solutions for government sponsors.