Machine Learning Infrastructure Engineer optimizing scalable infrastructure for ML at Gridmatic startup. Focus on machine learning, distributed systems, and renewable energy transformation.
Responsibilities
Own and optimize scalable, robust distributed infrastructure for ML training
Optimize throughput and cost across clusters and clouds
Improve efficiencies by optimizing latency and memory consumption
Help define long-term vision for ML platform
Mentor junior engineers and contribute to team culture
Requirements
3+ years of experience in machine learning and software engineering
Strong understanding of codebases and ability to write scalable code
Experience in researching and implementing deep learning models
Knowledge of GPU clusters and core libraries (PyTorch, PyTorch Lightning, Ray)
Familiarity with large-scale data storage (Zarr, SQL, feature stores)
Independence and ownership in engineering robust systems
Enthusiasm for renewable energy and a willingness to learn
Journeyman Infrastructure Engineer supporting the delivery and enhancement of enterprise data and analytics products. Working with government partners and teams on scalable, production - ready solutions.
Journeyman Infrastructure Engineer supporting DoD enterprise data and analytics program. Collaborating with teams to deliver scalable, production - ready IT solutions for national security.
Public Cloud Infrastructure Engineer at Lloyds Banking Group focused on scalable cloud services for developers. Assist in building secure automated cloud platform capabilities using modern infrastructure practices.
Infrastructure Engineer focusing on automation and platform enablement for data protection within the DLM team. Involves designing automated pipelines and transitioning to policy - as - code models in a hybrid working environment.
Cloud Infrastructure Engineer at Lead Forensics managing AWS infrastructure and working on hybrid platforms. Supporting internal operations and customer - facing services with a focus on security and performance.
IT Infrastructure Engineer maintaining diverse infrastructure for Arden University. Delivering IT vision, supporting students and staff with a high - performing technology environment.
Cloud Infrastructure Engineer focusing on building and maintaining OCI environments for AI/ML - enabled programs. Collaborating with Army personnel to integrate AI models into operational architecture.
Cloud Infrastructure Engineer building and securing environments for AI/ML model testing in DoD settings. Requires extensive experience in Cloud technologies and collaboration with government personnel.
Support internal operations and exceed SLAs as a Sr Infrastructure Engineer for Resideo Technologies. Design and implement solutions to enhance system reliability and performance.
Infrastructure Architect leading design and governance of system resiliency across global financial services firm. Ensures robust, fault - tolerant infrastructure capable of rapid recovery from disruptions.