Lead the design and architecture of an GPU based AI infrastructure platform
Work with Biomedical research scientists to develop and implement technical solutions for ML/Ops (Run:AI) hosted on K8 EKS cluster
Oversee the design and implementation of data storage, retrieval, and processing pipelines
Collaborate with Biomedical research & Data scientists and other business stakeholders to understand their needs and translate them into technical solutions
Optimize the performance and cost-efficiency of the platform
Requirements
Bachelor’s degree in Information Technology, Computer Science, or Engineering
AWS Solution Architect certification – professional
8+ years of strong technical hands-on experience of delivering infrastructure and platform services across geographic and business boundaries
Working experience on GPU based AI Infrastructure
Experience in NVIDIA DGX Infra is highly preferred
Deep understanding of Architecture and Design of Platform Engineering products with focus mainly on Data science, ML/Ops and Bio science or Pharma Gen AI foundational models
Experience in NVIDIA BioNeMo or Clara is highly preferred
Extensive experience in building infra solutions on AWS, particularly with services like AWS Bedrock, Amazon Q, SageMaker, ECS/EKS
Knowledge of containerization and orchestration technologies, such as Docker and Kubernetes
Experience with DevOps practices and tools, including CI/CD pipelines, infrastructure as code (IaC), and monitoring solutions
Excellent skills in collaborating with business users, Product team, Operationalizing the delivered products and working closely with Security for implementing compliance
Good knowledge on implementing well defined & industry standard Change management process for platform & its products
Have a well-structured Use-case onboarding process
Should ensure to have documentation for Platform products and implementations done
Experience with DevOps Orchestration/Configuration/Continuous Integration Management technologies
Good understanding of High Availability and Disaster Recovery concepts for infrastructure
Ability to analyze and resolve complex infrastructure resource and application deployment issues.
Senior Cloud Architect responsible for Microsoft Azure infrastructure optimization at Quest One, contributing to climate protection efforts. Engaging in IT governance and providing user support in Hamburg, Germany.
Cloud Computing Platform Administrator responsible for maintaining CI/CD tools for Desjardins Group. Involved in DevOps practices and application security integration.
Infrastructure and Cloud Computing Architecture Consultant designing and automating technology architectures at Beneva. Collaborating with IT and business teams for strategic project support.
Senior Cloud Platform Engineer at Smarsh, focusing on architecting and building hybrid cloud platforms. Contribute to risk management and compliance in digital communications.
Intern Cloud Engineer focusing on Azure technologies at HF Sinclair. Assisting with deployments and engaging in cross - functional collaboration for digital initiatives.
Director of Private Cloud Platform Engineering at Ford overseeing private cloud modernization and development. Leading teams, driving strategic direction, and enhancing developer experience in a hybrid role.
Cloud DevOps Engineer enabling cloud - native AWS applications delivery at Boeing. Collaborating on CI/CD and Infrastructure - as - Code for secure and observable deployments.
Senior Cloud Platform Engineer at Yora designing and maintaining serverless applications. Leading technical decisions and collaborating with product owners in a dynamic tech environment.
SAP Basis & Cloud Engineer joining client for international modernization and cloud deployment projects. Focused on SAP systems administration and integrations in a hybrid work environment.
Data Engineer focused on optimizing data pipelines and processing for a leading data analytics company. Collaborating with stakeholders to drive data - informed decision - making through business intelligence solutions.