Solutions Architect at NVIDIA driving AI and ML solutions on cloud platforms. Collaborating with multi-functional teams and mentoring customers to improve GPU-enabled machine learning workflows.
Responsibilities
Help cloud customers craft, deploy, and maintain scalable, GPU-accelerated inference pipelines on cloud ML services and Kubernetes for large language models (LLMs) and generative AI workloads.
Enhance performance tuning using TensorRT/TensorRT-LLM, vLLM, Dynamo, and Triton Inference Server to improve GPU utilization and model efficiency.
Collaborate with multi-functional teams (engineering, product) and offer technical mentorship to cloud customers implementing AI inference at scale.
Build custom PoCs for solution that address customer’s critical business needs applying NVIDIA hardware and software technology
Partner with Sales Account Managers or Developer Relations Managers to identify and secure new business opportunities for NVIDIA products and solutions for ML/DL and other software solutions
Prepare and deliver technical content to customers including presentations about purpose-built solutions, workshops about NVIDIA products and solutions, etc.
Conduct regular technical customer meetings for project/product roadmap, feature discussions, and intro to new technologies.
Establish close technical ties to the customer to facilitate rapid resolution of customer issues.
Requirements
BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Statistics, Physics, or other Engineering fields or equivalent experience.
3+ Years in Solutions Architecture with a proven track record of moving AI inference from POC to production in cloud computing environments including AWS, GCP, or Azure
3+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow
Excellent knowledge of the theory and practice of LLM and DL inference
Strong fundamentals in programming, optimizations, and software design, especially in Python
Experience with containerization and orchestration technologies like Docker and Kubernetes, monitoring, and observability solutions for AI deployments
Junior Solutions Engineer providing technical qualification and support for automotive SaaS solutions. Collaborating with commercial and client services teams in Milan, Italy.
Solutions Architect transforming businesses through Generative AI applications and strategies. Engaging with customers to ensure value realization from their investment in OpenAI's technologies.
Solutions Architect designing and implementing solutions for complex business problems. Collaborating with stakeholders and leading a team while ensuring adherence to governance processes.
Solutions Architect at INTEGRIS Health shaping application technologies through design and implementation. Mentoring project teams and advancing strategic goals in clinical or business systems.
Senior Weapons Systems Integration Engineer in charge of military aircraft systems integration and modernization efforts. Focusing on weapons integration and logistics readiness at Wright - Patterson AFB.
Solutions Architect driving API standards and integration patterns for Zantech's DoD projects. Involving multi - cloud architecture and cross - domain solutions, ensuring secure data flows.
Anaplan Solutions Architect managing and optimizing financial planning models for OpenAI's growth. Collaborating with finance and tech teams to ensure data integrity and scalability.
Principal Solutions Architect at Paramount improving media platforms through design and AI technology. Overseeing MAM/PAM ecosystems and architecture for media supply chains.
Senior Elastic Stack Data Integration Engineer designing and maintaining data ingestion pipelines for Missile Defense Agency. Focused on building resilient and scalable Logstash architectures.
Senior Security Integration Engineer supporting Missile Defense Agency through Elastic Stack integration and optimization of security data. Leading customer engagements and technical discussions while mentoring junior team members.