Hybrid Lead AI Engineer, FM Hosting, LLM Inference

Posted 4 hours ago

About the role

  • Lead AI Engineer responsible for developing AI software components and delivering AI-powered products at Capital One, collaborating with cross-functional teams to enhance how customers interact with innovative AI solutions.

Responsibilities

  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One
  • Design, develop, test, deploy, and support AI software components, including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
  • Leverage a broad stack of open-source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, vector databases, NeMo Guardrails, PyTorch, and more
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, and throughput) of large-scale production AI systems
  • Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One

Requirements

  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in the same fields plus at least 2 years of experience
  • At least 4 years of experience programming with Python, Go, Scala, or Java
  • Experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
  • Experience designing, developing, delivering, and supporting AI services
  • Experience developing AI and ML algorithms or technologies (e.g. LLM inference, similarity search and vector databases, guardrails, memory) using Python, C++, C#, Java, or Go
  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
  • Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production

Benefits

  • Comprehensive, competitive, and inclusive set of health, financial, and other benefits that support your total well-being
  • Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)

Job title

Lead AI Engineer, FM Hosting, LLM Inference

Experience level

Senior

Salary

$197,300 - $245,600 per year

Degree requirement

Bachelor's Degree
