Applied Researcher I – AI Foundations at Capital One | Hybrid Hired

About the role

Applied Researcher I utilizing AI foundations to enhance customer banking experiences at Capital One. Collaborating with cross-functional teams to build and implement innovative AI-powered solutions for improved interactions.

Responsibilities

Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money.
Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation.
Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences.
Flex your interpersonal skills to translate the complexity of your work into tangible business goals.

Requirements

Currently has, or is in the process of obtaining, a PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields.
M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research.
PhD in Computer Science, Machine Learning, Computer Engineering, Applied Mathematics, Electrical Engineering or related fields preferred.
LLM PhD focus on NLP or Masters with 5 years of industrial NLP research experience preferred.
Multiple publications on topics related to the pre-training of large language models.
Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens).
Publications in deep learning theory.
Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR.
Optimization (Training & Inference) PhD focused on topics related to optimizing training of very large deep learning models.
Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression.
Experience optimizing training for a 10B+ model.
Deep knowledge of deep learning algorithmic and/or optimizer design.
Experience with compiler design.
Finetuning PhD focused on topics related to guiding LLMs with further tasks.
Demonstrated knowledge of principles of transfer learning, model adaptation and model guidance.
Experience deploying a fine-tuned large language model.

Benefits

Comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Similar roles

Browse all Artificial Intelligence jobs

45 minutes ago

GG

IT Process Manager – Artificial Intelligence

Garant Maschinenhandel GmbH

IT Process Manager using AI to improve process landscape for packaging machinery. Collaborating on AI use cases and managing their implementation in projects.

Onsite Role

Lengerich Germany Artificial Intelligence

14 hours ago

VE

Business Configuration Analyst – AI

Verisk

Business Configuration Analyst implementing AI - powered solutions across insurance platforms. Collaborating with engineers and customers while configuring and optimizing AI technologies.

Hybrid Role

Holmdel United States Artificial Intelligence

$61,200 - $116,800 per year

14 hours ago

SM

Senior Manager – AI Transformation

Solstice Advanced Materials

Sr. Manager leading enterprise - wide AI transformation initiatives at Solstice Advanced Materials. Collaborating with stakeholders to redesign business processes and drive adoption of AI solutions.

Hybrid Role

Morris Plains United States Artificial Intelligence

$190,000 - $220,000 per year

yesterday

HE

Technical Marketing Manager – AI

Hewlett Packard Enterprise

Technical Product Marketing Manager focused on product marketing strategy for HPE Private Cloud AI. Responsible for technical content execution and collaborative efforts with product management.

Hybrid Role

Spring United States Artificial Intelligence

$105,500 - $243,000 per year

yesterday

RE

AI Prompt Engineer

RELX

AI Prompt Engineer focusing on developing conversational AI experiences for healthcare professionals at Elsevier. Join a team creating innovative solutions powered by generative AI.

Onsite Role

Philadelphia United States Artificial Intelligence

$95,300 - $158,800 per year

yesterday

HF

Junior AI Videographer

HFM

Junior AI Videographer creating engaging AI - driven video and visual content for a multi - asset broker. Collaborating on marketing campaigns and digital storytelling.

Hybrid Role

Larnaca Cyprus Artificial Intelligence

yesterday

AV

AI Bootcamp

Avanade

Technology Consultant role with Avanade focusing on IT and digital solutions after completing a foundational training program. Join a community passionate about technology and innovation.

Onsite Role

London United Kingdom Artificial Intelligence

£30,320 per year

yesterday

AT

Manager, Data & AI – Defense

Atos

Manager in Data & AI for Defense at Atos, responsible for structuring AI consulting practice. Leading projects related to AI sovereignty and resilience for defense and aerospace sectors.

Onsite Role

Bezons France Artificial Intelligence

€75,000 - €85,000 per year

yesterday

DS

Junior Software Developer, AI

Digitale Leute School

Junior Software Developer with AI knowledge at Digitale Leute School offering training and support for career advancement in software development.

Hybrid Role

Hamburg Germany Artificial Intelligence

yesterday

CO

Applied Researcher I – AI Foundations, LLM Core, Agentic AI

Capital One

Applied Researcher leveraging AI technologies to enhance customer interactions at Capital One. Collaborating with experts to build, evaluate, and implement advanced AI models across financial services.

Onsite Role

New York City United States Artificial Intelligence

$218,700 - $272,300 per year