About the role

  • Gen AI Engineer at Elevance Health: analyze organizational data to produce AI-driven insights, collaborate on LLM development, and integrate machine learning models into production.

Responsibilities

  • Analyze and model organizational data for the Artificial Intelligence (AI) function to draw insights that inform business decisions
  • Apply data extraction, transformation, and loading (ETL) techniques to connect large data sets from a variety of sources
  • Define LLM development and fine-tuning strategies, best practices, and standards to make AI/ML model deployment and monitoring more efficient
  • Develop roadmap and strategy for NLP, LLM, Gen AI model development and lifecycle implementation
  • Design and develop custom ML, Gen AI, NLP, and LLM models for batch and stream-processing AI/ML pipelines, including data ingestion, preprocessing modules, search and retrieval, and Retrieval Augmented Generation (RAG)
  • Collaborate closely with MLOps and product teams, business stakeholders, machine learning engineers, and software engineers to deploy machine learning models into production environments, ensuring smooth integration, reliability, and scalability
  • Identify and implement model optimizations to improve system efficiency
  • Ensure that ML model development follows established standards and best practices and adheres to model and data governance standards

Requirements

  • Requires a Bachelor’s degree in a highly quantitative field (Computer Science, Machine Learning, Operations Research, Statistics, Mathematics, etc.) or an equivalent degree, and 4 or more years of experience
  • Advanced Python proficiency
  • 4+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using ML, NLP, and computer vision solutions
  • 4+ years of demonstrated hands-on experience with Python, SQL, Hugging Face, TensorFlow, Keras, PyTorch, and Spark
  • Experience with GCP/AWS cloud platforms
  • Strong knowledge of and measurable hands-on experience with developing or tuning Large Language Models (LLM) and Generative AI (GAI)
  • Experience with NLP, LLMs (extractive and generative), fine-tuning and LLM model development
  • Experience developing and optimizing high-quality prompts for NLP applications
  • Excellent written & verbal communication and stakeholder management skills
  • 4+ years of project leadership experience, including Agile project management and the Scaled Agile Framework (SAFe)
  • LLM Infrastructure & Deployment: LLM serving platforms (vLLM, Text Generation Inference, FastAPI); Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor parallelism, pipeline parallelism); LLM caching strategies for inference optimization; RAG architecture design and implementation
  • Advanced cloud infrastructure (AWS EKS/ECS, GCP GKE, Azure AKS) knowledge
  • Containerization strategies for ML workloads; Canary deployments for ML models

Benefits

  • merit increases
  • paid holidays
  • paid time off
  • incentive bonus programs
  • medical
  • dental
  • vision
  • short- and long-term disability benefits
  • 401(k) + match
  • stock purchase plan
  • life insurance
  • wellness programs
  • financial education resources

Job title

Gen AI Engineer

Job type

Experience level

Mid level, Senior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements
