Data Scientist role designing and deploying scalable AI systems for multi-user applications. Collaborating with cross-functional teams using advanced tools and frameworks in AI.
Responsibilities
Lead the architecture, development, and deployment of scalable machine learning systems, focusing on real-time inference for LLMs serving multiple concurrent users.
Optimize inference pipelines using high-performance frameworks like vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost.
Design and implement agentic AI systems utilizing frameworks such as LangChain, AutoGPT, and ReAct for autonomous task orchestration.
Fine-tune, integrate, and deploy foundation models including GPT, LLaMA, Claude, Mistral, Falcon, and others into intelligent applications.
Develop and maintain robust MLOps workflows to manage the full model lifecycle including training, deployment, monitoring, and versioning.
Collaborate with DevOps teams to implement scalable serving infrastructure leveraging containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP, Azure).
Implement retrieval-augmented generation (RAG) pipelines integrating vector databases like FAISS, Pinecone, or Weaviate.
Build observability systems for LLMs to track prompt performance, latency, and user feedback.
Work cross-functionally with research, product, and operations teams to deliver production-grade AI systems handling real-world traffic patterns.
Stay updated on emerging AI trends, hardware acceleration techniques, and contribute to open-source or research initiatives where possible.
Requirements
Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or related fields.
6–7 years of experience in machine learning engineering, applied AI, or MLOps roles.
Strong proficiency in Python and ML frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers.
Deep knowledge of NLP, transformer-based architectures, and generative AI models.
Hands-on experience with scalable LLM inference optimization using tools like vLLM, Groq, Triton Inference Server, TensorRT, or ONNX Runtime.
Proven ability to serve AI models to concurrent users with low latency and high throughput.
Experience in deploying ML systems on cloud platforms (AWS, GCP, Azure).
Expertise in containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.
Familiarity with vector search technologies (FAISS, Pinecone, Weaviate) and RAG implementations.
Lead AI and Data Scientist shaping impactful AI solutions in Madrid's EMEA Digital Innovation Hub. Collaborating globally to apply advanced machine learning techniques and foster innovation.
Senior Associate at PwC focusing on data analytics to drive insights and guide client strategies. Involves advanced techniques and collaboration on AI and GenAI solutions.
Data Scientist responsible for analyzing complex data sets and developing methods to create actionable insights. Collaborate with engineering teams to improve data quality and deliver business value.
Senior Director driving product development in data science for TransUnion. Leading initiatives in AI and analytics for the Specialized Risk portfolio.
Data & Analytics Lead at AstraZeneca driving data - driven solutions in clinical product development. Leading teams and collaborating with stakeholders across global platforms.
Mid - Level Engineering Data Scientist for Boeing's Global Services Analytics team. Creating analytics models and collaborating on health management solutions for KC - 46 platform.
AI expert managing predictive modeling and statistical validation for TEHORA. Integrating predictive models into API architecture and producing performance metrics.
Head of Data Strategy leading and developing data initiatives for Zurich's GI Business. Focusing on data strategy, governance, and analytics while fostering collaboration across teams.
Medical Analyst analyzing engagement effectiveness with advanced analytics solutions aligned with Medical business strategies. Collaborating with cross - functional teams to provide insights for US Medical Affairs.
Research Fellow/Trainee in Women's Health using Data Science and Health Information Technology. Developing interdisciplinary research skills and methodologies focusing on health research.