Design and develop end-to-end ML solutions — from data exploration and feature engineering to model training, validation, and deployment.
Collaborate cross-functionally with engineers, analysts, and product teams to integrate predictive and recommendation models into customer-facing and internal applications.
Implement scalable ML pipelines using Databricks, PySpark, and Delta Lake, ensuring reproducibility, performance, and maintainability.
Run controlled experiments (A/B tests, uplift modelling, causal inference) to measure model performance and quantify business impact.
Operationalise models through CI/CD and MLOps best practices, including model versioning, monitoring, retraining strategies, and governance.
Monitor production systems for drift, performance degradation, and anomalies, applying explainability and fairness techniques where needed.
Contribute to the development of feature stores and reusable data assets to accelerate experimentation and deployment cycles.
Stay current with emerging trends in ML, MLOps, and cloud data technologies to continuously improve model accuracy, scalability, and efficiency.
Requirements
5+ years of hands-on experience applying machine learning and statistical modelling in production or product-oriented environments.
Proven understanding of the full spectrum of ML techniques — from traditional models (linear/logistic regression, tree-based methods, ensemble learning) to modern deep learning architectures (CNNs, RNNs, transformers, graph neural networks, diffusion and foundation models).
Strong experience in Python, with proficiency in Scikit-learn, Autogluone, PyTorch or TensorFlow, and PySpark MLlib.
Demonstrated ability to design scalable ML pipelines and automate workflows with MLOps tools (MLflow, Kubeflow, Databricks ML runtime, AWS Sagemaker, or AWS Bedrock).
Familiarity with retrieval-augmented generation (RAG) and fine-tuning of large language models is a plus.
Proficiency in SQL and distributed data frameworks, with experience in feature engineering at scale.
Senior/Staff Data Scientist developing AI for commerce in the Middle East. Architecting systems for merchant and customer AI assistants and content generation.
Data Scientist leveraging statistical methods and machine learning techniques at FUCHS. Focus on data analysis, modeling, and collaboration for data - driven solutions.
Data Science Intern leveraging AI and ML technologies for product development at Seagate. Hands - on experience with data analysis, model development, and actionable insights generation.
Analyst within Credit Risk Management team identifying credit segmentation opportunities using statistical methods. Collaborating with teams to enhance credit decision process and policies.
Data Manager managing and analyzing company data at Amoddex, a consultancy for IT transformation projects. Ensuring data integrity and supporting strategic decision - making in a collaborative environment.
Data Scientist at Capital One on the LLM Customization Team utilizing the latest in computing and machine learning technologies. Collaborating with data scientists and engineers to deliver AI powered products.
Lead Full Stack Data Scientist at Tilt, building the intelligence layer for data - based decisions. Driving data science strategy and analytics to enhance product and growth insights.
Data Scientist focusing on Generative AI applications and engineering problem - solving at Ford. Collaborating with cross - functional teams to innovate and improve technology solutions in the automotive sector.
AI Engineer/Data Scientist in Ford's Global Data Insights & Analytics team. Developing advanced AI/ML solutions and collaborating on cloud - native data products.
Data Scientist transforming customer data into insights that guide strategic decisions for Riachuelo. Collaborating with teams to analyze and visualize data trends for business growth.