Data Scientist at SiteMinder building machine learning solutions. Collaborating with teams to integrate models and tackle complex data science challenges.
Responsibilities
Design and develop end-to-end ML solutions — from data exploration and feature engineering to model training, validation, and deployment.
Collaborate cross-functionally with engineers, analysts, and product teams to integrate predictive and recommendation models into customer-facing and internal applications.
Implement scalable ML pipelines using Databricks, PySpark, and Delta Lake, ensuring reproducibility, performance, and maintainability.
Run controlled experiments (A/B tests, uplift modelling, causal inference) to measure model performance and quantify business impact.
Operationalise models through CI/CD and MLOps best practices, including model versioning, monitoring, retraining strategies, and governance.
Monitor production systems for drift, performance degradation, and anomalies, applying explainability and fairness techniques where needed.
Contribute to the development of feature stores and reusable data assets to accelerate experimentation and deployment cycles.
Stay current with emerging trends in ML, MLOps, and cloud data technologies to continuously improve model accuracy, scalability, and efficiency.
Requirements
5+ years of hands-on experience applying machine learning and statistical modelling in production or product-oriented environments.
Proven understanding of the full spectrum of ML techniques — from traditional models (linear/logistic regression, tree-based methods, ensemble learning) to modern deep learning architectures (CNNs, RNNs, transformers, graph neural networks, diffusion and foundation models).
Strong experience in Python, with proficiency in Scikit-learn, Autogluone, PyTorch or TensorFlow, and PySpark MLlib.
Demonstrated ability to design scalable ML pipelines and automate workflows with MLOps tools (MLflow, Kubeflow, Databricks ML runtime, AWS Sagemaker, or AWS Bedrock).
Familiarity with retrieval-augmented generation (RAG) and fine-tuning of large language models is a plus.
Proficiency in SQL and distributed data frameworks, with experience in feature engineering at scale.
Data Scientist at Kpler enhancing Gas and Power teams to aggregate data for future forecasts. Collaborating with engineers and product teams for model deployment and performance enhancement.
Data Scientist developing ML models and analyzing various data sources at Taikonauten GmbH. Contributing to user - centered product and project development in R&D team.
Lead AI and Data Scientist shaping impactful AI solutions in Madrid's EMEA Digital Innovation Hub. Collaborating globally to apply advanced machine learning techniques and foster innovation.
Senior Associate at PwC focusing on data analytics to drive insights and guide client strategies. Involves advanced techniques and collaboration on AI and GenAI solutions.
Data Scientist responsible for analyzing complex data sets and developing methods to create actionable insights. Collaborate with engineering teams to improve data quality and deliver business value.
Senior Director driving product development in data science for TransUnion. Leading initiatives in AI and analytics for the Specialized Risk portfolio.
Data & Analytics Lead at AstraZeneca driving data - driven solutions in clinical product development. Leading teams and collaborating with stakeholders across global platforms.
Mid - Level Engineering Data Scientist for Boeing's Global Services Analytics team. Creating analytics models and collaborating on health management solutions for KC - 46 platform.
AI expert managing predictive modeling and statistical validation for TEHORA. Integrating predictive models into API architecture and producing performance metrics.