Data Scientist delivering data science projects for media companies at Samba TV in Warsaw. Collaborating with engineering and mentoring junior data scientists on the team.
Responsibilities
Own end-to-end delivery of significant data science projects — from problem scoping and approach design through to production deployment, with a focus on knowledge graph and identity solutions
Make sound, independently-reasoned decisions on methodology, model selection, and evaluation; document them clearly in technical solution documents covering problem statement, approach, metrics, and timeline
Build production-quality Python and PySpark code on Databricks — well-tested, documented, and reusable — and implement advanced ML and AI-powered workflows including entity resolution, probabilistic record linkage, embedding-based matching, semantic similarity, and LLM-augmented pipelines
Lead solution design for your own initiatives; break down complex epics into well-scoped user stories with clear acceptance criteria, adopting DataOps and MLOps best practices throughout (experiment tracking, pipeline orchestration, model monitoring, reproducibility)
Develop and maintain reusable tools, libraries, and documentation that improve team efficiency and technical standards; conduct code reviews with constructive, specific feedback that raises the bar
Mentor junior data scientists on technical execution, code quality, and career development; lead internal talks or workshops on knowledge graphs, identity, or ML topics
Collaborate cross-functionally with product, engineering, and operations — translate business requirements into technical specifications and partner with data engineering on scalable pipeline design; participate in cross-functional design reviews and working groups
Requirements
Bachelor's degree required in Statistics, Data Science, Computer Science, Mathematics or related field; Master's preferred
5-7 years of hands-on data science experience with 1-2+ years in a direct people-management or team-lead role with demonstrated ability to develop, retain, and hire data scientists
Solid command of core statistics and ML - hypothesis testing, probability, regression, classification, clustering, model evaluation, and experimental design
Strong Python (pandas, NumPy, PySpark, scikit-learn) and SQL; Databricks or similar platform experience essential
Familiarity with MLOps practices: experiment tracking, pipeline orchestration (Airflow), reproducible model deployment
Detail-oriented and proactive in anticipating delivery risks
Comfortable running Agile ceremonies and maintaining consistent sprint cadence across a distributed team
Strong communicator - able to give direct, constructive feedback and advocate for your team to key stakeholders.
Benefits
Samba TV is an equal opportunity employer.
We celebrate diversity and are committed to creating an inclusive environment for all employees.
We strive to empower connection with one another, reflect the communities we serve, and tackle meaningful projects that make a real impact.
Customer Data Lead overseeing client - specific data operations for CUBE's regulatory data services. Ensuring best - in - class data services across North America.
Advanced Data Scientist developing in - house and third - party analytical systems at Honeywell. Leverage AI - ML models and data mining techniques to enhance security insights.
Head of Analytics leading strategies for Customer Care and Shipping Analytics at Back Market. Focused on elevating customer satisfaction and operational efficiency through data - driven insights.
Data Scientist at Assembly building and automating media intelligence models. Collaborating with consultancy leadership to define analytical approaches to business challenges.
Senior Data Scientist focusing on ML systems for Walmart's Trust and Safety. Collaborating on compliance models and overseeing the full model lifecycle.
Head of Data leading development and execution of data strategy at Verity. Mentoring a team to deliver insights and drive business growth while collaborating with multiple departments.
Senior Data Scientist responsible for credit modeling at Clair, utilizing machine learning to assess risk and optimize decisions. Collaborating with cross - functional teams and deploying models in production environments.
Senior Product Manager responsible for transforming user needs into scalable products for Seyna. Collaborating with internal teams to enhance insurance programs and streamline workflows.
Founding AI Data Scientist helping Grand build core AI and data systems for decision - making. Collaborating with teams to leverage complex data for impactful AI solutions.
Data Scientist responsible for AI model development and deployment across client projects. Building reliable AI applications ensuring client success and operational excellence.