About the role

Data Scientist delivering data science projects for Samba TV. Working on knowledge graphs, audience modeling, and mentoring junior team members.

Responsibilities

Own end-to-end delivery of significant data science projects — from problem scoping and approach design through to production deployment, with a focus on knowledge graph and identity solutions
Make sound, independently-reasoned decisions on methodology, model selection, and evaluation; document them clearly in technical solution documents covering problem statement, approach, metrics, and timeline
Lead solution design for your own initiatives; break down complex epics into well-scoped user stories with clear acceptance criteria, adopting DataOps and MLOps best practices throughout — experiment tracking, pipeline orchestration, model monitoring, and reproducibility
Build production-quality Python and PySpark code on Databricks — well-tested, documented, and reusable — and implement advanced ML and AI-powered workflows including entity resolution, probabilistic record linkage, embedding-based matching, semantic similarity, and LLM-augmented pipelines
Develop and maintain reusable tools, libraries, and documentation that improve team efficiency and technical standards; conduct code reviews with constructive, specific feedback that raises the bar
Mentor junior data scientists on technical execution, code quality, and career development; lead internal talks or workshops on knowledge graphs, identity, or ML topics
Collaborate cross-functionally with product, engineering, and operations — translate business requirements into technical specifications, partner with data engineering on scalable pipeline design, and participate in cross-functional design reviews and working groups

Requirements

Bachelor's degree required in Statistics, Data Science, Computer Science, Mathematics or a related quantitative field; Master's strongly preferred
3–5 years of hands-on data science experience with demonstrated ability to own and deliver complex, multi-sprint projects independently
Advanced Python with production-quality code, testing, and documentation; strong SQL and PySpark for billion-row datasets
Databricks workflows, Delta Lake, and job orchestration; working knowledge of cloud platforms (AWS or GCP)
Solid command of core ML — regression, classification, clustering, model evaluation, and experimental design — applied to complex, high-volume data
Proficiency with MLOps practices: experiment tracking, pipeline orchestration (Airflow), and reproducible model deployment
Exposure to modern AI methodologies: RAG systems, LLM-augmented models, vector databases, and semantic search
Strong communicator — able to translate technical work into clear documentation, user stories, and cross-functional conversations
Demonstrated ability to mentor junior data scientists and contribute to team standards

Benefits

Equal opportunity employer
Inclusive environment
Employee empowerment

Hybrid Data Scientist – Knowledge Graph, Identity

at Samba TV

About the role

Responsibilities

Requirements

Benefits

Job title

Job type

Experience level

Salary

Degree requirement

Tech skills

Location requirements

Report this job

Similar roles

Data Scientist / Machine Learning Engineer

Zero to One Search | Recruitment Agency

Data Scientist, Fintech, Onchain Analytics

PulseRise Technologies

Senior Data Scientist, Member Analytics – Strategy

Navy Federal Credit Union

Senior Data Scientist

BTECH

Data Scientist

Ziphire HR

Cientista de Dados Júnior

AZ Tecnologia em Gestão

Data Scientist, Expérimenté

Klee Group

Senior Data Scientist

Keyrus

Data Manager

DEDIENNE AEROSPACE

Applied Data Scientist, Operations Analytics

Rowan Digital Infrastructure