Data Scientist focusing on perception technologies and automated driving solutions for Woven by Toyota. Collaborating with cross-functional teams and utilizing large-scale data and machine learning techniques.
Responsibilities
Come up with data strategies on how to better leverage our labeled datasets to improve the performance of our auto-labeling pipeline (data sampling)
Develop and maintain data pipelines to monitor annotation quality metrics, model failure patterns, and dataset characteristics at scale using data warehouses and distributed processing tools
Collaborate with the ML team to diagnose model bottlenecks through statistical analysis and data slicing, identifying whether issues come from insufficient training samples, distribution shifts, or systematic biases in the annotation process
Participate in data-related activities within the team, across the company in collaboration with Toyota group, and enhancing team members' capability for data science
Requirements
2+ years experience in following industries or research areas:
・AD/ADAS
・Robotics
・Computer Vision
2+ years experience in data science or related areas, including theoretical aspects of data science like machine learning (deep learning, statistical analysis, and mathematical modeling)
Experience writing software using 1) Python for data science (numpy, scipy, scikit, pandas), 2) database, and 3) cloud platform services (AWS, GCP, Azure)
Bachelor's degree in science or engineering
Business-level proficiency in English
NICE TO HAVES
Master's degree or Ph.D. in related field
5+ years experience in data science or related areas
Hands-on experience in the following:
・Experience with computer vision datasets and annotation pipelines, particularly for autonomous systems or multi-camera setups, with a track record of identifying and resolving data quality issues
・Familiarity with active learning strategies and uncertainty quantification techniques to prioritize which samples need human review or re-annotation
・Proficiency with data visualization tools and statistical methods for large-scale dataset analysis, enabling quick identification of distribution shifts, labeling inconsistencies, or underrepresented scenarios in the training data
・Strong Python programming skills with experience using Git for version control and collaborative development in a team environment
・Hands-on experience with data infrastructure tools such as data warehouses, dbt for data transformation, and Spark for large-scale data processing
Business-level proficiency in Japanese (especially, smooth reading and listening)
Benefits
Competitive Salary - Based on experience
Work Hours - Flexible working time
Paid Holiday - 20 days per year (prorated)
Sick Leave - 6 days per year (prorated)
Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company
Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance
Housing Allowance
Retirement Benefits
Rental Cars Support
In-house Training Program (software study/language study)
Senior Associate at PwC focusing on data analytics to drive insights and guide client strategies. Involves advanced techniques and collaboration on AI and GenAI solutions.
Data Scientist responsible for analyzing complex data sets and developing methods to create actionable insights. Collaborate with engineering teams to improve data quality and deliver business value.
Senior Director driving product development in data science for TransUnion. Leading initiatives in AI and analytics for the Specialized Risk portfolio.
Data & Analytics Lead at AstraZeneca driving data - driven solutions in clinical product development. Leading teams and collaborating with stakeholders across global platforms.
Mid - Level Engineering Data Scientist for Boeing's Global Services Analytics team. Creating analytics models and collaborating on health management solutions for KC - 46 platform.
AI expert managing predictive modeling and statistical validation for TEHORA. Integrating predictive models into API architecture and producing performance metrics.
Head of Data Strategy leading and developing data initiatives for Zurich's GI Business. Focusing on data strategy, governance, and analytics while fostering collaboration across teams.
Medical Analyst analyzing engagement effectiveness with advanced analytics solutions aligned with Medical business strategies. Collaborating with cross - functional teams to provide insights for US Medical Affairs.
Research Fellow/Trainee in Women's Health using Data Science and Health Information Technology. Developing interdisciplinary research skills and methodologies focusing on health research.
Senior Data Scientist developing Asset Management Analytics for Queensland Rail. Contributing to organisational KPIs and enhancing asset performance through data analysis and modelling.