Lead Data Scientist for Sahabat-AI, developing multilingual LLM tailored for Indonesia. Collaborate on AI innovation and deployment at GoTo Group in Singapore.
Responsibilities
Work with large-scale multilingual corpora, including text, audio, and image modalities
Build high-quality datasets for both continual pretraining, post-training (SFT, RLHF, DPO), and benchmark evaluation
Contribute to the training and scaling of multilingual LLMs – from continual pretraining to supervised fine-tuning and alignment.
Implement state-of-the-art methods and research for efficient and scalable operations.
Implement and improve safety alignment and guardrail systems to ensure responsible and culturally appropriate model behaviour.
Collaborate closely with business/product engineers to deploy production-grade LLM-powered solutions.
Stay current with advancements in AI technologies. Frontier models, training methodologies, etc
Requirements
7+ years of experience in deep learning, NLP, and LLM
Understanding in computer vision and voice will be a plus point
Proficient in data preprocessing, model training, evaluation, and optimisation.
Practical experience in applying deep learning to solve real business problems, with models successfully deployed and used in production environments.
Proficient with Python and deep learning frameworks such as PyTorch or Tensorflow.
Experience with cloud platforms like Alibaba Cloud, GCP or AWS.
Strong communication skills to understand business needs and effectively convey analytical solutions.
Ability to write clear and concise technical documentation.
A Master’s or PhD in Computer Science, Data Science, AI, or a related field.
Data Scientist enhancing merchandising and inventory performance using machine learning techniques at Nordstrom. Collaborating with teams to develop data - driven solutions for better customer experiences.
Clinical Data Manager supporting research projects and managing patient databases at Mass General Brigham. Interact with patients and maintain data integrity in clinical research.
Technical Lead in Data Science guiding analytical systems design and team mentorship for a finance technology company. Collaborating across engineering and research to drive innovation and data excellence.
Technical Lead in Data Science at Voleon, driving the design and implementation of analytical systems. Mentoring a growing team of data scientists while ensuring methodological rigor in finance applications.
Data Scientist developing analytical solutions for customer data transformation and supporting critical business decisions. Focus on quality and integrity of data through statistical models and analysis.
Junior Data Scientist developing demand forecasting models for Nestlé’s Supply Chain team. Collaborating with analysts and planners to improve forecasting tools and processes.
HCM Data and Process Analytics Lead at ABB building business intelligence for HCM solutions. Analyzing data to drive improvements in performance and data quality across the organization.
Data Scientist owning and delivering production - grade data pipelines at Simbe Robotics. Collaborating with Product Management, Engineering, and Commercial teams to surface insights from retail data.
Data and Analytics Manager defining strategies for a digital transportation company connecting freight and truckers in Brazil. Leading initiatives to maximize data value and promote a culture of data - driven decision - making.
Mid - Level Data Scientist leveraging data science techniques for USAA's financial security solutions. Collaborating with various teams to develop advanced analytics and model deployment practices.