Lead Data Scientist for Sahabat-AI, developing multilingual LLM tailored for Indonesia. Collaborate on AI innovation and deployment at GoTo Group in Singapore.
Responsibilities
Work with large-scale multilingual corpora, including text, audio, and image modalities
Build high-quality datasets for both continual pretraining, post-training (SFT, RLHF, DPO), and benchmark evaluation
Contribute to the training and scaling of multilingual LLMs – from continual pretraining to supervised fine-tuning and alignment.
Implement state-of-the-art methods and research for efficient and scalable operations.
Implement and improve safety alignment and guardrail systems to ensure responsible and culturally appropriate model behaviour.
Collaborate closely with business/product engineers to deploy production-grade LLM-powered solutions.
Stay current with advancements in AI technologies. Frontier models, training methodologies, etc
Requirements
7+ years of experience in deep learning, NLP, and LLM
Understanding in computer vision and voice will be a plus point
Proficient in data preprocessing, model training, evaluation, and optimisation.
Practical experience in applying deep learning to solve real business problems, with models successfully deployed and used in production environments.
Proficient with Python and deep learning frameworks such as PyTorch or Tensorflow.
Experience with cloud platforms like Alibaba Cloud, GCP or AWS.
Strong communication skills to understand business needs and effectively convey analytical solutions.
Ability to write clear and concise technical documentation.
A Master’s or PhD in Computer Science, Data Science, AI, or a related field.
Business Analyst acting as a critical link between business and technical teams at Vodafone. Involves gathering requirements and ensuring technical specifications in telecom projects.
Data Scientist developing statistical models and rules for Allegro's eCommerce platform. Driving insights and collaborating across teams to improve product catalog and selection.
Data Scientist developing statistical models and improving product quality for Allegro's eCommerce platform. Collaborating with cross - functional teams to deliver insights and drive business solutions.
Data Scientist at Imprint leveraging data analytics to enhance co - branded credit card offerings and optimize financial product decisions. Collaborating with cross - functional teams in a high - impact role.
Senior Data Scientist leveraging advanced analytical techniques to support Walmart's strategic objectives. Collaborating across functions, mentoring team members, and ensuring data governance standards.
Principal Data Scientist formulating strategies for AI/ML deployment in production at Walmart Global Tech. Leading projects involving state - of - the - art LLMs and collaborating with cross - functional teams.
Senior Clinical Data Manager providing project support and oversight for clinical data management at AstraZeneca. Ensuring data quality and regulatory compliance in clinical studies.
Senior Data Scientist developing data - driven insights for USAA's Life Company partners. Leveraging advanced analytics and machine learning to enhance business growth.
Data Science/Gen AI Specialist designing applications in NLP/LLM/GenAI for automotive use cases. Collaborating with global teams and working across multiple modalities and data sources.
Data Scientist assisting with prospective analysis and risk parameter modelling for Desjardins Group. Developing econometric models and projecting financial risks to support strategic management.