Data Engineer ETL responsible for AI model data cleaning and development. Leading projects in a hybrid work environment in Singapore.
Responsibilities
1. Responsible for data cleaning (ETL) and data warehouse construction to support large-scale AI models.
2. Responsible for training and fine-tuning large AI models to meet the requirements of specific business scenarios.
3. Responsible for developing supporting tools, such as dashboards and general business logic, to ensure the practicality of AI model applications.
4. Must have hands-on development experience and be able to lead a team or independently complete projects related to data collection and development.
Requirements
1. A degree in computer science or a related field is preferred. Must be familiar with professional knowledge in machine learning, deep learning, and natural language processing, with at least 1 year of experience in GPT or Gemini application development, and proficient in deep learning frameworks such as PyTorch or TensorFlow.
2. Familiar with models such as Transformer, BERT, GPT, and fine-tuning algorithms like LoRA, with experience in fine-tuning models.
3. Must have Java programming experience.
4. Experience in backend Java development for data engineering use cases, particularly real-time processing with Apache Flink.
5. Must have experience in data warehouse development and construction, such as using Flink and building ETL data cleaning pipelines.
6. Experience with large model pre-training and practical application in business scenarios is a plus.
7. Must have hands-on experience in setting up large models based on open-source frameworks.
8. Experience in conversational AI, marketing content generation, or machine translation is preferred.
9. Priority will be given to candidates with hands-on experience in Google Cloud Platform (GCP), particularly those with experience in BigQuery.
Benefits
1. Lead community-building for Southeast Asia's largest parenting ecosystem
2. Be at the forefront of connecting brands with real parents in authentic and impactful ways.
3. Work with a passionate team driving innovation in the parenting space.
4. Regional exposure across three of SSEA's most dynamic markets.
EU Commercial Data Engineer developing scalable data solutions for Genmab’s commercial teams. Collaborating with cross - functional teams to enhance business insights and decision making through reliable data.
Principal Data Engineer designing and developing innovative data analytical solutions for the gaming industry. Leading and mentoring while engaging with clients to fulfill their data engineering needs.
Specialist, Data Engineering at CoverMyMeds enhancing and expanding data platforms for commercial data products. Collaborating with multiple teams to design scalable data solutions from various sources.
Team Lead in Data Engineering at Avanquest mentoring data engineering team and ensuring efficient data management across platforms. Collaborating with departments to align solutions and optimize workflows.
Data Architect at RSM leading AI - driven data migration initiatives within Salesforce ecosystem. Implementing data governance and optimizing performance across complex datasets.
Senior Data Engineer at Capgemini designing and optimizing scalable data architectures on Databricks and GCP. Collaborating across teams to transform business needs into reliable technical solutions.
Data Engineer transforming legacy on - premises systems to cloud - native architectures for advanced data analytics. Collaborating with teams to build efficient data solutions using Python and AWS.
Data Engineering Academy focused on Snowflake and Databricks for professionals interested in expanding their technical capabilities. Fully remote with future office work in Monterrey or Saltillo after completion.
Senior Data Engineer at Intent HQ designing and scaling data platforms. Building high - impact intelligence from millions of customer insights with a focus on performance and reliability.
SAP Data Engineer supporting MERKUR GROUP's evolution into a data - driven company. Responsible for data integration, modeling, and collaboration with various departments in Group Finance.