Design, develop, and maintain robust data pipelines to support data ingestion, transformation, and storage to support Advanced Process Innovation program
Design and optimize data architecture (and potentially storage solutions) for efficient access and processing of large datasets from multiple data sources
Collaborate with data scientists to understand model requirements and ensure the availability of clean, structured data for analysis
Collaborate with IT department to design data governance and documentation protocol suited for Advanced Process Innovation program implementation
Implement data quality checks and monitoring processes to ensure the integrity and reliability of data
Support the deployment of AI/ML models by integrating them into production systems and workflows
Work with stakeholders to identify data needs and provide insights to enhance decision-making processes
Stay updated on industry trends and best practices in data engineering, AI, and machine learning
Requirements
Bachelor’s degree in Computer Science, Data Science, Engineering, or a related field
Minimum of 5 years of experience in Data Engineering or in a similar role
Experience within chemical or manufacturing industry is preferred
Strong knowledge of RESTful API design
Proficiency in programming languages such as Python or Java
Experience with collaborative tools like Azure DevOps
Experience with data visualization tools (i.e., Tableau, PowerBI)
Experience with data processing frameworks and ETL tools (i.e., Stitch, Alteryx, Dataiku, etc)
Strong knowledge of SQL and database management systems (e.g., PostgreSQL, MySQL)
Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and data warehousing solutions (e.g. Databricks)
Understanding of machine learning concepts and experience with ML frameworks (e.g., TensorFlow, PyTorch) is a plus
Familiarity with engineering/process control related software (ASPEN, PACE, etc) is a plus
Excellent communication skills, intercultural competence and team spirit
Ability to present and suggest sound data-centric recommendations based on analytical thinking
Benefits
performance-based remuneration
occupational health benefits
hybrid and flexible working environments with #SmartWork
Data Engineer at Kyndryl transforming raw data into actionable insights using ELK Stack. Responsible for developing, implementing, and maintaining data pipelines and processing workflows.
Senior Data Engineer at Clorox developing cloud - based data solutions. Leading data engineering projects and collaborating with business stakeholders to optimize data flows.
Data Engineer building solutions on AWS for high - performance data processing. Leading initiatives in data architecture and analytics for operational support.
Senior Data Engineer overseeing Databricks platform integrity, optimizing data practices for efficient usage. Leading teams on compliance while mentoring a junior Data Engineer.
Associate Data Engineer contributing to software applications development and maintenance using Python. Collaborating with teams for clean coding and debugging practices in Pune, India.
Lead Data Engineer responsible for delivering scalable cloud - based data solutions and managing cross - functional teams. Collaborating with global stakeholders and ensuring high - quality project execution in a fast - paced environment.
Data Engineer focusing on development and optimization of data pipelines in an insurance context. Ensuring data integrity and supporting data - driven decision - making processes.
Data Engineer designing and implementing data pipelines and services for Ford Pro analytics. Working with diverse teams and technologies to drive data - driven solutions.
Full Stack Data Engineer on a Central Engineering Portfolio Team in Chennai delivering curated data products and collaborating with data engineers and product owners.
Data Engineer developing best - in - class data platforms for ClearBank with a focus on data insights and automation. Collaborating closely with stakeholders and supporting data science initiatives.