Data Engineer responsible for developing data solutions and integrating systems for advanced analytics at Lilly. Focusing on data pipelines and solutions ensuring data quality and compliance.
Responsibilities
Engage with business stakeholders to design, develop, and maintain the data pipelines and data solutions that ensure the availability and quality of data sets and actionable insights for the Foundry
Includes data capture, integration, acquisition, contextualization, and harmonization, leading to the delivery of data-as-a-product and reusable data domains and products
Focus on integrating IT/OT systems with cloud data lakehouse architecture (AWS/Azure) to enable advanced analytics and AI/ML capabilities while ensuring data integrity and compliance with relevant regulatory standards and best practices
Work closely with the Data Architect and Data Scientists
Collaborate with business and IT groups beyond the data sphere, understanding the enterprise infrastructure and the many source systems
Requirements
Bachelor’s degree in Computer Science, Data Science, Engineering or related field or equivalent work experience
At least 3 years of experience in several of the following disciplines: statistical methods, data modeling, ETL/ELT, ontology development, semantic graph construction and linked data, relational schema design
1-3 years of experience designing large scale data models for functional, operational, and analytical environments (Conceptual, Logical, Physical & Dimensional)
Demonstrated SQL and data modeling proficiency
Experience with data modeling tools such as, ER*Studio and Erwin or TOAD
Experience with cloud platforms (e.g., AWS, Azure)
Experience with AI/ML/LLM Concepts and tools and building agentic AI solution sets
Experience with data integration such as data streaming, Industrial IOT, using MQTT, AQMP, Kafka and related protocols
Understanding of modern data architecture, data lakehouse, data warehousing and/or big data concepts
Experience with security models and development on large data sets
Experience with multiple database solutions (e.g. Postgres, Redshift, Aurora, Athena, Graph DB like Neptune, No SQL like DynamoDB, MongoDB) and formal database designs (3NF, Dimensional Models)
Experience with Agile Development, CI/CD, Github, Automation platforms
Demonstrated ability to analyze large, complex data domains and craft practical solutions for subsequent data exploitation via analytics
Ability to review and provide practical recommendations on design patterns, performance considerations & optimization, database versions, and database deployment strategies
Knowledgeable in data functions such as Data Governance, Master Data Management, Business Intelligence
Prior work experience working in pharma or other GMP setting
Solid knowledge of Computer System Validation process
Demonstrated ability to analyze, anticipate, and resolve complex issues through sound problem-solving skills
Demonstrated learning agility and curiosity
Desire and ability to communicate using a variety of methods in diverse forums.
Benefits
eligibility to participate in a company-sponsored 401(k)
pension
vacation benefits
eligibility for medical, dental, vision and prescription drug benefits
flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts)
life insurance and death benefits
certain time off and leave of absence benefits
well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities)
Junior Data Engineer assisting with data - driven use - cases in the payment sector. Contributing to the establishment of a central data platform at S - Payment.
Senior Data Engineer leading tailored data - driven solutions delivery for public sector clients. Involves data transformation projects and building AI - powered tools for decision making.
Technical Lead in Data Engineering at Intentsify, building scalable applications for B2B marketing solutions. Leading a small team and making key technological decisions.
Data Engineer developing scalable data pipelines for RunBuggy's automotive logistics platform. Collaborate with cross - functional teams to unlock powerful insights and optimize data infrastructure.
Working Student in Data Engineering supporting the development of an energy management app's data backbone across Europe. Collaborate with diverse teams to ensure data quality and optimization.
Senior Data Engineer at Minsait responsible for designing and maintaining data infrastructure. Ensuring efficient and secure data collection, storage, and processing across various sectors.
Senior Data Engineer developing and maintaining scalable data pipelines at Quality Digital. Ensuring data quality, security, and compliance with best practices while collaborating with data teams.
AI Data Engineer at Convatec designing and deploying data and AI workflows. Collaborating with AI Engineers and Data Scientists to maintain data pipelines and support analytics.
Data Engineer designing and developing data pipelines and infrastructure for processing and analyzing large data volumes at CIAL. Collaborating with data scientists to meet data requirements.
Data Engineer building and optimising data infrastructure at LifeByte. Collaborating with data scientists and analysts to improve data processing efficiency.