AI Engineer enhancing data processing with generative AI solutions. Working with the Azure ecosystem to automate and optimize data workflows.
Responsibilities
Design and implement GenAI-based solutions that optimize critical stages of the DDLC (Data Development Life Cycle).
Develop agents that act as assistants in creating Databricks notebooks, pipelines in ADF (Azure Data Factory), and structures in Microsoft Fabric.
Build workflows that integrate LLMs into our ecosystem (SQL Server and Power BI), automating tasks such as data documentation and creation of DAX measures.
Apply advanced prompt engineering techniques to ensure the AI generates optimized, secure, and high-performance PySpark and T-SQL code.
Define quality metrics to validate AI-generated artifacts (e.g., ensuring the efficiency of generated queries).
Ensure solutions align with Data Governance practices, security, and privacy.
Stay up to date on AI capabilities within Microsoft Fabric and Databricks to maximize the use of native tools versus custom developments.
Requirements
Degree in Computer Science, Data Engineering, Statistics, or a related field.
Strong experience in Data Engineering and a deep understanding of CI/CD pipelines applied to data.
Required experience with AI agent orchestration frameworks (e.g., LangChain, Semantic Kernel, or AutoGen).
Technical expertise in the Azure Data ecosystem (Databricks, Data Factory, and Fabric) and in SQL and Python/PySpark.
Hands-on experience building GenAI solutions (OpenAI API, Azure OpenAI, or open-source models) with a strong focus on prompt engineering is essential.
Ability to translate business challenges into agents that can assist in data modeling and dashboard creation.
DataOps mindset: understanding that data requires testing, versioning, and monitoring.
Benefits
Health and dental insurance;
Meal and food allowance;
Childcare assistance;
Extended parental leave;
Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) / TotalPass;
Profit Sharing (PLR);
Life insurance;
Continuous learning platform (CI&T University);
Discount club;
Free online platform dedicated to physical and mental health and wellbeing;
Full - Stack Data Engineer designing and optimizing complex data solutions for automotive content. Collaborating with teams to enhance user experience across MOTOR's product lines.
Principal Data Engineer designing and evolving enterprise data platform. Collaborating with analytics teams to enable AI and data products at American Tower.
BI Data Engineer II supporting scalable Lakehouse data pipelines at Boston Beer Company. Collaborating with stakeholders to drive data ingestion and maintain enterprise data quality.
Senior Data Engineer at A Kube Inc responsible for building and maintaining data pipelines for product performance. Collaborating with product, engineering, and analytics teams to ensure data quality and efficiency.
Data Engineer engineering DUAL Personal Lines’ strategic data platforms for global insurance group. Providing technical expertise in data engineering and collaborating with internal teams for solution delivery.
Data Engineer role focused on creating and monitoring data pipelines in an innovative energy company. Collaborate with IT and departments to ensure quality data availability in a hybrid work environment.
SQL Migration Data Engineer at Auxo Solutions focusing on Azure SQL/Fabric Lakehouse migrations and building data pipelines. Collaborating on technical designs and data governance for modernization initiatives.
Data Engineer developing cloud solutions and software tools on Microsoft Azure big data platform. Collaborating with various teams for data analysis and visualization in healthcare.