Transforming business strategies into high-performance solutions using Databricks. Leading data engineering, advanced analytics, and generative AI initiatives in a scalable architecture.
Responsibilities
Lakehouse Architecture: Design and evolve the enterprise platform on Databricks (Delta Lake), establishing ingestion patterns (batch/streaming), storage, and consumption standards.
Governance & Security: Implement governance frameworks using Unity Catalog, ensuring data quality, lineage, security (RBAC/ABAC), and compliance with the LGPD (Brazilian Data Protection Law).
AI & ML Support: Design infrastructure for the Machine Learning lifecycle (MLOps) and GenAI initiatives, supporting everything from feature engineering to LLM deployment.
Data Engineering: Define technical standards for complex pipelines, integrating critical systems such as SAP (S/4HANA) and legacy databases.
Technical Leadership: Act as an advisor for executive decisions and guide engineering teams in applying best practices for versioning and resilience.
Requirements
Proven track record as a Data Architect in complex environments.
Strong command of the Databricks ecosystem (Unity Catalog, Delta Lake, Jobs, Workflows).
Advanced experience with Azure Cloud.
Expertise in data modeling and high-performance SQL.
Clear understanding of scalable pipelines and governance/security patterns.
Ability to translate complex business needs into clear technical diagrams.
Comfortable collaborating across Security, Infrastructure, and Business teams.
Pragmatic innovation mindset: focused on continuous improvement with an emphasis on delivering organizational value.
Preferred qualifications:
Knowledge of MLOps (MLflow, monitoring, and retraining).
Experience with Data Mesh and domain-oriented architectures.
Experience in the Energy or Oil & Gas sectors.
Familiarity with SAP technologies (S/4HANA, Datasphere).
Benefits
Health and dental insurance
Meal and grocery allowance
Childcare allowance
Extended parental leave
Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) / TotalPass
Profit Sharing (PLR)
Life insurance
Continuous learning platform (CI&T University)
Discount club
Free online platform dedicated to promoting physical and mental health and wellbeing
Data Architect designing and maintaining enterprise data architecture at Envalior. Driving enterprise - wide impact ensuring scalability and reliability of systems, reporting, and AI initiatives.
Data Engineer role at Valmont focused on data analytics and technology for sustainable agricultural practices. Collaborating with cross - functional teams to enhance data management and analytics tools.
Senior Data Engineer at Barclays building and maintaining data pipelines and warehouses. Collaborating with data scientists and ensuring data accuracy, accessibility, and security.
Lead Data Engineer guiding a team in designing scalable data solutions for iKnowHow S.A. Overseeing development of data pipelines while collaborating with cross - functional teams.
Data Engineer at LPL Financial developing Python - based ETL pipelines. Collaborating with cross - functional teams to ensure reliable data delivery and optimizing pipeline performance.
Senior Data Engineer at Keyrus focusing on data solutions and projects to drive performance. Collaborating with teams globally to enhance data transformation and governance processes.
Data Engineer developing scalable data pipelines for ETL/ELT processes using GCP services. Collaborating with team members to optimize data workflows and ensure data integrity.
Data Governance Engineer in Fintech developing a formal cyber data governance framework. Collaborating with cyber security, analytics, and platform engineering teams on metadata and lineage capabilities.
Junior Data Engineer role at Allegro, focusing on developing ETL/ELT pipelines and processing large datasets. Collaborate with cross - functional teams for data quality and reporting.
Data Engineer at Concept Reply developing innovative data - driven solutions in IoT. Collaborating with teams to unlock the potential of data and cloud computing.