Design and implement a modular ETL pipeline on Databricks and enable parameterized, YAML-driven deployments using Databricks Asset Bundles.
Implement Spark performance optimizations and CI/CD to promote pipelines across environments.
Build a programmatic deployment and management layer for Databricks using the Databricks REST API to create/configure clusters, jobs, and notebooks dynamically and securely.
Architect and implement a secure, scalable file-ingestion API that provides validation, auto-renaming, manifest generation, and reliable transfer to cloud storage (with full traceability).
Requirements
Excellent academics in Computer Science, Engineering, or a related field.
Problem-solving is your jam, and you're all about critical thinking.
You're not afraid to roll up your sleeves and get stuff done, even when you're working on your own with minimal supervision.
You can juggle multiple projects like a pro.
Challenges don't scare you; in fact, you love diving into them.
You can communicate like a champ, whether it's writing reports or presenting in a room full of people.
You're curious, and you love picking up new skills & technologies.
You're a team player, always up for sharing your ideas and best practices.
Benefits
Great company culture.
"Learn and Share" sessions.
You'll get support from your mentors.
Social events and after-work gatherings.
A flexible and fun work environment.
Casual dress code.
You'll work with a cool team!
We respect your ideas, and we're all about trying new things.