Data Engineer II building pilot datasets and production-grade data platforms at GeoComply. Collaborating with product teams to deliver data-driven features for geolocation compliance.
Responsibilities
Build Pilot Datasets: Rapidly design and develop experimental data models and datasets to support pilot product features and validate hypotheses.
Bridge Business & Data: Collaborate closely with product managers to translate functional requirements into initial data schemas and logic.
Ad-Hoc to Self-Serve: Construct initial logic for ad-hoc data requests and evolve them into standardized, self-serve tools for the product team.
Productionize Pipelines: Take successful pilot datasets and transform them into robust, production-grade data pipelines. Refactor "pilot code" to follow best practices, ensuring high performance and data quality in the production environment.
Foundation Development: Build and maintain the internal libraries, services (Airflow), and Databricks jobs required to run these datasets at scale.
Project Management: Manage the lifecycle of data products from "epic-size" concepts through to delivery and maintenance.
Stakeholder Communication: Effectively communicate technical constraints and data insights to stakeholders during the transition from pilot to production.
Product Ideation: Actively contribute ideas on how data can drive new product benefits during weekly team calls.
Requirements
Four (4) years of relevant experience, with a focus on bridging database technology with product requirements.
You possess strong product thinking skills—the ability to analyze requirements, identify customer pain points, and build data solutions that contribute directly to the product vision.
Strong skills in Databricks, Spark/PySpark, and SparkSQL to manipulate heavy datasets during both the discovery and production phases.
Extensive experience with MySQL (for operational data) and NoSQL, with an understanding of how to model data for analytics vs. applications.
Proven ability to operate on epic-size projects, specifically managing the timeline from "proof of concept" to "delivered feature".
Strong interpersonal skills, adept at explaining complex data concepts to non-technical product stakeholders.
Familiarity with Git, Linux environments, and the Prom/Loki Stack for monitoring data health
Manager of Data Engineering leading data assets and infrastructure initiatives at CLA. Collaborating with teams to enforce data quality standards and drive integration efforts.
Data Engineer building modern Data Lake architecture on AWS and implementing scalable ETL/ELT pipelines. Collaborating across teams for analytics and reporting on gaming platforms.
Chief Data Engineer leading Scania’s Commercial Data Engineering team for growing sustainable transport solutions. Focused on data products and pipelines for BI, analytics, and AI.
Data Engineer designing and building scalable ETL/ELT pipelines for enterprise - grade analytics solutions. Collaborating with product teams to deliver high - quality, secure, and discoverable data.
Entry - Level Data Engineer at GM, focusing on building large scale data platforms in cloud environments. Collaborating with data engineers and scientists while migrating systems to cloud solutions.
Data Engineer responsible for data integrations with AWS technology stack for Adobe's Digital Experience. Collaborating with multiple teams to conceptualize solutions and improve data ecosystem.
People Data Architect designing and managing people data analytics for Gen, delivering actionable insights for HR. Collaborating across teams to enhance data - driven decision - making.
Data Engineer role focused on shaping future connectivity for customers at Vodafone. Involves solving complex challenges in a diverse and inclusive environment.
VP, Senior Data Engineer responsible for designing and developing cloud data solutions for insider risk in Information Security at SMBC. Collaborating with multiple teams to enhance cybersecurity data platform.
Data Engineer responsible for architecting, developing, and maintaining Allegiant’s enterprise data infrastructure. Overseeing transition to cloud hosted data warehouse and developing next - generation data tools.