Data Engineer designing and implementing data pipelines and services for Ford Pro analytics. Working with diverse teams and technologies to drive data-driven solutions.
Responsibilities
Develop EL/ELT/ETL pipelines to make data available in the BigQuery analytical data store from disparate batch and streaming data sources for the Business Intelligence and Analytics teams (a minimal illustrative load sketch follows this list).
Work with on-prem data sources (Hadoop, SQL Server), understand the data model and business rules behind the data, and build data pipelines (with GCP, Informatica) for one or more Ford Pro verticals, landing the data in GCP BigQuery.
Build cloud-native services and APIs to support and expose data-driven solutions.
Partner closely with our data scientists to ensure the right data is made available in a timely manner to deliver compelling and insightful solutions.
Design, build and launch shared data services to be leveraged by the internal and external partner developer community.
Build out scalable data pipelines, choosing the right tool for the job. Manage, optimize, and monitor data pipelines.
Provide extensive technical and strategic advice and guidance to key stakeholders on data transformation efforts. Understand how data is useful to the enterprise.
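For illustration only, a minimal sketch of the kind of batch EL load into BigQuery described above, using the google-cloud-bigquery Python client; the project, bucket, dataset, and table names are placeholders rather than actual Ford Pro resources.

```python
from google.cloud import bigquery

# Placeholder identifiers -- substitute real project, bucket, and table names.
PROJECT_ID = "example-project"
TABLE_ID = f"{PROJECT_ID}.analytics.vehicle_telemetry"
SOURCE_URI = "gs://example-landing-bucket/telemetry/2024-01-01/*.csv"

client = bigquery.Client(project=PROJECT_ID)

# Configure a batch load job: CSV input, schema autodetection,
# and full-refresh semantics for the target table.
job_config = bigquery.LoadJobConfig(
    source_format=bigquery.SourceFormat.CSV,
    skip_leading_rows=1,
    autodetect=True,
    write_disposition=bigquery.WriteDisposition.WRITE_TRUNCATE,
)

load_job = client.load_table_from_uri(SOURCE_URI, TABLE_ID, job_config=job_config)
load_job.result()  # block until the load job finishes (raises on failure)

table = client.get_table(TABLE_ID)
print(f"Loaded {table.num_rows} rows into {TABLE_ID}")
```

In practice, orchestration, incremental loads, and monitoring would sit around a job like this, which is where tools such as Cloud Composer (listed in the requirements below) come in.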
Requirements
Bachelor's degree
3+ years of experience with SQL and Python
2+ years of experience with GCP or AWS cloud services; strong candidates with 5+ years in a traditional data warehouse environment (ETL pipelines with Informatica) will also be considered
3+ years of experience building out data pipelines from scratch in a highly distributed and fault-tolerant manner.
Comfortable with a broad array of relational and non-relational databases.
Proven track record of building applications in a data-focused role (Cloud and Traditional Data Warehouse)
Experience with GCP cloud services including BigQuery, Cloud Composer, Dataflow, CloudSQL, GCS, Cloud Functions, and Pub/Sub (a minimal Cloud Composer sketch follows this list).
Inquisitive, proactive, and interested in learning new tools and techniques.
Familiarity with big data and machine learning tools and platforms. Comfortable with open-source technologies including Apache Spark, Hadoop, and Kafka.
1+ year of experience with Hive, Spark, Scala, and JavaScript.
Strong oral, written and interpersonal communication skills
Comfortable working in a dynamic environment where problems are not always well-defined.
M.S. in a science-based program and/or quantitative discipline with a technical emphasis.
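As a hedged illustration of the Cloud Composer experience listed above, a minimal Airflow DAG that lands a daily GCS drop into BigQuery might look like the following sketch; the DAG id, bucket, and table names are hypothetical.

```python
from datetime import datetime

from airflow import DAG
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import GCSToBigQueryOperator

# Hypothetical daily ingestion DAG for a Cloud Composer (Airflow 2.x) environment.
with DAG(
    dag_id="daily_gcs_to_bigquery",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
    tags=["ingestion", "bigquery"],
) as dag:
    load_daily_extract = GCSToBigQueryOperator(
        task_id="load_daily_extract",
        bucket="example-landing-bucket",                 # placeholder landing bucket
        source_objects=["extracts/{{ ds }}/*.csv"],      # files partitioned by execution date
        destination_project_dataset_table="example-project.analytics.daily_extract",
        source_format="CSV",
        skip_leading_rows=1,
        write_disposition="WRITE_APPEND",                # append each day's partition
    )
```

A production pipeline would layer monitoring, retries, and data quality checks onto a DAG like this, in line with the responsibilities above.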