Architect, build, and maintain scalable, reliable data pipelines (batch & streaming) to ingest, transform, and deliver data for analytics, reporting, and ML use cases.
Architect software applications, test, and build automated tools.
Translate complex functional and technical requirements into architecture designs and high-performing software solutions.
Select appropriate data solution software and define hardware requirements to support performance and scalability.
Develop and implement standards and processes for data integration projects and initiatives.
Lead the design and development of software applications, testing, and building tools.
Optimize SQL queries (joins, window functions, aggregations, partitioning, indexing) and data schema performance.
Design data models, schemas, and data warehouses/data lakes (dimensional, star, snowflake schemas, normalization/denormalization).
Ensure data quality, correctness, and consistency across datasets (validation, anomaly detection, reconciliation).
Ensure database changes are reviewed and approved according to standards.
Monitor, troubleshoot, and tune performance of pipelines, databases, and workloads.
Drive adoption of engineering best practices: version control, CI/CD, testing (unit and integration for data pipelines), documentation, and code reviews.
Collaborate with software engineers to integrate data systems into production environments.
Provide technical assistance to junior members and to colleagues across the company.
Mentor and coach junior and mid-level engineers, promoting engineering discipline across the team.
Evaluate and propose new tools, frameworks, and technologies for the data platform.
Ensure data security, governance, access control, lineage, and compliance (e.g., GDPR, CCPA, internal standards).
Requirements
Bachelor’s degree in Computer Science or a related technical discipline (Master’s preferred)
5+ years of professional experience in data engineering, software engineering, or data science
Expert-level SQL, including query optimization, advanced joins, windowing, partitioning, and indexing
Proven expertise in Snowflake for data warehousing and advanced analytics
Strong background in data modeling, data engineering best practices, and distributed systems (e.g., Spark, Hadoop, Hive, Presto)
Hands-on experience designing and maintaining ETL/ELT pipelines, data integration (APIs, event streams, logs), and workflow orchestration (Airflow or Astronomer required)
Proficiency with modern data stack tools, including DBT for transformation and modeling
Experience with AWS cloud services for data engineering and infrastructure management
Strong software engineering skills, including clean code practices, modularization, error handling, logging, CI/CD, and automated testing
Knowledge of object-oriented design, data structures, algorithms, and disaster recovery strategies for data systems
Skilled in scalability and performance optimization across pipelines, databases, and workloads
Familiarity with Python (preferred) and other modern programming languages
Demonstrated ability to analyze complex data sets, identify trends, and derive actionable insights
Effective collaborator with cross-functional teams and proven success in mentoring junior engineers
Experience with agile or rapid application development methodologies
Highly creative, detail-oriented, and results-driven with strong problem-solving and prioritization skills
Principal Consulting AI / Data Engineer designing, building, and optimising data and AI solutions at DyFlex Solutions. Leading engagements with executives and mentoring teams in data engineering best practices.
Lead Data Architect at Davis Technology Management in Phoenix, AZ designing scalable data pipelines using Databricks. Collaborating with cross - functional teams and ensuring data quality.
Senior Data Governance SME leading enterprise data governance strategies. Implementing data governance frameworks and collaborating with technical teams for data quality.
Senior Associate Data Engineer contributing to Travelers' analytics landscape by building and operationalizing data solutions. Collaborating with teams to ensure reliable data delivery across the enterprise.
Salesforce Data Engineer serving as a subject matter expert in the State of Tennessee. Designing scalable data pipelines and collaborating on cross - agency initiatives.
Data Engineer Senior responsible for building data architecture and optimizing pipelines for Business Intelligence. Collaborating with analysts to develop insights using Power BI and Azure technologies.
Principal Data Engineer driving modernization from legacy systems to cloud - native platforms at Mastercard. Architecting and developing ETL platforms with AI integration and establishing data - driven strategies.
Principal Data Engineer modernizing cloud - native platforms for AI - powered solutions at Mastercard. Leading teams to enhance data processing efficiency and reliability across global operations.
Data Engineer creating data pipelines for Santander's card transactions. Collaborating with an agile team in strategic projects involving Databricks and PySpark.