Collaborate closely with other data engineers, computational scientists, and researchers to make complex, multimodal data easily accessible for scientific discovery.
Maintain and enhance our AWS-based data platforms (Glue, Athena, S3) while evaluating and implementing new tools and approaches for data delivery.
Design, build, and optimize data pipelines that integrate diverse data sources into a scalable and secure data lake.
Continuously improve data architecture, automation, quality control, and testing processes.
Proactively troubleshoot, optimize, and modernize existing systems to ensure reliability and performance.
Contribute to best practices in data engineering, documentation, and cross-team knowledge sharing.
Assist with architecting solutions having scalability in mind to support future growth in data volume and complexity.
Provide technical mentorship (when needed) to data engineers and contribute to team development.
Requirements
Bachelor’s degree in Computer Science or a related technical field, or equivalent practical experience
6+ years of software development experience, including at least 3 years focused on data engineering
Proficiency with Python and experience working with data frames for transformation and analysis
Hands-on experience with relational (SQL) and NoSQL databases
Solid understanding of cloud platforms (preferably AWS) and ETL/ELT pipeline development
Familiarity with CI/CD for data workflows, Git, and infrastructure as code (e.g., Terraform, CloudFormation)
Strong communication skills and the ability to work effectively in cross-functional teams.
Benefits
Individual must successfully complete pre-employment process, which includes criminal background check, drug screening, credit check (applicable for certain positions) and reference verification
Equal opportunity employer, all qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender, gender identity, sexual orientation, age, status as a protected veteran, among other things, or status as a qualified individual with disability.
Principal Consulting AI / Data Engineer designing, building, and optimising data and AI solutions at DyFlex Solutions. Leading engagements with executives and mentoring teams in data engineering best practices.
Lead Data Architect at Davis Technology Management in Phoenix, AZ designing scalable data pipelines using Databricks. Collaborating with cross - functional teams and ensuring data quality.
Senior Data Governance SME leading enterprise data governance strategies. Implementing data governance frameworks and collaborating with technical teams for data quality.
Senior Associate Data Engineer contributing to Travelers' analytics landscape by building and operationalizing data solutions. Collaborating with teams to ensure reliable data delivery across the enterprise.
Salesforce Data Engineer serving as a subject matter expert in the State of Tennessee. Designing scalable data pipelines and collaborating on cross - agency initiatives.
Data Engineer Senior responsible for building data architecture and optimizing pipelines for Business Intelligence. Collaborating with analysts to develop insights using Power BI and Azure technologies.
Principal Data Engineer driving modernization from legacy systems to cloud - native platforms at Mastercard. Architecting and developing ETL platforms with AI integration and establishing data - driven strategies.
Principal Data Engineer modernizing cloud - native platforms for AI - powered solutions at Mastercard. Leading teams to enhance data processing efficiency and reliability across global operations.
Data Engineer creating data pipelines for Santander's card transactions. Collaborating with an agile team in strategic projects involving Databricks and PySpark.