Perform data analysis using Pandas/Spark packages in Python
Follow and complete stories/tasks in an Agile Scrum project
Design, implement, and document CI/CD Pipelines in GitHub
Coordinate and validate data flow/delivery with data engineering groups
Administration of analytics and data visualization tools/environments in Databricks
Perform code review, versioning, and production deployment
Create visualizations in a knowledge graph tool for delivery to stakeholders
Implement business concepts using ontology
Understand context of technical data and how to appropriately normalize data into business metrics
Collaborate with data governance team to implement a data dictionary
Automate analyses and data cleaning procedures via administration of workflows
Research machine learning libraries and algorithms for different analytics tasks
Implement machine learning experiments and develop new algorithms
Statistical analysis to quantify completeness and validity of a data source
Data organization and maintenance in a cloud environment
Requirements
Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and minimum 8 years prior relevant experience or an Advanced Degree in a related field and minimum 5 years of experience
Experience in Cloud solutions
Communication and presentation skills in English (verbal and written).
Degree in Software Engineering, or Computer Science
Data, analytics, and reporting experience; analytical and conceptual thinking skills
Python coding experience using Pandas and Spark
Aerospace industry experience
Mentoring/Coaching experience
SAP knowledge and experience
Proficient in Agile methodologies
Experience using JIRA/ Confluence
Experience with Databricks, GitHub, AWS S3, Azure Blob Storage
Experience in Cloud solution, design, and implementation
Ability to influence stakeholders and to determine acceptable solutions
Experience navigating competing priorities, achieving the highest level of engagement, and delivering successful outcomes in critical situations
Collaborative team player who is driven to take initiative and ownership of data, analytics, and reporting, while valuing clarity, consistency and continuous improvement
Candidates should be able to work within cross-functional teams and effectively represent solutions with senior-level leadership
Benefits
Medical, dental, and vision insurance
Three weeks of vacation for newly hired employees
Generous 401(k) plan that includes employer matching funds
Participation in the Employee Scholar Program (ESP)
Life insurance and disability coverage
Employee Assistance Plan, including up to 8 free counseling sessions.
Principal Consulting AI / Data Engineer designing, building, and optimising data and AI solutions at DyFlex Solutions. Leading engagements with executives and mentoring teams in data engineering best practices.
Lead Data Architect at Davis Technology Management in Phoenix, AZ designing scalable data pipelines using Databricks. Collaborating with cross - functional teams and ensuring data quality.
Senior Data Governance SME leading enterprise data governance strategies. Implementing data governance frameworks and collaborating with technical teams for data quality.
Senior Associate Data Engineer contributing to Travelers' analytics landscape by building and operationalizing data solutions. Collaborating with teams to ensure reliable data delivery across the enterprise.
Salesforce Data Engineer serving as a subject matter expert in the State of Tennessee. Designing scalable data pipelines and collaborating on cross - agency initiatives.
Data Engineer Senior responsible for building data architecture and optimizing pipelines for Business Intelligence. Collaborating with analysts to develop insights using Power BI and Azure technologies.
Principal Data Engineer driving modernization from legacy systems to cloud - native platforms at Mastercard. Architecting and developing ETL platforms with AI integration and establishing data - driven strategies.
Principal Data Engineer modernizing cloud - native platforms for AI - powered solutions at Mastercard. Leading teams to enhance data processing efficiency and reliability across global operations.
Data Engineer creating data pipelines for Santander's card transactions. Collaborating with an agile team in strategic projects involving Databricks and PySpark.