We’re looking for a **Senior Data Engineer** to join our growing Data Platform team.
Design and scale the infrastructure and pipelines that power analytics, machine learning, and business intelligence across Sonatype.
Work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable.
Architect, build, and maintain the data infrastructure that powers our analytics and data science efforts.
Partner closely with Data Analysts, Data Scientists, and product teams.
Design scalable, reliable data pipelines, manage our Databricks and AWS environments, and ensure best practices for security, governance, and deployment.
Requirements
5+ years of hands-on experience in data engineering or a related role.
Expert proficiency in Apache Spark—both Scala and PySpark—for large-scale data processing.
Strong Python skills (including pandas) and practical experience with AWS’s boto3 SDK.
Deep familiarity with Databricks concepts: workspaces, clusters, jobs, Unity Catalog, and service principals.
Solid understanding of AWS data services: S3, Secrets Manager, SQS, IAM, and network/security configurations.
Nice to have experience with Terraform for defining and managing cloud infrastructure.
Proficient with GitHub for version control, code reviews, and CI/CD.
Excellent debugging and performance-tuning capabilities for distributed systems.
Strong communication and collaboration skills; able to translate technical solutions into business value.
Data Engineer Senior responsible for building data architecture and optimizing pipelines for Business Intelligence. Collaborating with analysts to develop insights using Power BI and Azure technologies.
Principal Data Engineer driving modernization from legacy systems to cloud - native platforms at Mastercard. Architecting and developing ETL platforms with AI integration and establishing data - driven strategies.
Principal Data Engineer modernizing cloud - native platforms for AI - powered solutions at Mastercard. Leading teams to enhance data processing efficiency and reliability across global operations.
Data Engineer creating data pipelines for Santander's card transactions. Collaborating with an agile team in strategic projects involving Databricks and PySpark.
Data Engineer designing, implementing, and maintaining data pipelines at Sabiá Gaming. Focused on high - quality data access and integration for enhanced decision - making.
Quantitative Data Engineer developing data solutions and automations for MassMutual's investment management. Working with data orchestration tools within a collaborative team environment.
Data Engineer developing architecture and pipelines for data analytics at NinjaTrader. Empowering analysts and improving business workflows through data - driven solutions.
Data Engineer joining Alterric to collaborate on data platform projects and analytics solutions. Working with Azure Cloud technologies to ensure data quality and integrity for informed decision - making.
Data Engineer at Kyndryl transforming raw data into actionable insights using ELK Stack. Responsible for developing, implementing, and maintaining data pipelines and processing workflows.