Senior Data Engineer designing and scaling data infrastructure for analytics, machine learning, and business intelligence in a software supply chain security company.
Responsibilities
We’re looking for a **Senior Data Engineer** to join our growing Data Platform team.
Design and scale the infrastructure and pipelines that power analytics, machine learning, and business intelligence across Sonatype.
Work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable.
Architect, build, and maintain the data infrastructure that powers our analytics and data science efforts.
Partner closely with Data Analysts, Data Scientists, and product teams.
Design scalable, reliable data pipelines, manage our Databricks and AWS environments, and ensure best practices for security, governance, and deployment.
Requirements
5+ years of hands-on experience in data engineering or a related role.
Expert proficiency in Apache Spark—both Scala and PySpark—for large-scale data processing.
Strong Python skills (including pandas) and practical experience with AWS’s boto3 SDK.
Deep familiarity with Databricks concepts: workspaces, clusters, jobs, Unity Catalog, and service principals.
Solid understanding of AWS data services: S3, Secrets Manager, SQS, IAM, and network/security configurations.
Nice to have experience with Terraform for defining and managing cloud infrastructure.
Proficient with GitHub for version control, code reviews, and CI/CD.
Excellent debugging and performance-tuning capabilities for distributed systems.
Strong communication and collaboration skills; able to translate technical solutions into business value.
Data Engineer responsible for building ELT/ETL pipelines and supporting data governance practices at Daniels Health. Joining a mission - driven company innovating in healthcare waste management across multiple countries.
Data Engineer designing and optimizing Azure - based data platforms for enterprise analytics. Developing scalable data pipelines and enabling insights through Power BI and Azure Synapse Analytics.
Senior Software Engineer focused on ingestion pipeline at Fullstory. Engineering distributed systems for processing data at scale while collaborating with technical leaders.
Junior Data Engineer contributing to data solutions in home24's Martech team. Focus on data pipelines, analytical workflows, and machine learning model scaling with cross - functional collaboration.
Data Engineer at Onepoint developing cloud - native architectures and scalable data solutions. Collaborating on data processing pipelines and guiding clients on best practices.
Data Engineer at Onepoint contributing to client growth through cloud technologies. Involves data pipeline development, auditing cloud configurations, and supporting data science practices.
Senior Data Engineer / Data Scientist developing AI - driven solutions at GFT. Focus on scalable data pipelines, AI/ML models, and LLM technologies while collaborating with UK banking clients.
AI & BI Data Engineer developing analytics solutions to enhance genomic platforms at Corteva Agriscience. Focused on data pipelines, AI/ML model operation, and decision - making analytics.
Data Engineer developing cloud migration and data solutions for retail at Public Group. Engage in multiple projects and create growth opportunities in a hybrid team environment.