Senior Data Engineer designing and scaling data infrastructure for analytics, machine learning, and business intelligence in a software supply chain security company.
Responsibilities
We’re looking for a **Senior Data Engineer** to join our growing Data Platform team.
Design and scale the infrastructure and pipelines that power analytics, machine learning, and business intelligence across Sonatype.
Work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable.
Architect, build, and maintain the data infrastructure that powers our analytics and data science efforts.
Partner closely with Data Analysts, Data Scientists, and product teams.
Design scalable, reliable data pipelines, manage our Databricks and AWS environments, and ensure best practices for security, governance, and deployment.
Requirements
5+ years of hands-on experience in data engineering or a related role.
Expert proficiency in Apache Spark—both Scala and PySpark—for large-scale data processing.
Strong Python skills (including pandas) and practical experience with AWS’s boto3 SDK.
Deep familiarity with Databricks concepts: workspaces, clusters, jobs, Unity Catalog, and service principals.
Solid understanding of AWS data services: S3, Secrets Manager, SQS, IAM, and network/security configurations.
Nice to have experience with Terraform for defining and managing cloud infrastructure.
Proficient with GitHub for version control, code reviews, and CI/CD.
Excellent debugging and performance-tuning capabilities for distributed systems.
Strong communication and collaboration skills; able to translate technical solutions into business value.
Data Engineer at Equinix implementing data architecture solutions for scalability and analytics. Collaborating with teams to design data pipelines and maintain data models for business objectives.
Data Warehouse Architect developing and optimizing robust data warehouse environments on SAP BW/4HANA. Critical for enabling advanced analytics and reporting across the organization.
Data Engineering Manager leading a new Data Engineering team in Bengaluru. Shaping the design and scaling of core data engineering practices across the organization.
Sr. ETL/Data Warehouse Lead at Huntington designing, developing, and supporting ETL and Data Warehousing framework. Analyzing systems based on specifications and providing technical assistance.
Senior Google Data Architect designing and delivering scalable data solutions on Google Cloud Platform. Collaborating across teams to shape target - state data architectures and influence enterprise data strategy.
Data Engineer developing scalable data lake solutions and optimizing data pipelines at U.S. Bank. Collaborating with teams to manage data governance and cloud migration activities.
Lead AI, MLOps & Data Engineer at WedR, guiding complex data projects and AI innovation. Collaborate with diverse experts in a Product Studio for digital transformations.
Lead Azure Databricks Data Engineer implementing data solutions for data engineering projects at Ryan Specialty. Collaborating with stakeholders and mentoring junior staff on data pipelines and ETL processes.
Lead Azure Databricks Data Engineer at Ryan Specialty focused on implementing data solutions and collaborating with cross - functional teams to enhance data architecture.
Senior Data Engineer designing and implementing sustainable data solutions for diverse clients. Collaborating closely with stakeholders to enhance data services and platforms in a hybrid environment.