Perception Data Engineer at ANYbotics building data pipelines for perception models in mobile robotics. Collaborating within a global team on cutting-edge robotic technology.
Responsibilities
Build and operate the data plumbing that our perception models need: ingestion, versioned storage, ETL, labeling integration, and reliable production pipelines for training and inference.
Design, build and maintain scalable data pipelines and ETL workflows that ingest raw images, sensor metadata, and labels (both real and synthetic).
Implement dataset versioning, schema management, and reproducible data snapshots to support experiments and audits.
Integrate annotation tools (CVAT / Label Studio), manage labeling workflows and quality-control tooling, and support label QA processes.
Build data validation and monitoring checks (file integrity, label sanity, distribution drift alerts) and automate remediation where possible.
Provide clean, ready-to-use datasets and data loaders for ML engineers; optimize data access patterns for training (sharding, caching, prefetching).
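To give a flavor of the validation and sharding work listed above, here is a minimal Python sketch. It is illustrative only and does not describe ANYbotics' actual pipeline. All function names (sha256_of, shard_for, label_is_sane, class_drift) are hypothetical. The bounding-box layout (x_min, y_min, x_max, y_max) and the use of total variation distance as a drift measure are assumptions chosen for simplicity.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    """Return the hex SHA-256 digest, usable as a file-integrity fingerprint."""
    return hashlib.sha256(data).hexdigest()


def shard_for(sample_id: str, num_shards: int) -> int:
    """Map a sample ID to a stable shard index (same ID, same shard, every run)."""
    digest = hashlib.sha256(sample_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def label_is_sane(box, width, height) -> bool:
    """Check that an (x_min, y_min, x_max, y_max) box is non-empty and inside the image."""
    x_min, y_min, x_max, y_max = box
    return 0 <= x_min < x_max <= width and 0 <= y_min < y_max <= height


def class_drift(reference, current) -> float:
    """Total variation distance between two non-empty class-label lists.

    0.0 means identical class frequencies, 1.0 means no classes in common.
    """
    ref, cur = Counter(reference), Counter(current)
    n_ref, n_cur = sum(ref.values()), sum(cur.values())
    classes = set(ref) | set(cur)
    return 0.5 * sum(abs(ref[c] / n_ref - cur[c] / n_cur) for c in classes)
```

In practice a check like class_drift would be compared against a threshold tuned per dataset, and hashing the sample ID (rather than using Python's built-in hash) keeps shard assignment reproducible across processes and machines.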
Requirements
3+ years of engineering experience building production data pipelines or ETL systems.
Strong Python scripting and engineering skills (pandas, pyarrow, boto3 or equivalent).
Experience with dataset versioning or large-file management (DVC, Git-LFS, or similar) and cloud object storage (S3).
Familiarity with annotation tooling and workflows for image data (CVAT / Label Studio).
Basic understanding of ML training data needs (batching, sharding, augmentation integration).
Prior work supporting computer-vision teams (image pipelines, preprocessing, TFRecord or custom dataset formats).
Program Manager leading enterprise-wide data migration efforts for Boeing's transition to modern data platforms. Overseeing complex processes to ensure secure and effective migrations across multiple systems.
Senior Data Engineer responsible for developing data products for Disney's immersive digital experiences. Collaborating with teams to ensure data quality and operational efficiency in a fast-paced environment.
Senior Data Engineer crafting and developing data products for analytical insights at Zendesk. Collaborating in an Agile environment with a focus on data warehousing and process optimization.
Senior Data Engineer designing and building data warehouse solutions with Snowflake for a fintech company. Collaborating with cross-functional teams to facilitate data insights and analytics.
Data Engineer developing and maintaining data pipelines and applications at EvidenceCare. Collaborating across teams to generate actionable insights from healthcare data for better decision-making.
Data Engineer managing and expanding an enterprise business intelligence and data platform. Focusing on Tableau development and administration with a strong engineering background.
Lead Data Engineer overseeing engineers and advancing the data platform at American Family Insurance. Creating tools and infrastructure to empower teams across the company.
Data Architect designing end-to-end Snowflake data solutions and collaborating with technical stakeholders at Emerson. Supporting the realization of Data and Digitalization Strategy.
Manager of Data Engineering leading data assets and infrastructure initiatives at CLA. Collaborating with teams to enforce data quality standards and drive integration efforts.