Perception Data Engineer at ANYbotics building data pipelines for perception models in mobile robotics. Collaborating within a global team on cutting-edge robotic technology.
Responsibilities
Build and operate the data plumbing that our perception models need: ingestion, versioned storage, ETL, labeling integration, and reliable production pipelines for training and inference.
Design, build and maintain scalable data pipelines and ETL workflows that ingest raw images, sensor metadata, and labels (both real and synthetic).
Implement dataset versioning, schema management, and reproducible data snapshots to support experiments and audits.
Integrate annotation tools (CVAT / Label Studio), manage labeling workflows and quality-control tooling, and support label QA processes.
Build data validation and monitoring checks (file integrity, label sanity, distribution drift alerts) and automate remediation where possible.
Provide clean, ready-to-use datasets and data loaders for ML engineers; optimize data access patterns for training (sharding, caching, prefetching).
Requirements
3+ years of engineering experience building production data pipelines or ETL systems.
Strong Python scripting and engineering skills (pandas, pyarrow, boto3 or equivalent).
Experience with dataset versioning or large-file management (DVC, Git LFS, or similar) and cloud object storage (S3).
Familiarity with annotation tooling and workflows for image data (CVAT / Label Studio).
Basic understanding of ML training data needs (batching, sharding, augmentation integration).
Prior work supporting computer-vision teams (image pipelines, preprocessing, TFRecord or custom dataset formats).
Senior Data Engineer designing and optimizing data platforms for clients using Microsoft Azure, Microsoft Fabric, Power BI, and Databricks. Working closely with clients to deliver scalable solutions.
Data Engineer providing technical expertise on the mission-critical NAVSUP OIS program. Work involves data architecture and database management in AWS GovCloud environments.
Senior Data Engineer focusing on data infrastructure for an AI-driven insurtech startup based in Nepal. Collaborating with teams to optimize data models and maintain data quality.
Senior Professional Consultant leading architecture and design for SAP BW and SAC solutions at Freudenberg. Collaborating with stakeholders and optimizing performance of data landscapes.
Senior Data Engineer designing and managing data architectures to transform large-scale data into insights for Humana. Involves leading technical discussions and implementing best data practices.
Data Engineer II at Early Warning Services developing data science tools and infrastructure. Collaborating on software enhancements and mentoring interns in a hybrid work environment.
Senior Data Architect responsible for optimizing data architecture and supporting data-driven business decisions at TruStage. Leading technical guidance for data architecture and cross-functional team collaboration.
Senior Data Architect developing data architecture plans at The Hartford, collaborating with internal teams to align data standards and practices. Leading complex solutions with a focus on operational effectiveness.
Senior Solution Architect defining architecture framework for SA‑CCR in regulatory risk. Collaborating with stakeholders to ensure compliance and efficient data governance.