remote
Robotics Data Pipeline Engineer - Multimodal Data - Persona AI
Software Engineer
Design and maintain scalable data pipelines that ingest, process, and serve multimodal sensor data for humanoid robotics, leveraging Python, AWS, and advanced ETL techniques to support real‑time decision making in heavy‑industry environments.
About the role
Key Responsibilities
- Architect and implement end‑to‑end data pipelines for multimodal sensor streams (vision, lidar, IMU, force sensors) using Python and AWS services.
- Develop robust ETL workflows that transform raw data into analytics‑ready formats, ensuring data quality, consistency, and low latency.
- Collaborate with robotics, software, and operations teams to define data schemas, metadata standards, and performance metrics.
- Monitor pipeline health, troubleshoot failures, and optimize throughput and cost across cloud infrastructure.
- Document pipeline architecture, data models, and best practices for internal knowledge sharing.
Requirements
- 3+ years of experience building data pipelines in a production environment.
- Proficiency in Python, SQL, and AWS services (S3, Glue, Lambda, Redshift, Athena).
- Strong understanding of ETL concepts, data modeling, and performance tuning.
- Experience with multimodal or sensor data is a plus.
- Excellent problem‑solving skills and ability to work cross‑functionally in a fast‑paced setting.