Remote
Data Engineer - Battery Data Platform & AI - Apple
Data Engineer
Lead the design and implementation of a battery data platform, building robust data pipelines and an AI-powered natural language interface to serve engineers across the organization. Leverage Python, Spark, Airflow, and AWS to deliver clean, scalable data solutions and advanced ML capabilities.
About the role
Key Responsibilities
- Design, develop, and maintain end‑to‑end data pipelines that ingest, transform, and store large battery datasets using Python, Spark, and SQL.
- Implement orchestration workflows with Airflow to ensure reliable, scheduled data processing and monitoring.
- Build and deploy AI/NLP services that provide a natural language interface for battery engineers to query and analyze data.
- Collaborate with battery scientists and software teams to define data models, quality metrics, and performance benchmarks.
- Optimize data storage and retrieval on AWS services (S3, Redshift, Athena) to reduce cost and improve query speed.
- Document architecture, code, and best practices to enable knowledge transfer across the organization.
Requirements
- 5+ years of data engineering experience in a large, data‑centric environment.
Skills
Python, SQL, Apache Spark, Airflow, AWS, machine learning, natural language processing