Remote
Senior/Lead Data Engineer - AI-Native Aftermarket Platform - Truelogic
Data Engineer
Lead data engineering for an AI‑native aftermarket platform, driving scalable data pipelines, cloud architecture, and advanced analytics using Python, SQL, AWS, and Spark.
About the role
Key Responsibilities
- Design, build, and maintain end‑to‑end data pipelines that ingest, transform, and serve large volumes of structured and unstructured data for AI models.
- Architect and optimize data storage solutions on AWS (S3, Redshift, Athena), ensuring high availability, security, and cost efficiency.
- Collaborate with data scientists to deploy machine learning workflows, integrating feature stores and model monitoring into the data platform.
- Implement robust data quality, lineage, and governance frameworks using Airflow, dbt, and metadata catalogs.
- Mentor junior engineers, conduct code reviews, and champion best practices in data engineering and DevOps.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python, SQL, and AWS services.
- Proficiency with Apache Spark, Airflow, and modern data modeling techniques.
- Hands‑on experience building ML pipelines and integrating with data science workflows.
- Excellent problem‑solving skills and a track record of delivering production‑grade data solutions.
- Strong communication skills and the ability to work cross‑functionally in a fast‑paced environment.
Skills
Python, SQL, AWS, Apache Spark, Machine Learning