remote
Senior Data Engineer - BABYLIST
Data Engineer
Lead end‑to‑end data pipeline development for a high‑growth consumer platform, leveraging Python, SQL, AWS, and Spark to deliver reliable, scalable analytics and machine‑learning data assets.
About the role
Key Responsibilities
- Design, build, and maintain large‑scale data pipelines that ingest, transform, and store data from multiple internal and external sources.
- Collaborate with data scientists and product teams to create reproducible data models and feature stores for AI/ML initiatives.
- Optimize pipeline performance and cost on AWS, utilizing services such as S3, Redshift, Glue, and EMR.
- Implement robust monitoring, alerting, and documentation to ensure data quality and reliability.
- Mentor junior engineers and champion best practices in data engineering and DevOps.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python and SQL.
- Hands‑on expertise with AWS data services (S3, Redshift, Glue, EMR) and Spark or similar big‑data frameworks.
- Proven track record of building scalable, production‑grade data pipelines and data warehouses.
- Excellent problem‑solving skills and ability to work independently in a remote environment.
- Strong communication skills and a collaborative mindset.