remote
Data Engineer I - AFT BI Content - Amazon.com
Data Engineer
Entry‑level Data Engineer focused on building scalable data pipelines in AWS, using Python, SQL and Spark to ingest, transform and model data for Amazon’s fulfillment analytics.
About the role
Key Responsibilities
- Design, develop and maintain large‑scale data pipelines using Python, SQL and Apache Spark on AWS.
- Collaborate with cross‑functional teams to define data models and schemas for BI and analytics workloads.
- Implement robust ETL processes that ensure data quality, traceable lineage and optimized performance.
- Monitor pipeline health, troubleshoot issues and continuously improve reliability and scalability.
- Document data flows, architecture decisions and best practices for internal knowledge sharing.
Requirements
- Strong programming skills in Python and SQL.
- Experience with distributed data processing frameworks such as Apache Spark.
- Hands‑on knowledge of AWS services (S3, Redshift, EMR, Glue).
- Solid understanding of data modeling, ETL concepts and data warehousing principles.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
Python, SQL, Apache Spark, AWS