onsite
Data Engineer II, Creator Services - Amazon.com
Data Engineer
Build and maintain scalable data pipelines and analytics platforms for Amazon Creator Services, leveraging Python, SQL, AWS, Spark, and Airflow to enable creator monetization and insights.
About the role
Key Responsibilities
- Design, develop, and operate production‑grade data pipelines that ingest, transform, and store large‑scale creator‑related datasets.
- Collaborate with product, analytics, and machine‑learning teams to define data requirements and deliver reliable, low‑latency data services.
- Implement and maintain data models, warehouses, and lakehouse solutions on AWS (Redshift, S3, Glue) to support reporting and real‑time analytics.
- Automate workflow orchestration using Apache Airflow, ensuring robust scheduling, monitoring, and alerting.
- Optimize performance of Spark jobs and SQL queries, applying best practices for cost‑effective cloud resource utilization.
- Participate in code reviews, testing, and documentation to uphold engineering standards and data quality.
Requirements
- 2+ years of professional experience building data pipelines with Python, SQL, and Apache Spark.
- Strong hands‑on experience with AWS services such as S3, Redshift, Glue, and Lambda.
- Proficiency in workflow orchestration tools, preferably Apache Airflow.
- Solid understanding of data modeling, ETL design, and performance tuning for large datasets.
- Excellent problem‑solving skills and ability to work cross‑functionally in a fast‑paced environment.
Skills
pythonsqlawsapache sparkairflow