remote
Data Engineer II - Disney
Data Engineer
Mid‑level Data Engineer responsible for designing, building, and maintaining scalable data pipelines and warehouses using Python, SQL, Spark, Airflow, and cloud services to support media analytics and product development.
About the role
Key Responsibilities
- Design, develop, and maintain robust ETL/ELT pipelines that ingest, transform, and load large‑scale media and user‑behavior data.
- Implement data models and schemas in cloud data warehouses (e.g., AWS Redshift, Google BigQuery) to enable self‑service analytics.
- Collaborate with data scientists, product managers, and engineers to define data requirements and ensure data quality, reliability, and timeliness.
- Automate workflow orchestration using Apache Airflow, monitoring job health and optimizing performance.
- Apply best practices for code versioning, testing, and documentation to support a production‑grade data platform.
Requirements
- 3+ years of professional experience building data pipelines with Python, SQL, and Spark.
- Hands‑on experience with cloud platforms (AWS, GCP) and data warehouse solutions such as Redshift or BigQuery.
- Proficiency in workflow orchestration tools like Apache Airflow or similar.
- Strong problem‑solving skills, ability to work cross‑functionally, and a passion for delivering high‑quality data solutions.
- Bachelor’s degree in Computer Science, Engineering, or a related quantitative field (or equivalent experience).
Skills
pythonsqlapache sparkairflowaws