remote
Data Engineer II - Topgolf
Data Engineer
Mid‑level Data Engineer responsible for designing, building, and maintaining scalable ETL/ELT pipelines and cloud data platforms using Python, SQL, AWS analytics services, and Apache Airflow to enable reporting and machine‑learning workloads.
About the role
Key Responsibilities
- Design, develop, and maintain robust ETL/ELT pipelines using Python, SQL, and Apache Airflow.
- Build, optimize, and manage data warehouses and data lakes on AWS services such as Redshift, S3, and Glue.
- Collaborate with analytics and ML teams to ensure data availability, quality, and performance for downstream use cases.
- Implement data modeling, partitioning, and indexing strategies to improve query efficiency and cost effectiveness.
- Monitor pipeline health, troubleshoot failures, and continuously improve reliability and scalability.
Requirements
- 2+ years of professional experience in data engineering or related roles.
- Strong proficiency in SQL and Python for data transformation and automation.
- Hands‑on experience with AWS data analytics services (e.g., Redshift, S3, Glue, Athena).
- Practical knowledge of workflow orchestration tools, preferably Apache Airflow.
- Experience designing data warehouses or data lakes and applying best practices for data modeling and performance tuning.