remote
Data Warehouse Software Engineer I
Software Engineer
Entry‑level engineer building and maintaining data pipelines with Apache Airflow and Cloud Composer, designing scalable data lake and mart solutions, and ensuring data quality and performance across the organization.
About the role
Key Responsibilities
- Design, develop, and maintain data pipelines using Apache Airflow and Cloud Composer to ingest, transform, and load data into data lakes and marts.
- Collaborate with data scientists and analysts to understand data requirements and translate them into efficient ETL workflows.
- Implement best practices for data quality, lineage, and monitoring, including automated alerts and documentation.
- Optimize pipeline performance and resource utilization, troubleshooting failures and bottlenecks.
- Contribute to the evolution of the data architecture, recommending new tools and techniques to improve scalability and reliability.
Requirements
- Strong foundation in Python programming and SQL for data manipulation.
- Hands‑on experience with Apache Airflow and Cloud Composer orchestration.
- Knowledge of data lake concepts, storage formats (Parquet, Delta Lake), and data mart design.
- Familiarity with cloud data services (e.g., GCP, AWS, Azure) and containerization.
- Excellent problem‑solving skills and a collaborative mindset.