remote
Data Engineer - GCP - signitives IT Solutions
Data Engineer
Data Engineer role focused on building scalable GCP data platforms using BigQuery, Dataflow, Cloud Composer, and PySpark, with strong Python skills and expertise in Medallion Architecture, data governance, CI/CD, and performance optimization.
About the role
Key Responsibilities
- Design, develop, and maintain end‑to‑end data pipelines on GCP using BigQuery, Dataflow, and Cloud Composer.
- Implement Medallion Architecture to structure raw, curated, and refined data layers.
- Write efficient PySpark and Python code for data transformation and enrichment.
- Optimize query performance and cost in BigQuery through partitioning, clustering, and best‑practice tuning.
- Establish data governance, lineage, and security controls across the platform.
- Automate deployment and monitoring via CI/CD pipelines and GCP monitoring tools.
Requirements
- 5+ years of experience as a Data Engineer with deep GCP expertise.