remote
Lead Data Engineer - GCP - Capgemini
Data Engineer
Lead the design, development, and optimization of scalable, secure, and cost‑efficient data pipelines on Google Cloud Platform, driving data strategy for enterprise clients.
About the role
Key Responsibilities
- Architect and implement end‑to‑end data pipelines using GCP services such as BigQuery, Cloud Dataflow, and Pub/Sub.
- Design data models and schemas that support analytics, reporting, and machine learning workloads.
- Optimize query performance and storage costs through partitioning, clustering, and materialized views.
- Collaborate with data scientists, analysts, and stakeholders to translate business requirements into technical solutions.
- Ensure data quality, governance, and security compliance across all data assets.
Requirements
- Senior experience (5+ years) in data engineering with a strong focus on GCP.
- Hands‑on experience with BigQuery, Cloud Dataflow, Cloud Storage, and related GCP services.
- Solid understanding of data modeling, ETL/ELT processes, and performance tuning.
- Excellent communication skills and ability to mentor junior engineers.
Skills
generative aipythonscalasqlgcpterraformapache sparkdbt