remote
Data Engineer Databricks - Capgemini
Data Engineer
Lead the design and migration of enterprise‑scale Lakehouse solutions on Databricks, crafting reference architectures and data platform roadmaps to empower data‑driven decision making.
About the role
Key Responsibilities
- Design and implement enterprise‑scale Lakehouse architectures using Databricks, ensuring performance, scalability, and security.
- Define target‑state data platform architectures and develop detailed migration roadmaps from legacy systems.
- Develop reference architectures and best‑practice guidelines for data ingestion, processing, and analytics.
- Collaborate with data scientists, analysts, and business stakeholders to translate requirements into robust data solutions.
- Optimize data pipelines using Python, SQL, and Databricks notebooks, and monitor performance metrics.
Requirements
- Proven experience designing and deploying Lakehouse solutions on Databricks.
- Strong knowledge of data platform architecture, ETL/ELT processes, and cloud data services.
- Hands‑on expertise with Python, SQL, and Databricks notebooks.
- Excellent communication skills and ability to work cross‑functionally.
- Experience with migration planning and execution from on‑prem or other cloud platforms.