onsite
Senior Data Scientist I - MetLife
Data Scientist
Lead the design and deployment of large‑scale data pipelines on Azure, driving end‑to‑end data engineering solutions that power analytics and machine learning initiatives.
About the role
Key Responsibilities
- Architect, develop, and maintain robust ETL/ELT pipelines on Azure and on‑prem environments to ingest, transform, and store massive structured and semi‑structured datasets.
- Collaborate with cross‑functional teams to translate business requirements into scalable data solutions, ensuring data quality, lineage, and governance.
- Mentor junior data engineers, providing technical guidance on best practices in big data processing, cloud architecture, and performance tuning.
- Optimize data workflows using Spark, Python, and other modern data processing frameworks to achieve high throughput and low latency.
- Implement monitoring, alerting, and automated testing to guarantee production‑grade reliability and maintainability.
Requirements
- 8–10+ years of experience in data engineering or related roles, with a strong background in big data technologies.
- Proficiency in Azure services (Data Factory, Databricks, Synapse) and on‑prem data platforms.
- Expertise in Python, Apache Spark, and SQL for data transformation and analytics.
- Solid understanding of data governance, security, and compliance best practices.
- Excellent problem‑solving skills and the ability to mentor and influence peers.
Skills
azurepythonapache spark