Remote / On-site
Associate Big Data Engineer - MetLife
Data Engineer
Entry‑level Big Data Engineer role building scalable data pipelines with Hadoop, Spark, and AWS services. The position calls for strong SQL and Python skills to support analytics and reporting across the organization.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Hadoop and Spark to ingest, transform, and store data.
- Write efficient SQL queries and Python scripts for data extraction, cleansing, and enrichment.
- Collaborate with data scientists and analysts to deliver high‑quality datasets for reporting and machine learning projects.
- Monitor pipeline performance, troubleshoot issues, and implement optimizations to ensure reliability and scalability.
- Use AWS services (S3, EMR, Glue, Redshift) to support data storage, processing, and analytics workflows.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- 1–2 years of experience with big data technologies (Hadoop, Spark).
- Strong proficiency in SQL and Python for data manipulation.
- Experience with AWS data services (S3, EMR, Glue, Redshift) is a plus.
- Excellent problem‑solving skills and the ability to work collaboratively in a fast‑paced environment.