Remote
Associate Data Engineer - Nomura
Data Engineer
The Associate Data Engineer designs, builds, and maintains scalable data pipelines and infrastructure using Python, SQL, and AWS services, supporting analytics and business intelligence across the organization.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines and ETL processes to ingest, transform, and load data from diverse sources into data lakes and warehouses.
- Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high‑quality, reliable datasets.
- Implement and optimize data models, ensuring performance, scalability, and data quality across the platform.
- Use AWS services (S3, Redshift, Glue, Athena) and big‑data technologies (Spark, Hive) to build and manage data infrastructure.
- Monitor, troubleshoot, and improve pipeline performance, applying best practices for error handling and logging.
Requirements
- Strong proficiency in Python and SQL for data manipulation and automation.
- Experience with ETL tools and frameworks, preferably AWS Glue or similar.
- Hands‑on knowledge of big‑data ecosystems (Spark, Hive) and cloud data services.
- Solid understanding of data modeling, warehousing concepts, and data quality principles.
- Excellent problem‑solving skills and the ability to work collaboratively in a fast‑paced environment.