remoteonsite
Data Engineer-Husky Guindy,Chennai - Husky Injection Molding
Data Engineer
Data Engineer role focused on building scalable data pipelines using Python, SQL, and Spark on AWS, driving data quality and performance for high‑velocity manufacturing analytics.
About the role
Key Responsibilities
- Design, develop, and maintain robust ETL pipelines to ingest, transform, and load large volumes of production data into cloud data warehouses.
- Leverage Apache Spark and Python to build scalable data processing workflows, ensuring high performance and reliability.
- Implement data quality checks, monitoring, and alerting to guarantee data integrity across all stages.
- Collaborate with data scientists and business analysts to translate analytical requirements into efficient data models and services.
- Optimize SQL queries and data storage structures for cost‑effective, high‑throughput access on AWS services such as Redshift, S3, and Glue.
Requirements
- 3+ years of experience in data engineering, with strong proficiency in Python and SQL.
- Hands‑on experience with Apache Spark, AWS Glue, and Redshift.
- Solid understanding of data modeling, ETL best practices, and data warehousing concepts.
- Experience with version control (Git) and CI/CD pipelines for data workflows.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.
Skills
pythonsqlawsapache spark