Onsite
Senior Advanced Data Engineer - Honeywell
Data Engineer
Lead the design, development, and optimization of large‑scale data pipelines using Python, Spark, and AWS, ensuring high data quality and performance to drive business insights.
About the role
Key Responsibilities
- Design, build, and maintain scalable data pipelines and lakehouse architectures on AWS.
- Implement ETL processes using Python and Apache Spark, optimizing for performance and cost.
- Ensure data quality, lineage, and governance across all data assets.
- Collaborate with data scientists, analysts, and product teams to translate business requirements into technical solutions.
- Monitor, troubleshoot, and continuously improve pipeline reliability and throughput.
Requirements
- 5+ years of experience in data engineering with a focus on large‑scale data processing.
- Proficiency in Python, SQL, and Spark (PySpark).
- Hands‑on experience with AWS services such as S3, Glue, Redshift, and EMR.
- Strong understanding of data modeling, lakehouse concepts, and ETL best practices.
- Excellent problem‑solving skills and ability to work cross‑functionally in a fast‑paced environment.
Skills
Python, Apache Spark, SQL, AWS