remote
Data Engineer III - Grainger Businesses
Data Engineer
Senior Data Engineer responsible for designing, building, and maintaining scalable data pipelines on AWS, leveraging Python, SQL, and Spark to transform and model data for analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and optimize large-scale data pipelines using Python, SQL, and Apache Spark on AWS services (Glue, Redshift, S3).
- Implement robust ETL processes, ensuring data quality, lineage, and performance across multiple data sources.
- Collaborate with data scientists and business analysts to model data and deliver actionable insights.
- Maintain and enhance data architecture, including data lake and warehouse solutions, following best practices.
- Monitor pipeline health, troubleshoot issues, and continuously improve reliability and scalability.
Requirements
- 5+ years of experience in data engineering with a strong focus on cloud-based solutions.
- Proficiency in Python, SQL, and experience with Spark or similar big‑data frameworks.
- Hands‑on experience with AWS data services (Glue, Redshift, Athena, S3, Lambda).
- Solid understanding of data modeling, schema design, and ETL best practices.
- Excellent problem‑solving skills and ability to work collaboratively in a fast‑paced environment.
Skills
pythonsqlawsapache spark