remote
Lead Data Engineer - GATX
Data Engineer
Lead Data Engineer driving enterprise analytics, building scalable pipelines on Databricks and AWS, shaping data strategy, and mentoring a high‑performance team.
About the role
Key Responsibilities
- Design, develop, and maintain large‑scale data pipelines using Databricks, Spark, and Python on AWS.
- Collaborate with data scientists and product teams to define data models, schemas, and governance standards.
- Lead architecture discussions, evaluate new technologies, and drive best practices for data quality and performance.
- Mentor and coach junior engineers, fostering a culture of continuous improvement and knowledge sharing.
- Ensure compliance with security, privacy, and regulatory requirements across all data assets.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python, Spark, and cloud data platforms.
- Proven expertise in building and optimizing ETL pipelines on Databricks and AWS services (S3, Glue, Redshift).
- Solid understanding of data modeling, schema design, and performance tuning.
- Experience leading technical teams and driving cross‑functional collaboration.
- Excellent problem‑solving skills and a passion for data‑driven innovation.
Skills
pythondatabricksaws