remote
Data Lead Software Engineer - Python, PySpark and AWS - JPMorganChase
Software Engineer
Lead data engineering initiatives, designing and delivering scalable Python and PySpark solutions on AWS to power consumer banking products. Drive architecture, performance, and security while mentoring a cross‑functional agile team.
About the role
Key Responsibilities
- Architect, develop, and maintain large‑scale data pipelines using Python and PySpark on AWS services (EMR, S3, Redshift, Glue).
- Collaborate with product, data science, and security teams to translate business requirements into robust, secure, and scalable data solutions.
- Lead code reviews, enforce best practices, and mentor junior engineers on clean code, testing, and CI/CD.
- Optimize performance and cost of data workflows, implementing monitoring, alerting, and automated scaling.
- Drive continuous improvement of data architecture, tooling, and documentation to support rapid feature delivery.
Requirements
- 5+ years of software engineering experience with a focus on data processing.
- Proficiency in Python, PySpark, and AWS cloud services.
- Strong understanding of data modeling, ETL, and performance tuning.
- Experience with CI/CD pipelines, unit testing, and automated deployment.
- Excellent communication skills and a collaborative mindset.