remote

Data Lead Software Engineer - Python, PySpark and AWS - JPMorganChase

Software Engineer

Lead data engineering initiatives, designing and delivering scalable Python and PySpark solutions on AWS to power consumer banking products. Drive architecture, performance, and security while mentoring a cross‑functional agile team.

About the role

Key Responsibilities

Architect, develop, and maintain large‑scale data pipelines using Python and PySpark on AWS services (EMR, S3, Redshift, Glue).
Collaborate with product, data science, and security teams to translate business requirements into robust, secure, and scalable data solutions.
Lead code reviews, enforce best practices, and mentor junior engineers on clean code, testing, and CI/CD.
Optimize performance and cost of data workflows, implementing monitoring, alerting, and automated scaling.
Drive continuous improvement of data architecture, tooling, and documentation to support rapid feature delivery.

Requirements

5+ years of software engineering experience with a focus on data processing.
Proficiency in Python, PySpark, and AWS cloud services.
Strong understanding of data modeling, ETL, and performance tuning.
Experience with CI/CD pipelines, unit testing, and automated deployment.
Excellent communication skills and a collaborative mindset.

Skills

pythonaws

CompanyJPMorganChase

DepartmentEngineering

LocationNew York, United States

Experience7+ years

Tenurefull-time

LevelLead

Posted June 25, 2026