onsite
Senior Data Developer - Cleveland Clinic
Software Engineer
Senior Data Developer responsible for designing, building, and deploying robust data pipelines and applications using Python, SQL, and Spark on AWS, ensuring high-quality extraction, transformation, and loading of healthcare data into enterprise data warehouses.
About the role
Key Responsibilities
- Design, develop, test, and deploy scalable data pipelines and applications that extract, validate, transform, and load healthcare data into enterprise data warehouses.
- Collaborate with developers, project managers, analysts, leaders, and clinicians to translate business requirements into technical solutions.
- Implement and maintain ETL processes using Python, SQL, and Spark, ensuring data quality, performance, and reliability.
- Leverage AWS services (e.g., S3, Redshift, Glue) to build and optimize data infrastructure.
- Document data models, pipeline logic, and best practices for future maintenance and knowledge transfer.
Requirements
- 5+ years of experience in data engineering or related roles within a healthcare or regulated environment.
- Proficiency in Python, SQL, and Spark for data processing and transformation.
- Hands‑on experience with ETL tools and AWS data services.
- Strong analytical skills and ability to troubleshoot complex data issues.
- Excellent communication skills and a collaborative mindset.