remote
Data Engineer - Blood Cancer United
Data Engineer
Data Engineer responsible for designing, building, and maintaining scalable data pipelines and infrastructure using Python, SQL, and AWS services to support research and analytics for blood cancer cure initiatives.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and load large volumes of structured and unstructured data.
- Collaborate with data scientists and analysts to understand data requirements and deliver high‑quality, reproducible datasets.
- Implement data quality checks, monitoring, and alerting to ensure data integrity and reliability.
- Optimize query performance and storage costs across AWS services such as Redshift, S3, and Glue.
- Document data models, pipeline architecture, and best practices for future maintenance and onboarding.
Requirements
- 3+ years of experience as a data engineer or similar role.
- Experience designing data models and building scalable data warehouses.
- Excellent problem‑solving skills and a collaborative mindset.