Remote
Data Integration Specialist - University of Washington
Software Engineer
Lead system‑level data discovery and stewardship, ensuring high‑quality datasets for global health modeling using SQL, Python, ETL tools, and the Global Health Data Exchange. Drive diagnostics, annotation, and collaboration across teams.
About the role
Key Responsibilities
- Conduct system‑level discovery of data sources, leveraging the Global Health Data Exchange (GHDx) and emerging technologies.
- Design and execute ETL pipelines in SQL and Python to ingest, transform, and load datasets into the data warehouse.
- Perform data quality diagnostics, identify gaps, and develop remediation plans so that datasets are fit for use as model inputs.
- Annotate and catalog resources for discoverability, maintaining metadata and documentation.
- Collaborate with cross‑functional teams to secure critical datasets and ensure compliance with data governance policies.
Requirements
- Proven experience in data integration, ETL development, and data quality management.
- Strong proficiency in SQL and Python for data manipulation and automation.
- Familiarity with the Global Health Data Exchange (GHDx) and related health data standards.
- Excellent analytical, problem‑solving, and communication skills.
- Ability to work independently in a fast‑paced, collaborative environment.