onsite
Senior Data Engineer - Hirezy
Data Engineer
Senior Data Engineer responsible for designing, building, and maintaining scalable ETL/ELT pipelines that ingest and transform large pharmaceutical datasets, delivering clean, production‑grade data for analytics and visualization.
About the role
Key Responsibilities
- Design and implement scalable ETL/ELT pipelines to ingest structured and semi‑structured pharma data from diverse sources.
- Transform indication‑level market forecast data from Excel workbooks into JSON and database‑ready formats for front‑end visualisation.
- Develop and maintain data extraction workflows from clinical trial registries and other external APIs.
- Collaborate closely with Data Scientists and Python developers to ensure data quality, validation, and reproducibility.
- Monitor pipeline performance, troubleshoot failures, and optimise for cost and speed using orchestration tools.
Requirements
- 5+ years of experience building data pipelines in Python and SQL.
- Strong knowledge of ETL/ELT concepts, data modeling, and handling large‑scale structured and semi‑structured datasets.
- Hands‑on experience with workflow orchestration tools such as Apache Airflow.
- Proficiency in converting Excel‑based data into JSON and relational database formats.
- Ability to work cross‑functionally with data science and engineering teams to deliver production‑grade data solutions.