onsite
Junior Data Engineer - Guidehouse Inc.
Data Engineer
Entry‑level data engineer focused on building automated metadata ingestion pipelines using SQL, Python and R, ensuring data completeness, accuracy and timely refresh for enterprise reporting.
About the role
Key Responsibilities
- Design, develop, and maintain automated metadata harvesting pipelines that ingest source data on scheduled intervals.
- Write extraction scripts and connectors in SQL, Python, and R to pull data from diverse systems.
- Implement validation rules and data quality checks to guarantee completeness, accuracy, and timeliness of harvested metadata.
- Configure and monitor refresh frequencies, handling failures and performance tuning.
- Collaborate with source system owners and consulting teams, both in‑person and virtually, to gather requirements and resolve integration issues.
Requirements
- Bachelor's degree in Computer Science, Information Systems, or related field.
- Proficiency in SQL and programming with Python or R for data extraction and transformation.
- Understanding of ETL concepts and experience building data pipelines.
- Strong analytical skills with ability to design validation logic and troubleshoot data quality problems.
- Effective communication and teamwork skills; willingness to travel up to 10%.