onsite
Research Data Engineer II - University of Rochester
Data Engineer
Senior data engineer responsible for designing, building, and maintaining scalable data pipelines and analytics infrastructure using Python, SQL, AWS, and Spark to support neuroscience research initiatives.
About the role
Key Responsibilities
- Design, develop, and maintain robust data pipelines that ingest, transform, and store large volumes of research data across multiple sources.
- Implement ETL processes using Python, SQL, and Apache Spark to ensure data quality, consistency, and accessibility for downstream analytics.
- Leverage AWS services (S3, Redshift, Glue, Lambda) to build scalable, secure, and cost‑effective data solutions.
- Collaborate with data scientists, researchers, and IT teams to understand data requirements and translate them into technical specifications.
- Monitor pipeline performance, troubleshoot issues, and optimize for speed and reliability.
Requirements
- 3+ years of experience in data engineering or a related field.
- Experience designing and maintaining ETL workflows and data pipelines.
- Excellent problem‑solving skills and ability to work collaboratively in a research environment.
Skills
Python, SQL, AWS, Apache Spark