onsite
Data Engineer - Johannes Kepler Universitat
Data Engineer
Build and maintain scalable data pipelines using Python, SQL, and AWS services to support analytics and machine learning initiatives.
About the role
Key Responsibilities
- Design, develop, and optimize ETL workflows to ingest, transform, and load large datasets from diverse sources.
- Implement data models and schemas in relational and NoSQL databases, ensuring data integrity and performance.
- Collaborate with data scientists and analysts to provide clean, reliable data for modeling and reporting.
- Monitor pipeline health, troubleshoot issues, and implement automated alerts and logging.
- Document data lineage, metadata, and best practices for data governance.
Requirements
- Proficiency in Python and SQL for data manipulation and automation.
- Experience with AWS services such as S3, Redshift, Glue, and Lambda.
- Strong understanding of data modeling, normalization, and indexing.
- Familiarity with CI/CD pipelines and version control (Git).
- Excellent problem‑solving skills and ability to work in a collaborative environment.