remote
Intermediate Data Engineer - AB InBev Growth Group
Data Engineer
Build and maintain scalable data pipelines for personalization, leveraging Python, Spark, and AWS services to deliver high‑quality data for machine‑learning models.
About the role
Key Responsibilities
- Design, develop, and optimize data pipelines that ingest, transform, and store large volumes of structured and unstructured data.
- Implement Spark jobs in Python to process data at scale, ensuring performance and reliability.
- Collaborate with data scientists to provide clean, well‑documented datasets for machine‑learning experiments.
- Maintain and monitor data workflows using Airflow, ensuring timely execution and alerting.
- Utilize AWS services (S3, Redshift, Glue) to build and manage data infrastructure.
- Write efficient SQL queries for data exploration and reporting.
Requirements
- 3+ years of experience as a data engineer or similar role.
Skills
pythonapache sparksqlawsairflow