Remote
Data Workflow Engineer - AI & Automation - Digital Realty
Software Engineer
Build and maintain scalable data pipelines that automate occupancy and lease analytics, improve data quality, and enable AI‑driven insights using Python, SQL, Airflow, Snowflake, and AWS services.
About the role
Key Responsibilities
- Design, develop, and operate end‑to‑end data pipelines that ingest, cleanse, and transform lease and occupancy data from multiple sources.
- Implement workflow orchestration using Apache Airflow to schedule, monitor, and recover data jobs.
- Leverage Snowflake and AWS data services to store, query, and serve high‑volume datasets for downstream analytics.
- Automate data quality checks and alerting so that business users can rely on the data.
- Collaborate with data scientists and analysts to embed AI/ML models into production pipelines, enhancing predictive analytics capabilities.
Requirements
- 3+ years of experience building data pipelines with Python and SQL in cloud environments.
- Hands‑on expertise with Apache Airflow (or similar orchestration tools) and Snowflake or comparable data warehouses.
- Proficiency in AWS services such as S3, Lambda, and Glue for data storage and processing.
- Strong understanding of ETL best practices, data modeling, and data quality frameworks.
- Experience integrating machine‑learning models or AI components into production data workflows is a plus.
Skills
Python, SQL, Snowflake, AWS