remote
Senior Data Engineer - Sovos
Data Engineer
Senior Data Engineer responsible for designing, building, and optimizing large‑scale data pipelines on cloud platforms, leveraging Python, Spark, and AWS services to deliver reliable, high‑performance data solutions.
About the role
Key Responsibilities
- Design, develop, and maintain scalable ETL pipelines using Python, Apache Spark, and SQL to ingest and transform high‑volume data from diverse sources.
- Architect and implement data models and data warehouses on AWS (Redshift, S3, Glue) to support analytics and reporting needs.
- Orchestrate workflow automation with Apache Airflow, ensuring reliable scheduling, monitoring, and error handling.
- Collaborate with data scientists, product managers, and business stakeholders to translate requirements into robust data solutions.
- Implement best practices for data quality, security, and performance optimization across the data platform.
Requirements
- 5+ years of professional experience in data engineering or related fields.
- Strong proficiency in Python and SQL, with hands‑on experience in Apache Spark or similar distributed processing frameworks.
- Deep knowledge of AWS services (e.g., Redshift, S3, Glue, Lambda) and cloud‑native data architecture.
- Experience with workflow orchestration tools such as Apache Airflow.
- Solid understanding of data modeling, ETL design patterns, and data warehousing concepts.
Skills
pythonsqlapache sparkawsairflow