remote
Senior Data Engineer - Crocs
Data Engineer
Lead the design, build, and scaling of next‑generation data pipelines and analytics platforms using Python, SQL, AWS, and Spark to empower data‑driven decisions across the organization.
About the role
Key Responsibilities
- Architect, develop, and maintain scalable data pipelines that ingest, transform, and store large volumes of structured and unstructured data.
- Collaborate with data scientists, analysts, and product teams to define data requirements and deliver high‑quality, reproducible datasets.
- Implement robust data quality, monitoring, and alerting frameworks to ensure reliability and performance of data services.
- Optimize existing ETL processes for speed, cost, and maintainability using AWS services (Glue, Redshift, S3) and Spark.
- Document data models, pipeline logic, and best practices for internal use and compliance.
Requirements
- 5+ years of experience as a data engineer or similar role in a fast‑paced environment.
- Proficiency in Python, SQL, and Apache Spark for data processing.
- Hands‑on experience with AWS data services (Glue, Redshift, S3, Athena).
- Strong understanding of data modeling, schema design, and ETL best practices.
- Excellent problem‑solving skills and ability to communicate complex technical concepts to non‑technical stakeholders.
Skills
pythonsqlawsapache spark