remote
Senior Data Engineer, Modern Data Platform & AI - aroundhome
Data Engineer
Lead the design and implementation of a modern data platform, integrating large-scale data pipelines, lakehouse architecture, and AI capabilities using Python, Spark, and AWS services.
About the role
Key Responsibilities
- Architect, build, and maintain scalable data pipelines and lakehouse solutions on AWS, ensuring high availability and performance.
- Collaborate with data scientists to deploy ML models into production, integrating model monitoring and retraining workflows.
- Implement data governance, security, and compliance controls across the data platform.
- Optimize query performance and resource utilization for large datasets using SQL and Spark.
- Mentor junior engineers and drive best practices in data engineering and DevOps.
Requirements
- 5+ years of experience in data engineering, with a strong background in Python, SQL, and Spark.
- Hands‑on experience with AWS services (S3, Redshift, Glue, Athena, EMR).
- Proven track record of building lakehouse or data lake architectures.
- Experience with MLOps tools (MLflow, SageMaker, or similar) is a plus.
- Strong analytical, problem‑solving, and communication skills.
Skills
Python, SQL, Apache Spark, AWS