Remote
Senior Data Engineer - Litera
Data Engineer
Lead the design, build, and operation of scalable analytics pipelines using Python, SQL, Spark, and AWS, driving data-driven insights for legal technology solutions.
About the role
Key Responsibilities
- Architect and develop robust data pipelines that ingest, transform, and store large volumes of legal data across cloud platforms.
- Leverage Python, SQL, and Apache Spark to build scalable ETL workflows and optimize performance for real‑time analytics.
- Collaborate with data scientists and product teams to translate business requirements into data models and dashboards.
- Implement data quality, governance, and security best practices across the data ecosystem.
- Monitor, troubleshoot, and continuously improve pipeline reliability and cost efficiency on AWS.
Requirements
- 5+ years of experience as a data engineer in a fast‑paced, cloud‑native environment.
- Proficiency in Python, SQL, and Spark for large‑scale data processing.
- Hands‑on experience with AWS services (S3, Redshift, Glue, EMR, Athena).
- Strong understanding of data modeling, schema design, and ETL best practices.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
Python, SQL, Apache Spark, AWS