Remote
Staff Engineer - Data - GEICO
Software Engineer
Lead the design and implementation of scalable data platforms, building robust pipelines and advanced analytics solutions with Python, Spark, SQL, and AWS to support data‑driven decision‑making.
About the role
Key Responsibilities
- Architect, develop, and maintain high‑performance data pipelines and lakehouse solutions supporting real‑time and batch processing.
- Collaborate with product, analytics, and engineering teams to translate business requirements into scalable data models and APIs.
- Implement and optimize data processing workloads using Apache Spark, Python, and SQL on AWS services such as EMR, Redshift, and S3.
- Lead the integration of machine‑learning models into production data flows, ensuring they run reliably and are monitored.
- Establish best practices for data quality, governance, and security across the organization.
Requirements
- 10+ years of experience in data engineering or related fields, with a proven track record of building large‑scale data platforms.
- Strong proficiency in Python, SQL, and Apache Spark, and hands‑on experience with AWS data services.
- Deep understanding of data modeling, ETL/ELT design, and performance tuning for big data workloads.
- Experience deploying and operationalizing machine‑learning models in production environments.
- Excellent problem‑solving skills and ability to mentor junior engineers while driving technical excellence.
Skills
Python, SQL, Apache Spark, AWS, machine learning