remote
Senior Data Scientist - Geo Owl LLC
Data Scientist
Senior Data Scientist leading data architecture and pipeline development for geospatial AI accreditation, building robust ETL workflows, managing cross‑domain databases, and ensuring secure, scalable data delivery using Python, SQL, AWS, and Airflow.
About the role
Key Responsibilities
- Design and implement end‑to‑end data pipelines that ingest, standardize, and transform large volumes of satellite imagery for AI model accreditation.
- Develop and maintain database schemas and data warehouses across multiple security domains, ensuring data integrity and compliance.
- Build, schedule, and monitor ETL workflows using Apache Airflow, optimizing performance and reliability.
- Containerize data processing components with Docker and orchestrate deployments on AWS services (S3, EC2, RDS, Lambda).
- Collaborate with geospatial analysts and AI engineers to translate domain requirements into scalable data solutions.
Requirements
- 5+ years of experience in data engineering or data science, with a focus on geospatial data pipelines.
- Proficiency in Python for data manipulation, and strong SQL skills for schema design and query optimization.
- Hands‑on experience with AWS cloud services and containerization (Docker).
- Expertise in workflow orchestration tools such as Apache Airflow.
- Solid understanding of geospatial analytics and the ability to work with imagery datasets in secure environments.