remote
Data Engineer - AI/ML Pipelines Remote - Bright Sol
Data Engineer
Remote Data Engineer responsible for designing and implementing scalable AI/ML data pipelines, ETL/ELT processes, and cloud‑based data lakehouse solutions using Python, SQL, and AWS while supporting model deployment and data quality initiatives.
About the role
Key Responsibilities
- Design, develop, and maintain scalable data pipelines and AI integration workflows for machine‑learning projects.
- Build and optimize ETL/ELT processes to ingest, transform, and store data in cloud‑based lakehouse architectures.
- Implement data quality, observability, and CI/CD practices to ensure reliable, production‑grade data flows.
- Collaborate with data scientists and product teams to create feature stores and support model deployment pipelines (MLOps).
- Leverage AWS services (e.g., S3, Redshift, Glue) and apply Agile methodologies to deliver incremental improvements.
Requirements
- 5+ years of professional experience in data engineering, with strong Python and SQL proficiency.
- Hands‑on experience building ETL/ELT pipelines and working with cloud platforms (AWS preferred).
- Familiarity with data lakehouse concepts, MLOps practices, and CI/CD tooling.
- Demonstrated ability to ensure data quality, monitoring, and observability in production systems.
- Experience working in Agile teams and communicating effectively with cross‑functional stakeholders.