Onsite
AI/ML Data Engineer
Data Engineer
Design and operate scalable AI/ML data pipelines on AWS, integrating API management and distributed systems to deliver reliable data for machine learning models.
About the role
Key Responsibilities
- Architect, build, and maintain robust data pipelines on Amazon Web Services to support AI/ML workloads.
- Implement API management solutions for secure, high‑throughput data ingestion and for exposing data to downstream consumers.
- Design distributed system components that ensure low latency and high availability of data streams.
- Collaborate with data scientists and ML engineers to provide clean, well‑documented datasets for model training and inference.
- Monitor pipeline performance, troubleshoot failures, and continuously optimize for cost and scalability.
Requirements
- Strong experience with AWS services such as S3, Lambda, Glue, Kinesis, and Redshift.
- Proficiency in building data pipelines using Python and SQL.
- Solid understanding of distributed system principles and API management frameworks.
- Hands‑on experience with AI/ML data workflows and model‑ready data preparation.
- Ability to obtain and maintain a TS/SCI clearance with polygraph.