remote
AI Data Training and Quality Assurance Engineer - digital india private
QA Engineer
Join a large‑scale AI data training and quality assurance effort, applying Python, Go, Rust, and C++ to build and validate datasets for supervised fine‑tuning and reinforcement learning from human feedback.
About the role
Key Responsibilities
- Design, implement, and maintain data pipelines and annotation tools using Python, Go, Rust, or C++.
- Collaborate with subject‑matter experts to create high‑quality training datasets for supervised fine‑tuning (SFT) and RLHF models.
- Perform rigorous quality assurance checks, error analysis, and data validation to ensure dataset integrity.
- Develop automated testing frameworks to evaluate data consistency and model performance.
- Document processes, guidelines, and best practices for data collection, labeling, and QA.
Requirements
- Strong programming skills in at least two of the following: Python, Go, Rust, C++.
- Experience with data annotation, labeling workflows, or AI model training pipelines.
- Understanding of supervised fine‑tuning and reinforcement learning from human feedback concepts.
- Excellent analytical abilities and attention to detail for QA tasks.
- Effective communication skills to work with cross‑functional teams of engineers, linguists, and domain experts.