remote
Generative AI Annotator - Innodata
AI Engineer
The Generative AI Annotator labels and validates large language model datasets to ensure high-quality training data for AI systems. The role requires expertise in NLP, data annotation tools, Python scripting, and rigorous quality assurance practices.
About the role
Key Responsibilities
- Annotate and curate large-scale text, image, and multimodal datasets for generative AI models.
- Develop and maintain annotation guidelines, ensuring consistency and accuracy across projects.
- Collaborate with data scientists and ML engineers to refine labeling workflows and improve model performance.
- Perform quality assurance checks, flagging errors and providing feedback to improve annotation tools.
- Document annotation processes and contribute to best-practice knowledge bases.
Requirements
- Strong background in NLP and experience with generative AI datasets.
- Proficiency in Python and familiarity with annotation platforms (e.g., Label Studio, Prodigy).
- Excellent attention to detail and ability to maintain high quality under tight deadlines.
- Effective communication skills and a collaborative mindset.
- Experience with version control (Git) and data versioning tools is a plus.
Skills
generative AI, NLP, machine learning, Python