onsite
Generative AI Analyst - Welo Data
AI Engineer
Detail‑oriented analyst focused on building high‑quality multilingual and multimodal datasets for generative AI models, leveraging Python, NLP techniques, and rigorous data quality practices.
About the role
Key Responsibilities
- Annotate, evaluate, and perform quality reviews on multilingual and multimodal datasets used to train generative AI systems.
- Collaborate with data scientists and engineers to refine annotation guidelines and improve dataset consistency.
- Apply NLP techniques to preprocess and analyze text and image data, ensuring alignment with model requirements.
- Track and report on annotation metrics, identifying areas for process improvement and automation.
- Support the development of new annotation tools and workflows to increase efficiency and accuracy.
Requirements
- Proficiency in Python and experience with NLP libraries (e.g., spaCy, NLTK, Hugging Face).
- Strong understanding of data quality principles and experience in data annotation for AI projects.
- Experience working with multilingual datasets and multimodal data (text, image, audio).
- Excellent analytical and problem‑solving skills, with a keen eye for detail.
- Ability to communicate findings clearly to cross‑functional teams.
Skills
pythonnlpmachine learninggenerative ai