onsite
Linguistic Engineer - Meta
Software Engineer
Linguistic Engineer building multilingual datasets, models, and data pipelines for a multimodal conversational AI assistant, leveraging Python, ML, NLP, speech tech, and knowledge graph expertise.
About the role
Key Responsibilities
- Design, curate, and maintain high‑quality multilingual datasets for ASR, NLU, NLG, and LLM components.
- Develop and optimize data pipelines and infrastructure to support continuous model training and evaluation.
- Collaborate with cross‑functional teams to ensure consistent linguistic representations across voice, vision, and reasoning modules.
- Implement and refine knowledge graph structures to enhance contextual understanding and retrieval.
- Analyze model performance, identify linguistic gaps, and propose data‑driven solutions.
Requirements
- Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow).
- Deep understanding of NLP techniques, including tokenization, embeddings, and transformer models.
- Experience with speech technologies such as ASR/TTS and related data pipelines.
- Familiarity with graph databases and knowledge graph construction.
- Excellent analytical skills and ability to work in a fast‑paced, collaborative environment.
Skills
pythonmachine learningnatural language processing