
Researcher @reflectionai PhD @neulab & @deep-spin Previously @Google @Unbabel @microsoft
AI is analyzing your overall score…
Identifying your key strengths…
Evaluating your skill match against the job requirements…
Assessing your cultural and operational fit
CMU & IST
Data Scientist
June 27, 2026 – Present
croissant-llm-training
February 1, 2024 – February 4, 2024
Repository containing the code for training the CroissantLLM
View Projectlti-llm-deployment
October 19, 2022 – July 22, 2023
lti-llm-deployment — GitHub repository
View Projectqaware-decode
October 19, 2021 – June 7, 2022
A repository for experiments in quality-aware decoding
View Projectlearning-scaffold
February 15, 2021 – May 19, 2022
This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"
View Projectfairseq-dro-mnmt
November 20, 2020 – September 10, 2021
fairseq-dro-mnmt — GitHub repository
View Projectcontextual-mt
September 2, 2020 – September 22, 2025
A repository with the code related to experiments around context-aware machine translation
View Projectstructured-neural-summarization
March 12, 2019 – May 9, 2019
A repository with the code for the paper with the same title
View ProjectOpenGNN
January 27, 2019 – May 9, 2019
Open source machine learning for graph-structured data
View Projectacs-category-theory-notes-2017
October 5, 2017 – January 18, 2018
Cambridge ACS Category Theory, Type Theory, and Logic - lecture notes 2017.
View ProjectCultural Fit Analysis
The candidate's projects are primarily academic/research-focused and personal, indicating a strong drive for independent learning and contribution to open-source research. The current role as 'Data Scientist' at 'CMU & IST' aligns well with a research-heavy Data Scientist position. However, the lack of diverse project types (e.g., industry-specific applications, team projects beyond research collaborations) might suggest a need to evaluate adaptability to different organizational cultures and project delivery methodologies. The breadth of skills is strong within the ML/NLP domain but lacks explicit mention of broader data engineering, MLOps, or business intelligence tools often required in industry Data Scientist roles.
Soft Skills & Operational Fit
Insufficient data to assess soft skills and operational fit. The psychometric test score is 0, indicating no assessment was completed.