onsite
Data Scientist
Data Scientist
The Data Scientist will focus on developing and deploying machine learning solutions within an enterprise setting, leveraging Python, deep learning techniques, and Azure services. Key responsibilities include GenAI document extraction, data parsing, and robust Python engineering with CI/CD practices.
About the role
About the Role
Tata Technologies is seeking a skilled Data Scientist with a focus on Machine Learning and Data Science in enterprise settings. The ideal candidate will have experience in developing and deploying robust data solutions.
Responsibilities
- Perform Python engineering with strong development, maintenance/debugging, unit/integration testing, and CI/CD practices.
- Work on GenAI document extraction, including prompting and evaluation.
- Handle PDF/document parsing and text matching/validation.
- Utilize Azure OpenAI Services, VLM/OCR/layout models, ReqIF/XML handling, and DNG/DOORS import workflows.
- Apply basic DevOps principles, including containers, logging/monitoring.
Requirements
- Experience: 3+ years in ML/data science in enterprise settings.
- Deep Learning: Proficiency in supervised/unsupervised learning, Segmentation, anomaly detection, model evaluation, feature engineering, and Pytorch.
- Programming: Expert-level proficiency in Python.
- Data: Familiarity with ETL/ELT, and handling structured/unstructured data.
- Tools: Experience with Git, VSCode, MLflow, Docker, Azure ML Studio, Azure DevOps.
- Domain: Manufacturing/Quality inspection department experience is preferred but not mandatory.
Nice to Have Skills
- Docker
- Kubernetes
- ReactJS
- Frontend development
Skills
MLData scienceDLsupervised learningunsupervised learningSegmentationAnomaly DetectionModel EvaluationFeature EngineeringPyTorchPythonEtlELTGitVSCodeMlflowDockerAzure Ml StudioAzure DevopsGenaipromptingEvaluation