onsite
Experienced - Data Scientist GenAI
Data Scientist
Lead data science initiatives focused on Generative AI, leveraging Python, Apache Spark, and Docker to build scalable GPT-based models and pipelines.
About the role
Key Responsibilities
- Design, develop, and deploy large-scale generative AI models using GPT architectures.
- Engineer data pipelines in Apache Spark to preprocess and transform massive datasets.
- Containerize models and services with Docker for reproducible, scalable deployments.
- Collaborate with cross‑functional teams to integrate AI solutions into production workflows.
- Conduct performance tuning, monitoring, and continuous improvement of AI models.
Requirements
- Proven experience with Python and machine learning frameworks.
- Strong background in Apache Spark and distributed data processing.
- Hands‑on expertise in Docker and container orchestration.
- Deep understanding of generative AI, GPT models, and related NLP techniques.
- Excellent problem‑solving skills and ability to translate business needs into technical solutions.
Skills
pythonapache sparkdockergenerative aimachine learning