remote
Senior Generative AI Engineer / Data Scientist - Capgemini
Data Scientist
Senior AI Engineer with deep financial services expertise, designing and scaling generative AI data pipelines, vector stores, and LLM‑driven applications for risk, fraud, compliance, and document intelligence.
About the role
Key Responsibilities
- Architect and implement end‑to‑end data pipelines that ingest, transform, and store large‑scale financial data for generative AI workloads.
- Design, deploy, and maintain vector databases and retrieval systems to support LLM‑based analytics and document intelligence.
- Integrate large language models (LLMs) using frameworks such as LangChain to build risk analytics, fraud detection, and regulatory compliance solutions.
- Collaborate with domain experts to translate banking, capital markets, and insurance use cases into production‑grade AI services.
- Ensure scalability, security, and performance on cloud platforms (AWS, Azure, GCP) using containerization (Docker) and orchestration (Kubernetes).
Requirements
- 5+ years of data engineering or data science experience, with a focus on generative AI in financial services.
- Proficiency in Python, PySpark, and SQL for building robust data pipelines.
- Hands‑on experience with cloud services (AWS, Azure, or GCP) and container orchestration (Docker, Kubernetes).
- Demonstrated ability to work with LLMs, LangChain, and vector search technologies.
- Strong understanding of financial domain concepts such as risk analytics, fraud detection, and regulatory compliance.
Skills
pythonsqlawsdockerkuberneteslangchain