onsite
AI Automation Developer & Data Scientist - Tata Consultancy Services (TCS)
Data Scientist
Develop AI‑driven automation solutions and scalable data pipelines using Python, Node.js, and modern ML frameworks, while leveraging cloud services (AWS) and big‑data technologies such as Spark and Kafka.
About the role
Key Responsibilities
- Design, implement, and maintain AI automation workflows using LLM and ML frameworks such as LangChain, TensorFlow, and PyTorch.
- Build and optimize scalable data pipelines with Spark, Hadoop, and Kafka to ingest, process, and store large‑volume datasets.
- Develop RESTful APIs and JSON interfaces for seamless integration with internal and external systems.
- Automate the provisioning and management of cloud resources across AWS, GCP, and Azure, including AWS services such as Redshift, S3, Glue, and EMR.
- Integrate security platforms and support SOC detection engineering workflows.
Requirements
- Strong proficiency in Python and Node.js; experience with at least one additional language (e.g., Scala or Java) is a plus.
- Hands‑on experience with major ML libraries (TensorFlow, PyTorch, scikit‑learn) and LLM orchestration tools (LangChain, LangGraph).
- Proven ability to design and operate big‑data pipelines using Spark, Hadoop, and Kafka.
- Deep knowledge of cloud services (AWS, GCP, Azure) and data platform components such as Redshift, S3, Glue, and Dataflow.
- Familiarity with security platform integrations and SOC workflow automation.
Skills
Python, Node.js, TensorFlow, PyTorch, AWS, Kafka, LangChain