Gautham Vasan

Member of technical staff

https://www.opentalent.in/gautham-vasan

Research Scientist | Understanding Intelligence

Key Strengths

Extensive academic and research background in Reinforcement Learning, Deep Learning, and Robotics, evidenced by multiple projects and a PhD candidacy.
Practical experience applying AI/ML techniques to real-world robotics domains at Kindred.ai and Sanctuary AI.
Strong foundation in control systems and autonomous navigation, demonstrated by projects like 'Model Predictive Control for Micro Aerial Vehicle (MAV) Systems' and 'Autonomous Path Finder'.
Experience with complex problem-solving in areas like medical image segmentation ('Multiple Sclerosis Lesion Segmentation in MRI Images') and fault-tolerant systems ('Efficient Fault Tolerant Routing Algorithm for New Hybrid Topology for regular NoC').

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

About

I’m interested in building machines with animal-like intelligence. To this end, I aim to understand computational principles that could enable agents such as robots to continually learn, adapt, develop, and improve throughout their lives. I have worked extensively on online reinforcement learning, imitation learning, and sim-to-real approaches for robotic manipulation.

Top Skills

Market ResearchAlgorithmsResearchMicrosoft OfficePowerPointAnalyticsMicrosoft ExcelBusiness StrategyImage ProcessingMachine LearningEmbedded SystemsDigital Signal ProcessingMarketing StrategyTeam ManagementData AnalysisSocial MediaRobotics

Education

University of Alberta

Doctor of Philosophy - PhD, Statistical Machine Learning

January 1, 2020 – January 1, 2025

University of Alberta

Master’s Degree, Computing Science

January 1, 2015 – January 1, 2017

National Institute of Technology, Tiruchirappalli

Bachelor of Technology (B.Tech.), Instrumentation and Control Engineering

January 1, 2011 – January 1, 2015

Experience

Zyphra

Member of Technical Staff

February 1, 2026 – Present

San Francisco Bay Area · On-site

Sanctuary AI

Research Intern: Reinforcement/Imitation Learning

May 1, 2025 – September 1, 2025

Vancouver, British Columbia, Canada · On-site

University of Freiburg

Visiting Researcher | DAAD Scholar

March 1, 2023 – June 1, 2023

Freiburg, Baden-Württemberg, Germany · On-site

University of Alberta

Doctoral Student

September 1, 2020 – Present

Edmonton, Alberta, Canada

Kindred.ai

Machine Learning Researcher

June 1, 2019 – August 1, 2020

Greater Toronto Area, Canada

Kindred.ai

Artificial Intelligence (AI) Engineer

September 1, 2017 – May 1, 2019

Greater Toronto Area, Canada

University of Alberta

Graduate Research Assistant

May 1, 2016 – August 1, 2017

University of Alberta

Masters Candidate

September 1, 2015 – August 1, 2017

University of Alberta

Graduate Teaching Assistant

September 1, 2015 – April 1, 2016

IIIT Hyderabad

Research Intern

May 1, 2014 – July 1, 2014

Hyderabad Area, India

Festember

Head of Treasury

April 1, 2014 – May 1, 2015

India

Pragyan Guest Lectures

Manager

October 1, 2013 – April 1, 2014

India

Spider (R&D Club)

Researcher

May 1, 2013 – March 1, 2015

Trichy

Connect NIT

Web Developer

July 1, 2012 – April 1, 2013

Trichy

NITTFEST'13

Coordinator

July 1, 2012 – April 1, 2013

Trichy

Pragyan Crossfire

Coordinator

July 1, 2012 – May 1, 2013

Trichy

NIT Trichy

Deputy Manager, Festember Marketing & Media Relations

April 1, 2012 – April 1, 2014

Trichy, Tamil Nadu

Projects

Learning from Demonstration: Teaching a Myoelectric Prosthesis using an intact Limb

May 1, 2016 – Present

Prosthetic arms should restore and extend the capabilities of someone with an amputation. They should move naturally and be able to perform elegant, coordinated movements like a biological arm. However control of modern day prostheses is non-intuitive and taxing; it does not give rise to graceful, synergistic movements. Using actor-critic reinforcement learning, we present a method that allows someone with an amputation to use their non-amputated arm to teach their prosthetic arm how to move through a wide range of coordinated motions and grasp patterns. We evaluate our method during the myo-electric control of a multi-joint robot arm and demonstrate that the robot arm is able to learn a control policy for a particular sequence of gestures and movements involving 3 Degrees of Freedom (DoFs).

Deep Reinforcement Learning for Hex

March 1, 2016 – May 1, 2016

DeepMind’s recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents — e.g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning — raises many questions, including to what extent these methods will succeed in other domains. In this paper we consider DQL for the game of Hex: after supervised initializing, we use selfplay to train NeuroHex, an 11-layer CNN that plays Hex on the 13 × 13 board. Hex is the classic two-player alternate-turn stone placement game played on a rhombus of hexagonal cells in which the winner is whomever connects their two opposing sides. Despite the large action and state space, our system trains a Q-network capable of strong lay with no search. After two weeks of Q-learning, NeuroHex achieves win-rates of 20.4% as first layer and 2.1% as second player against a 1-second/move version of MoHex, the current ICGA olympiad Hex champion. Our data suggests further improvement might be possible with more training time.

Robot Control with Policy Gradient Actor-Critic Methods

January 1, 2016 – April 1, 2016

Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The aim of this independent study is to gain understanding of the current state-of-the-art and historical background of reinforcement learning (RL) research in policy gradient methods and actor-critic algorithms. In this article, we attempt to understand the different algorithms used to incorporate the above topics into deployed (real-world) reinforcement learning systems. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions from Actor-Critic algorithms, Policy gradient methods, etc., handle the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. The main objective for the experiments in this independent study is to deploy Prediction Learning methods coupled with Natural Actor Critic algorithms in real-world robots.

Multiple Sclerosis Lesion Segmentation in MRI Images

October 1, 2015 – December 1, 2015

Magnetic Resonance Imaging (MRI) is commonly used to detect Multiple Sclerosis Lesions in the Central Nervous System. The lesions are anatomically variable, thus, accurate segmentation of MS lesions is challenging. Since manual segmentation requires domain specific knowledge and is expensive both in terms of time and cost, we aim to automatically segment lesions. Our focus is on comparing classification techniques using a wide range of features in order to find the most successful method. We also attempt to combine some techniques to improve detection success. Our results indicate that Markov Random Fields with Random Forest provides the best result but there are other options useful in segmenting lesions. Interestingly, Random Forest is better in terms of computational complexity than other algorithms, especially when compared to SVM. This makes it the most practical in terms of both accuracy and speed.

Model Predictive Control for Micro Aerial Vehicle (MAV) Systems

May 1, 2014 – Present

I'm responsible for the development of an algorithm for trajectory tracking for quad rotors. The modelling and simulation of the trajectory tracking algorithm for the Quad rotor was successfully implemented. The algorithm is being currently being tested on the Parrot Ar Drone using ROS and Python coupled with CVXOPT for solving the quadratic constraints.

A Control Strategy for an Autonomous Robotic Vacuum Cleaner for Solar Panels

December 1, 2013 – January 1, 2014

I was primarily responsible for the Embedded Coding and design aspects of the robot. Abstract: Accumulation of dust on the surface of solar panels reduces the amount of radiation reaching it. This leads to loss in generated electric power and formation of hotspots which would permanently damage the solar panel. This project aims at developing an autonomous vacuum cleaning method which can be used on a regular basis to maximize the lifetime and efficiency of a solar panel. This system is implemented using two subsystems namely a Robotic Vacuum Cleaner and a Docking Station. The Robotic Vacuum Cleaner uses a two stage cleaning process to remove the dust from the solar panel. It is designed to work on inclined and slippery surfaces. A control strategy is formulated to navigate the robot in the required path using an appropriate feedback mechanism. The battery voltage of the robot is determined periodically and if it goes below a threshold, it returns to the docking station and charges itself automatically using power drawn from the solar panels. The operation of the robotic vacuum cleaner has been verified and relevant results are presented. The DC Charging circuit in the docking station is simulated in Proteus environment and is implemented in hardware. An economical, robust Robotic Vacuum Cleaner which can clean arrays of Solar panels (with or without inclination) interlinked by rails and recharge itself automatically at a docking station is designed and implemented.

Shadow Arm

July 1, 2013 – August 1, 2013

A Kinect based robotic arm that seamlessly replicates the motion of the human arm (Up to the wrist) using TI Stellaris LM4F210, Servo-Motors & OpenCV.

Efficient Fault Tolerant Routing Algorithm for New Hybrid Topology for regular NoC

May 1, 2013 – June 1, 2013

Implemented Network on Chip (NoC) concepts using Verilog HDL & developed an efficient fault tolerant algorithm for making the system robust & efficient. It was under the guidance of Prof. Dr.G. Lakshmi Narayanan at ECE Dept, NIT Trichy.

Autonomous Path Finder

February 1, 2013 – Present

Performs Colour & Shape Recognition in Matlab from live image feed & finds the shortest path required for autonomous navigation. It was controlled from the Laptop using UART communication.

In-Plant Training at TVS Sundaram Fasteners Ltd

December 1, 2012 – Present

During the training period, various practices of the organization of TVS Sundaram Fasteners Ltd was learnt. Also an insight into how sprockets, hubs & shafts are manufactured and the various tests it undergoes to be a commercially viable product was obtained.

Cricket, Hangman & Tic Tac Toe

January 1, 2011 – Present

The games were developed in C++ using graphic libraries. It uses unique user interfaces with different difficulty modes, load/save options etc.

Solar Tracker

September 1, 2010 – Present

It is a solar panel designed to orient itself toward the sun all day to make the system efficient & compact. Our method improves the efficiency of the system by 30%. It was submitted for CBSE Regional Science Exhibition

Key Strengths

Extensive academic and research background in Reinforcement Learning, Deep Learning, and Robotics, evidenced by multiple projects and a PhD candidacy.
Practical experience applying AI/ML techniques to real-world robotics domains at Kindred.ai and Sanctuary AI.
Strong foundation in control systems and autonomous navigation, demonstrated by projects like 'Model Predictive Control for Micro Aerial Vehicle (MAV) Systems' and 'Autonomous Path Finder'.
Experience with complex problem-solving in areas like medical image segmentation ('Multiple Sclerosis Lesion Segmentation in MRI Images') and fault-tolerant systems ('Efficient Fault Tolerant Routing Algorithm for New Hybrid Topology for regular NoC').

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's extensive involvement in academic research, including a PhD, and roles at AI-focused companies like Kindred.ai and Sanctuary AI, indicates a strong alignment with a research-driven, innovative, and technically challenging environment. The diversity of personal projects, ranging from robotics to game development and image processing, shows a broad intellectual curiosity and a proactive approach to learning and application. The candidate's experience in various organizational roles for university festivals also suggests an ability to work in teams and manage complex initiatives, contributing to a well-rounded profile. However, the lack of explicit collaboration or team-oriented project descriptions in the technical projects makes it difficult to fully assess team collaboration style beyond organizational roles.

Soft Skills & Operational Fit

The candidate's project descriptions indicate a strong drive for research and problem-solving, often in collaborative academic settings. Roles like 'Head of Treasury' and 'Manager' for cultural/technical festivals suggest organizational and leadership capabilities. The detailed descriptions of research projects imply strong analytical and communication skills in a technical context. The focus on advanced AI/ML and robotics research suggests a fit for roles requiring innovation and deep technical contributions.

Gautham Vasan

Key Strengths

About

Top Skills

Skills

Education

Experience

Projects

Certifications

Key Strengths

Cultural & Operational Fit