Shankar Kethavath

Data Engineer

https://www.opentalent.in/shankar-kethavath

Data Engineer with less than a year in Data Pipelines & Real-Time Processing

Key Strengths

Extensive hands-on experience in designing and deploying real-time data pipelines and ETL workflows, particularly on edge hardware (Raspberry Pi).
Proficient in a wide array of data engineering technologies including Apache Spark, PostgreSQL, Snowflake, Docker, and Apache Kafka.
Demonstrated ability to integrate advanced AI/ML components (computer vision, OCR, Generative AI) into data processing architectures.
Strong focus on data quality, performance optimization (SQL query optimization, sub-second latency), and automation (Apache Airflow-style scheduling).
Experience with cloud-native data warehousing and containerized deployments, indicating modern data infrastructure skills.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's projects demonstrate a strong alignment with the target role of Data Engineer, focusing on real-time processing, ETL, and data warehousing. The diversity of projects, from real-time telemetry to edge AI data ingestion, showcases a broad skill set and adaptability. The use of various modern technologies and integration of AI components indicates a forward-thinking and innovative mindset, which aligns well with dynamic technical environments. The Stanford certification in Machine Learning further emphasizes a commitment to continuous learning and advanced technical capabilities.

Soft Skills & Operational Fit

The candidate's resume highlights strong problem-solving skills through optimizing query performance and reducing data errors. Their experience with real-time monitoring dashboards and automated alerts suggests a proactive approach to operational health. The ability to work with resource-constrained environments (Raspberry Pi) indicates adaptability and resourcefulness. The project descriptions imply a collaborative approach by enabling cross-team analytics consumption.

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

About

Data Engineer with hands on experience of designing and deploying scalable data pipelines, ETL workflows, and real-time data processing systems. Hands-on expertise in Python, PySpark, SQL, PostgreSQL, and Snowflake with proven ability to build edge-deployed, multiprocessing data ingestion architectures. Experienced with cloud-native data warehousing, containerized deployments using Docker, and Generative AI integration into data workflows. Adept at optimizing pipelines for high-throughput, low-latency performance on resource-constrained and distributed environments. Stanford-certified in Machine Learning.

Top Skills

GitScikit Learn

Projects

Real-Time Telemetry Data Pipeline – DriveGuard AI

February 1, 2025 – Present

Architected an end-to-end real-time data pipeline fusing GPS coordinates, IMU sensor streams, and computer vision detection outputs into a unified analytical data store — processing 10K+ events/day with 99%+ uptime. Executed IoU-based event deduplication and multi-frame aggregation logic to clean and normalize raw sensor outputs before loading into the PostgreSQL data warehouse. Architected the Snowflake schema for long-term historical data retention and enabled Power BI-compatible data exports for cross-team analytics consumption.

Edge AI Data Ingestion System – Raspberry Pi Deployment

December 1, 2024 – January 1, 2025

Built a distributed edge data ingestion architecture on Raspberry Pi with 5 parallel processing workers, each handling a dedicated data stream (helmet detection, lane analysis, ANPR, vehicle classification, counting). Engineered Kafka-compatible event queuing logic to buffer high-frequency data bursts and ensure zero data loss during peak traffic capture windows; containerized all modules with Docker for portable deployment.

Certifications

Machine Learning Specialization: Supervised learning, unsupervised learning, and advanced ML techniques.

Stanford University / DeepLearning.AI

June 1, 2026 – Present

Power BI Certification

TNX

January 1, 2024 – Present

Key Strengths

Extensive hands-on experience in designing and deploying real-time data pipelines and ETL workflows, particularly on edge hardware (Raspberry Pi).
Proficient in a wide array of data engineering technologies including Apache Spark, PostgreSQL, Snowflake, Docker, and Apache Kafka.
Demonstrated ability to integrate advanced AI/ML components (computer vision, OCR, Generative AI) into data processing architectures.
Strong focus on data quality, performance optimization (SQL query optimization, sub-second latency), and automation (Apache Airflow-style scheduling).
Experience with cloud-native data warehousing and containerized deployments, indicating modern data infrastructure skills.

Cultural & Operational Fit

Cultural Fit Analysis

Soft Skills & Operational Fit

Shankar Kethavath

Key Strengths

Cultural & Operational Fit

About

Top Skills

Skills

Education

Experience

Projects

Certifications

Key Strengths

Cultural & Operational Fit