Shivam Gautam

Software Development Engineer

https://www.opentalent.in/shivam-gautam-7993219

Software Development Engineer with 3+ years in GPU & ML Inference Infrastructure

Fujitsu Research

Key Strengths

Exceptional expertise in GPU programming (CUDA) and performance optimization for ML inference, demonstrated by significant speedups and contributions to projects like llama.cpp and OpenBLAS.
Deep understanding of low-level system architecture, including NUMA, multithreading, memory access patterns, and cache optimization.
Strong background in compiler design, static analysis, and distributed systems, indicating a robust theoretical foundation combined with practical implementation skills.
Proven ability to diagnose and resolve complex performance bottlenecks in highly optimized systems, leading to substantial throughput improvements.
Experience with various hardware architectures (NVIDIA GPUs, ARM SVE, Graviton) and their specific optimization challenges.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's academic background from IIT Bombay and their current role at Fujitsu Research, combined with contributions to open-source projects (llama.cpp, OpenBLAS) and publications, suggest a strong alignment with a research-oriented, high-performance engineering culture. Their diverse project portfolio, ranging from static analysis to distributed systems and GPU acceleration, indicates adaptability and a broad technical curiosity. The 'Employee of the Quarter' award further highlights their commitment and impact within a professional setting.

Soft Skills & Operational Fit

The candidate demonstrates strong problem-solving skills, evidenced by their ability to diagnose and resolve complex performance issues (e.g., throughput cliff in llama.cpp, OpenBLAS oversubscription). Their contributions to open-source projects and publications suggest a collaborative mindset and a drive for continuous learning and sharing knowledge. The detailed descriptions of their work indicate a methodical approach to engineering and a focus on measurable impact.

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

Projects

Low-Level Static Analysis Engine for C++

January 1, 2021 – January 1, 2023

Built an LLVM-IR static analysis engine for C++ — custom alias-analysis passes feeding SAT/SMT constraints into a bounded model checker, with full exception-handling (invoke/landingpad/resume) encoding; cut solve time 13% and verified 6/10 cases where CBMC crashed on all.

View Project

Cassandra-Inspired Distributed KV Store

January 1, 2021 – January 1, 2023

Built a leaderless 6-node KV store — CHORD ring with finger tables for O(log n) gRPC routing, gossip protocol for decentralised membership and failure detection; LSM-tree write path with locked memtable, async SSTable flush, and background compaction; node addition/removal with automatic key rebalancing and cache with fine-grained locking.

PageRank Acceleration

January 1, 2021 – January 1, 2023

Implemented GPU-accelerated PageRank (power iteration) in CUDA — CSR graph representation for coalesced warp memory access, shared memory reduction for convergence checks, and pointer-swap double buffering to eliminate data-race conditions; achieved ~50× speedup over CPU baseline on 1M-node graphs via full SM occupancy on T4.

Key Strengths

Exceptional expertise in GPU programming (CUDA) and performance optimization for ML inference, demonstrated by significant speedups and contributions to projects like llama.cpp and OpenBLAS.
Deep understanding of low-level system architecture, including NUMA, multithreading, memory access patterns, and cache optimization.
Strong background in compiler design, static analysis, and distributed systems, indicating a robust theoretical foundation combined with practical implementation skills.
Proven ability to diagnose and resolve complex performance bottlenecks in highly optimized systems, leading to substantial throughput improvements.
Experience with various hardware architectures (NVIDIA GPUs, ARM SVE, Graviton) and their specific optimization challenges.

Cultural & Operational Fit

Cultural Fit Analysis

Soft Skills & Operational Fit

Shivam Gautam

Key Strengths

Cultural & Operational Fit

About

Top Skills

Skills

Education

Experience

Projects

Certifications

Key Strengths

Cultural & Operational Fit