CHB

Chethan H B

Data Engineer

https://www.opentalent.in/chethan-h-b-6437639

Data Engineer with 1+ years in Python, SQL & ETL/ELT pipelines.

Deloitte

Key Strengths

Strong foundational understanding of ETL/ELT pipelines, data modeling, and data ingestion, as evidenced by multiple personal projects.
Proficiency in Python and SQL for data manipulation, cleaning, and transformation.
Experience with modern data tools like dbt for data quality and transformation, and AWS S3 for data loading.
Demonstrated ability to design and implement star schema data models for analytical purposes.
Practical experience with data visualization tools like Power BI and Tableau, and libraries like Matplotlib and Pandas.
Familiarity with API integration for data extraction and version control using Git/GitHub.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate demonstrates a strong cultural fit for a data-driven organization, showing initiative through diverse personal projects covering various aspects of data engineering and analytics. The focus on building robust data platforms and contributing to business insights aligns well with a growth-oriented, impact-driven culture. The breadth of skills and tools used across projects indicates adaptability and a willingness to learn new technologies.

Soft Skills & Operational Fit

The candidate's resume highlights a 'collaborative, ownership-driven, and passionate' attitude, which suggests a good operational fit for team environments. The project descriptions indicate an ability to work independently on end-to-end solutions and a focus on data integrity and documentation, aligning with best practices in data engineering.

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

Projects

End-to-End ELT Data Pipeline (Portfolio Project)

January 1, 2026 – June 1, 2026

Designed and built a data ingestion pipeline that extracts data from a public REST API, loads raw JSON into AWS S3, and transforms it into structured Silver/Gold layer tables using dbt models. Implemented data quality checks using dbt tests (not_null, unique, accepted_values) to validate pipeline outputs at each transformation stage. Modeled data using star schema principles with clearly defined fact and dimension tables for downstream analytics consumption. Versioned all pipeline code and dbt models using Git; documented transformation logic and lineage using dbt docs.

Uber Rides Data Ingestion & Transformation Pipeline

January 1, 2026 – June 1, 2026

Built an automated data ingestion script that loaded raw Uber ride CSVs into a PostgreSQL database, simulating a marketplace data source pipeline. Wrote modular Python functions for data cleaning, datetime feature engineering, and deduplication — reusable across multiple pipeline runs. Authored SQL transformation queries to aggregate ride demand by time, purpose, and distance — equivalent to Silver/Gold layer logic in a modern ELT stack. Validated pipeline outputs with row count checks and null-value assertions to ensure data integrity end-to-end.

Olympics Analytics Dashboard (1896–2024)

January 1, 2026 – June 1, 2026

Designed a Power BI dashboard fed by a Python-based data preparation pipeline that standardized, cleaned, and merged historical Olympics datasets spanning 128 years. Implemented data model relationships in Power BI (star schema) connecting athlete, country, event, and medal fact tables for multi-dimensional analysis. Enabled real-time interactivity with slicers for country, region, gender, and year — allowing users to explore complete country performance trends across all Olympic games.

Customer Segmentation Data Model

January 1, 2026 – June 1, 2026

Designed and implemented a data model to structure retail transaction data for customer analytics, applying PCA for dimensionality reduction prior to clustering. Used K-Means and Hierarchical Clustering to segment customers into distinct groups based on purchasing behaviour; stored results back to a PostgreSQL table for downstream use. Built a clean data transformation pipeline using Pandas for feature engineering before loading modeled outputs to the database.

Certifications

Database Management & SQL

Udemy

June 1, 2026 – Present

GenAl for Everyone

Coursera

June 1, 2026 – Present

Artificial Intelligence Fundamentals

IBM Skills Build

June 1, 2026 – Present

Key Strengths

Strong foundational understanding of ETL/ELT pipelines, data modeling, and data ingestion, as evidenced by multiple personal projects.
Proficiency in Python and SQL for data manipulation, cleaning, and transformation.
Experience with modern data tools like dbt for data quality and transformation, and AWS S3 for data loading.
Demonstrated ability to design and implement star schema data models for analytical purposes.
Practical experience with data visualization tools like Power BI and Tableau, and libraries like Matplotlib and Pandas.
Familiarity with API integration for data extraction and version control using Git/GitHub.

Cultural & Operational Fit

Cultural Fit Analysis

Soft Skills & Operational Fit

Chethan H B

Key Strengths

Cultural & Operational Fit

About

Top Skills

Skills

Education

Experience

Projects

Certifications

Key Strengths

Cultural & Operational Fit