
Building datamasterylab.com
AI is analyzing your overall score…
Identifying your key strengths…
Evaluating your skill match against the job requirements…
Assessing your cultural and operational fit
data-mastery-lab
Backend Engineer
June 23, 2026 – Present
Kubernetes-For-DataEngineering
January 24, 2024 – January 26, 2024
This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering environment using Kubernetes and Apache Airflow
View Projectmodern-data-eng-dbt-databricks-azure
December 18, 2023 – December 18, 2023
In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our cloud provider.
View Projectrealtime-voting-data-engineering
December 6, 2023 – December 11, 2023
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
View ProjectFlinkCommerce
December 3, 2023 – December 4, 2023
This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres
View Projectchangecapture-e2e
November 27, 2023 – May 17, 2024
This project shows how to capture changes from postgres database and stream them into kafka
View ProjectSparkingFlow
November 4, 2023 – March 14, 2024
This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python, Scala and Java as an example.
View ProjectRealtimeStreamingEngineering
October 28, 2023 – January 4, 2024
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from data acquisition, processing, sentiment analysis with ChatGPT, production to kafka topic and connection to elasticsearch.
View ProjectRedditDataEngineering
October 23, 2023 – October 23, 2023
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
View ProjectFootballDataEngineering
October 2, 2023 – October 2, 2023
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
View Projecte2e-data-engineering
September 6, 2023 – February 14, 2025
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
View ProjectCultural Fit Analysis
The candidate's projects are exclusively personal data engineering projects, indicating a strong passion for the domain. However, the lack of diverse team projects or contributions to open source beyond personal repositories makes it difficult to assess collaboration and broader cultural fit. The current role as 'Backend Engineer' at 'data-mastery-lab' aligns with the target role, but details are missing.
Soft Skills & Operational Fit
Insufficient data to assess soft skills and operational fit. The candidate's projects demonstrate strong technical initiative and problem-solving in data engineering contexts.