
Distributed Systems. Platform Engineering.
AI is analyzing your overall score…
Identifying your key strengths…
Evaluating your skill match against the job requirements…
Assessing your cultural and operational fit
SparkStreaming-Hudi-Pipeline
January 2, 2023 – January 2, 2023
Scala code example for using Spark Streaming to read data from a kafka topic, process and write to a Hudi table then read from Hudi table, perform a join and aggregation, and write the result to a Parquet file.
View ProjectAkkaStreamsProc
January 2, 2023 – January 2, 2023
An example of how we can use Akka streams to create a pipeline that consumes data from a Kafka topic, performs real-time analytics on the data, and writes the results to a data store
View Projectpipeline-scala
August 21, 2021 – August 21, 2021
A scala library for common data ingestion and processing tasks
View ProjectApacheHudiWriterSnippets
July 16, 2021 – July 16, 2021
Spark streaming application that reads from partitioned Kafka brokers and sink to S3 using apache hudi format
View ProjectPySparkKafkaConsumer
July 16, 2021 – July 16, 2021
A pySpark Code Snippet to query Kafka Concumer Offets Using Timestamp
View ProjectMaterialize_io_Streaming
September 19, 2020 – September 19, 2020
Scripts I have wrote for setting up and testing Materialize.io on the EC2 instances.
View ProjectLido_learning_assignment
January 7, 2020 – January 7, 2020
Lido_learning_assignment — GitHub repository
View ProjectHFresh_RecipeScore_Predictor
November 20, 2019 – September 19, 2020
A Supervised Tree Based ML Classifier on Food Recipe (Hello Fresh) - build it during 1-day hackathon and presented before panel.
View ProjectWebsite-Lead-Scoring
May 10, 2019 – May 12, 2019
A preliminary Inbound lead scoring analysis on a EdTech clickstream data - framework build on Scikit using Tree based models.
View ProjectCultural Fit Analysis
The candidate's projects demonstrate a strong inclination towards personal learning and exploration of various data engineering and machine learning technologies. The diversity of tools and frameworks used (Spark, Kafka, Hudi, Akka Streams, Materialize.io, Scikit-learn) suggests a proactive and curious mindset. However, all listed projects are personal, which limits insight into collaborative work, team dynamics, or experience within a structured organizational environment. This could indicate a preference for individual contribution or a lack of opportunity for team-based projects. The target role is 'Data Scientist', and while there are ML projects, a significant portion of the work is data engineering focused, which might require further validation of their core data science competencies in a team setting.
Soft Skills & Operational Fit
Insufficient data to assess soft skills or operational fit. No psychometric test results or interview feedback provided.