Projects and Work Samples

S.No Project Name Primary Tech/Skill Used Purpose Type
1 ๐Ÿ”  Machine Translation Tensorflow Seq2Seq Model architectures for Language Translation on Right-Left typed Languages Individual Work
2 ๐Ÿ’Š FinHealth Spark Distributed analysis to determine financial factors affecting the health of an individual Group Work
3 ๐Ÿจ Vanilla Network Python Math behind deep neural network learning Individual Work
4 โ™ป๏ธ Machine Unlearning / Forgetting Torch Exploring solutions to make models forget training data partially Individual Work
5 ๐Ÿ“ท Convolute Tensorflow Training several CNN architectures from scratch Individual Work
6 ๐Ÿ“ฑ SocioConnect Flask Prototype website demoing a new social media platform Group Work
7 ๐Ÿš— Auto-Insurance Risk Analysis Scikit-learn Using ML to predict whether a car is associated with any risks based on the carโ€™s attributes Group Work
8 ๐Ÿ  St.Petersburg Property Value Analysis Geopandas Property value analysis for the city of St. Pete Group Work
9 ๐Ÿ›ก๏ธ Insurance Management : System Design UML, LucidChart UML based design of an Insurance Management System Group Work
10 ๐Ÿ”Ž Analysis of Cyber attack against T-Mobile Info-Security analysis Report on the 2021 T-Mobile data breach and steps to protect against such attacks Individual Work

๐Ÿ”  Machine Translation

TensorFlowKerasPython

In this assignment I explore different strategies used in building and training a Language Translator. I use Seq2Seq learning to convert sequences from English to Hebrew(Right to left typed/ DextroSinistral). I also employ techniques such as bi-directional learning and Attention mechanism which serve as the building blocks of advanced transformer-based NLP models such as GPT, Llama etc.

Find the project on: Github

๐Ÿ’Š FinHealth

Apache Spark Python

The primary goal of this project is to identify how financial factors (and the emotional elements that come with it) affect the health of an individual. The analysis is based on data from the National Financial Well-being Survey, which also includes a section asking participants to rate their perception of their own health. The output of such an analysis would help companies operating in the FinTech space optimize their products to improvise customer experience. Though this analysis could have been completed with simple Python libraries, one of the goals associated is to demonstrate distributed storage and processing technologies - so Spark is being used to accomodate scaling in the future.

The final vision for this project is to serve as a plug and play component within FinTech products, delivering personalized experiences for each user.

Find the project on: Github YouTube

๐Ÿจ Vanilla Network

Python

I implement gradient descent and backpropagation mathematically from scratch, and implement it in Python code. These form as the building blocks of the abstract neural network layers, using which I explore how models learn patterns within data. I also test the effects of varying several structural and algorithmic hyperparameters on the learning process.

Find the project on: Github

โ™ป๏ธ Machine Unlearning

PyTorch

The field of Unlearning specifically deals with removing the influence of data points from a model without having to retrain it from scratch โ€“ in other words, making a model forget certain information. The concept struck my interest as the process of deep learning is very similar to human learning โ€“ can model forgetting be related to how the human brain forgets information as well? โ€“ we all have memories we want to remember, memories we want to forget and maybe we can help the model too!

Find the project on: Website Github

๐Ÿ“ท Convolute

TensorFlowKerasPython

This project demonstrates training various CNN architectures (including ResNet and VGG16) from scratch to build a Computer Vision Model. The implementation taught me the complexities and nuances of training an Image Classifier, and prepared me to build real-world models for CV.

Find the project on: Github

๐Ÿ“ฑ SocioConnect

FlaskJavaScript

In this project, I worked with a team in developing a website. The website is an advertising platform for a new social media app called โ€˜SocioConnectโ€™ and lets users sign up for the app. The project uses the Python Flask framework to render pages, submit forms and connect with the database.

Find the project on: Website Github

๐Ÿš— Auto-Insurance Risk Analysis

Pandasscikit-learnPython

The focus of this project was develop a classification model to better predict whether risk is associated with a car based on its attributes. This will aid in the risk assessment process to offer more individualized rates car insurance rates, promoting customer retention and company solvency. The primary motive behind this analysis was to explore machine learning models available for Structured data (numerical, categorical), train and tune them for good results, and learn the various metrics available for each kind of business problem.

Find the project on: Github

๐Ÿ›ก๏ธ Insurance Management : System Design

UML-based system design project for the day-day operations of an Insurance Company.

Find the project on: Google Drive

๐Ÿ”Ž Analysis of Cyber Attack against T-Mobile

This project report showcases a comprehensive analysis of the 2021 data breach against T-Mobile, how to avoid such incidents and countermeasures, policies against future incidents.

Find the project on: Google Drive

๐Ÿ  St.Petersburg Property Value Analysis

Python

In this project we (group) analyze the South St. Petersburg Community Redevelopment Area (CRA) that was established in 2015 to promote reinvestment in housing and neighborhoods, non-residential properties, education, and workforce development in the city. We aim to help the city with these findings by understanding historical trends of property values since the adoption of the CRA.

The dataset for this project was provided by the City of St.Pete and is confidential. So I provide a link to a video of the project demo that showcases our analysis approach.

Find the project on: YouTube