Projects and Work Samples
S.No | Project Name | Primary Tech/Skill Used | Purpose | Type |
---|---|---|---|---|
1 | ๐ Machine Translation | Tensorflow | Seq2Seq Model architectures for Language Translation on Right-Left typed Languages | Individual Work |
2 | ๐ FinHealth | Spark | Distributed analysis to determine financial factors affecting the health of an individual | Group Work |
3 | ๐จ Vanilla Network | Python | Math behind deep neural network learning | Individual Work |
4 | โป๏ธ Machine Unlearning / Forgetting | Torch | Exploring solutions to make models forget training data partially | Individual Work |
5 | ๐ท Convolute | Tensorflow | Training several CNN architectures from scratch | Individual Work |
6 | ๐ฑ SocioConnect | Flask | Prototype website demoing a new social media platform | Group Work |
7 | ๐ Auto-Insurance Risk Analysis | Scikit-learn | Using ML to predict whether a car is associated with any risks based on the carโs attributes | Group Work |
8 | ๐ St.Petersburg Property Value Analysis | Geopandas | Property value analysis for the city of St. Pete | Group Work |
9 | ๐ก๏ธ Insurance Management : System Design | UML, LucidChart | UML based design of an Insurance Management System | Group Work |
10 | ๐ Analysis of Cyber attack against T-Mobile | Info-Security analysis | Report on the 2021 T-Mobile data breach and steps to protect against such attacks | Individual Work |
๐ Machine Translation
In this assignment I explore different strategies used in building and training a Language Translator. I use Seq2Seq learning to convert sequences from English to Hebrew(Right to left typed/ DextroSinistral). I also employ techniques such as bi-directional learning and Attention mechanism which serve as the building blocks of advanced transformer-based NLP models such as GPT, Llama etc.
๐ FinHealth
The primary goal of this project is to identify how financial factors (and the emotional elements that come with it) affect the health of an individual. The analysis is based on data from the National Financial Well-being Survey, which also includes a section asking participants to rate their perception of their own health. The output of such an analysis would help companies operating in the FinTech space optimize their products to improvise customer experience. Though this analysis could have been completed with simple Python libraries, one of the goals associated is to demonstrate distributed storage and processing technologies - so Spark is being used to accomodate scaling in the future.
The final vision for this project is to serve as a plug and play component within FinTech products, delivering personalized experiences for each user.
๐จ Vanilla Network
I implement gradient descent and backpropagation mathematically from scratch, and implement it in Python code. These form as the building blocks of the abstract neural network layers, using which I explore how models learn patterns within data. I also test the effects of varying several structural and algorithmic hyperparameters on the learning process.
โป๏ธ Machine Unlearning
The field of Unlearning specifically deals with removing the influence of data points from a model without having to retrain it from scratch โ in other words, making a model forget certain information. The concept struck my interest as the process of deep learning is very similar to human learning โ can model forgetting be related to how the human brain forgets information as well? โ we all have memories we want to remember, memories we want to forget and maybe we can help the model too!
๐ท Convolute
This project demonstrates training various CNN architectures (including ResNet and VGG16) from scratch to build a Computer Vision Model. The implementation taught me the complexities and nuances of training an Image Classifier, and prepared me to build real-world models for CV.
๐ฑ SocioConnect
In this project, I worked with a team in developing a website. The website is an advertising platform for a new social media app called โSocioConnectโ and lets users sign up for the app. The project uses the Python Flask framework to render pages, submit forms and connect with the database.
๐ Auto-Insurance Risk Analysis
The focus of this project was develop a classification model to better predict whether risk is associated with a car based on its attributes. This will aid in the risk assessment process to offer more individualized rates car insurance rates, promoting customer retention and company solvency. The primary motive behind this analysis was to explore machine learning models available for Structured data (numerical, categorical), train and tune them for good results, and learn the various metrics available for each kind of business problem.
๐ก๏ธ Insurance Management : System Design
UML-based system design project for the day-day operations of an Insurance Company.
๐ Analysis of Cyber Attack against T-Mobile
This project report showcases a comprehensive analysis of the 2021 data breach against T-Mobile, how to avoid such incidents and countermeasures, policies against future incidents.
๐ St.Petersburg Property Value Analysis
In this project we (group) analyze the South St. Petersburg Community Redevelopment Area (CRA) that was established in 2015 to promote reinvestment in housing and neighborhoods, non-residential properties, education, and workforce development in the city. We aim to help the city with these findings by understanding historical trends of property values since the adoption of the CRA.
The dataset for this project was provided by the City of St.Pete and is confidential. So I provide a link to a video of the project demo that showcases our analysis approach.