Job Matching Widget
Tech Used: TF-IDF, word2vec algorithm, Hadoop HDFS, Python, Microsoft Azure
Project is regarding a recommendation engine built using Python NLTK and genism libraries. It compares input document with large document set and recommends 10 best fit results, Ingest and dump both structured and unstructured data using Hadoop HDFS/Hive, Built recommendation model using python, Use word2vec algorithm via genism library to build word2vec and them compare cosine similarity/Euclidean distances, Debiasing the word vector so that model does not incorporate unwanted gender/racial biases, TF-IDF algorithm used for vectorization.