Hybrid RecSys, CF-based RecSys, Model-based RecSys, Content-based RecSys, Finding similar items using Jaccard similarity
-
Updated
Jun 3, 2021 - Python
Hybrid RecSys, CF-based RecSys, Model-based RecSys, Content-based RecSys, Finding similar items using Jaccard similarity
A Web based Domain Specific Search Engine in Python
TF-IDF scores and visualizations for documents produced over time
First story detection using shingling, LSH and graphical methods
Calculate the TF-IDF score using parallel algorithms
TF-IDF (Term frequency, Inverse Document Frequency) is an algorithm or way to score the importance of words (or 'terms') based on how frequently they appear
A complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
Add a description, image, and links to the tf-idf-score topic page so that developers can more easily learn about it.
To associate your repository with the tf-idf-score topic, visit your repo's landing page and select "manage topics."