Predicting-Tags-For-Stackoverflow

Suggest tags for a question posted on Stack Overflow based on the content of that question.

Procedure:

1] We model on a reduced sample (0.5M data points) and give more weight to the title.
2] We limit the tag set to 500 tags only.
3] These two steps reduce the time needed to train the model.
4] Training on the whole dataset would require high computational resources.
5] With 500 tags we cover 90.956% of the questions.
6] Applying OneVsRest logistic regression on BOW features gives a macro F1 score of 0.3338.
7] TF-IDF performs better than BOW on this dataset.
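
The preprocessing and scoring ideas in the steps above can be sketched in plain Python. This is only an illustration, not the notebook's actual code. It rests on several assumptions: the 500 tags are taken to be the most frequent ones, a question counts as "covered" when at least one of its tags survives the cut, and the title is weighted by repeating it (the factor of 3 is a hypothetical choice). The function names `top_k_tags`, `coverage`, `weight_title`, and `macro_f1` are made up for this sketch.

```python
from collections import Counter


def top_k_tags(tag_lists, k):
    """Return the set of the k most frequent tags (assumed selection rule)."""
    counts = Counter(tag for tags in tag_lists for tag in tags)
    return {tag for tag, _ in counts.most_common(k)}


def coverage(tag_lists, kept_tags):
    """Fraction of questions that keep at least one tag (assumed definition)."""
    if not tag_lists:
        return 0.0
    covered = sum(1 for tags in tag_lists if kept_tags.intersection(tags))
    return covered / len(tag_lists)


def weight_title(title, body, repeat=3):
    """Repeat the title so its words count more in BOW or TF-IDF features.

    The repeat factor is a hypothetical value, not taken from the README.
    """
    return " ".join([title] * repeat + [body])


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-tag F1 scores; a tag with no hits scores 0."""
    scores = []
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if label in t and label in p)
        fp = sum(1 for t, p in zip(y_true, y_pred) if label not in t and label in p)
        fn = sum(1 for t, p in zip(y_true, y_pred) if label in t and label not in p)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


# Toy data, invented for illustration only.
tag_lists = [
    ["python", "dictionary"],
    ["python", "list"],
    ["java"],
    ["java", "python"],
    ["css", "html"],
]
kept = top_k_tags(tag_lists, k=2)
print("kept tags:", sorted(kept))
print("coverage:", coverage(tag_lists, kept))
print(weight_title("Sort a list", "How do I sort it?"))
```

In a full pipeline, the weighted text would be passed to a BOW or TF-IDF vectorizer and the filtered tags to a one-vs-rest classifier. Macro F1 averages the per-tag scores equally, so rare tags influence it as much as frequent ones.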

sahildigikar15/Predicting-Tags-For-Stackoverflow
