Skip to content

bayuwira/Text-Classification-Topic-Extraction-on-Indonesian-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

forthebadge made-with-python

Indonesian News Topic Classification

Topic classification is a supervised machine learning technique, one that needs training before being able to automatically analyze texts. First, we'll delve into what topic modeling is, how it works, and how it compares to topic classification.

Summary

Work Flow Process :

  • Data Understanding
  • Data Exploration
  • Feature Engineering with :

Symbol Remover Lemmatization Stopword remover Lowercasing

  • Model Selection
  • Model Evaluation

The Best Model is Using Linear Support Vector Machine with accuracy 87.9%

Requirement Library

list of requirement package :

Kumparanian
nlp-id
pandas
numpy
matplotlib
sklearn

Feedback 💋

Ask Me Anything !

If you want to ask something or just want to greet and follow my social media please press the badge or find me at

Facebook!

Linkedin !

Linkedin !

Contributing 👀

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

CC0

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published