This repository contains notebooks covering the basics of NLP (Natural Language Processing), such as core algorithms and basic tools, organized into topic-specific folders. It is mainly intended for beginners who want to get started with NLP and shows, with detailed explanations, what they should become familiar with.
- notebooks for the assignments and labs of the Natural Language Processing in TensorFlow course from DeepLearning.AI on Coursera
- notebooks to see different basic visualizations with matplotlib, one per cell.
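As a taste of what those notebooks cover, here is a minimal matplotlib line plot; the data and labels are illustrative, not taken from the notebooks:

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so this runs without a display
import matplotlib.pyplot as plt

# A minimal line plot of the kind shown in the visualization notebooks
xs = [0, 1, 2, 3, 4]
ys = [x ** 2 for x in xs]

fig, ax = plt.subplots()
ax.plot(xs, ys, marker="o", label="y = x^2")
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.legend()
fig.savefig("basic_plot.png")
```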
- notebooks to do topic modeling with Latent Dirichlet Allocation.
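For a sense of what LDA topic modeling looks like, here is a minimal sketch using scikit-learn's `LatentDirichletAllocation` on a made-up four-document corpus; the notebooks themselves may use a different library:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

# Toy corpus with two rough themes (pets vs. space)
docs = [
    "cat dog pet animal fur",
    "dog puppy pet bark animal",
    "rocket space planet orbit star",
    "star planet telescope space orbit",
]

# Bag-of-words counts, then fit a 2-topic LDA model
vectorizer = CountVectorizer()
counts = vectorizer.fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0)
doc_topics = lda.fit_transform(counts)  # per-document topic proportions

print(doc_topics.shape)  # (4, 2); each row sums to 1
```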
- stores data files such as .csv files and model files that the notebooks need to load
tokenize_basic_tensorflow_keras.ipynb
- notebook with basic tokenization code that tokenizes sentences on spaces using TensorFlow and Keras
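The essence of that tokenization step can be sketched without TensorFlow: build a word index from space-split sentences, then map each sentence to a sequence of integer ids. The function names below are illustrative, not the Keras API:

```python
# Dependency-free sketch of what a word-level tokenizer does:
# assign each word an integer id, then encode sentences as id sequences.
def fit_word_index(sentences):
    word_index = {}
    for sentence in sentences:
        for word in sentence.lower().split():
            if word not in word_index:
                word_index[word] = len(word_index) + 1  # ids start at 1
    return word_index

def texts_to_sequences(sentences, word_index):
    return [[word_index[w] for w in s.lower().split() if w in word_index]
            for s in sentences]

sentences = ["I love my dog", "I love my cat"]
word_index = fit_word_index(sentences)
print(word_index)  # {'i': 1, 'love': 2, 'my': 3, 'dog': 4, 'cat': 5}
print(texts_to_sequences(sentences, word_index))  # [[1, 2, 3, 4], [1, 2, 3, 5]]
```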
- checking synonyms and hypernyms in WordNet using NLTK
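A hypernym is a more general term for a word (dog → canine → mammal). The notebook uses NLTK's real WordNet corpus; the sketch below only illustrates the idea of walking up such a taxonomy, using a tiny hand-made dictionary as a stand-in:

```python
# Toy stand-in for WordNet's hypernym relation (NLTK's wordnet corpus
# provides the real data via synset.hypernyms()).
hypernym_of = {
    "dog": "canine",
    "canine": "carnivore",
    "carnivore": "mammal",
    "mammal": "animal",
}

def hypernym_chain(word):
    """Walk up the taxonomy until the most general term is reached."""
    chain = [word]
    while chain[-1] in hypernym_of:
        chain.append(hypernym_of[chain[-1]])
    return chain

print(hypernym_chain("dog"))  # ['dog', 'canine', 'carnivore', 'mammal', 'animal']
```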
- normalizing and tokenizing tweets, including handling stopwords, punctuation, stemming, lowercasing, and hyperlinks; needs to import utils.py
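Those preprocessing steps can be sketched roughly as follows. This is a simplified stand-in for what utils.py provides: the stopword list and the suffix-stripping "stemmer" here are toys (the notebook uses NLTK's stopword corpus and PorterStemmer):

```python
import re
import string

# Toy stopword list; NLTK ships a much larger one.
STOPWORDS = {"a", "an", "the", "is", "and", "to", "i", "my", "am"}

def simple_stem(word):
    # Crude suffix stripping as a stand-in for a real stemmer
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

def process_tweet(tweet):
    tweet = re.sub(r"https?://\S+", "", tweet)                      # drop hyperlinks
    tweet = tweet.lower()                                           # lowercase
    tweet = tweet.translate(str.maketrans("", "", string.punctuation))  # drop punctuation
    tokens = tweet.split()
    return [simple_stem(t) for t in tokens if t not in STOPWORDS]

print(process_tweet("I am loving the sunny day! https://example.com"))
```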
- the utility file imported by preprocessing.ipynb and building_and_visualizing_word_frequencies.ipynb
- notebook showing how to do linear algebra with vectors and matrices using NumPy
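A few of the basic operations that kind of notebook covers, sketched with made-up vectors and matrices:

```python
import numpy as np

# Vector operations
v = np.array([1.0, 2.0, 3.0])
w = np.array([4.0, 5.0, 6.0])
dot = np.dot(v, w)          # inner product: 1*4 + 2*5 + 3*6 = 32
norm = np.linalg.norm(v)    # Euclidean length of v

# Matrix operations
A = np.array([[1.0, 2.0],
              [3.0, 4.0]])
B = np.array([[5.0, 6.0],
              [7.0, 8.0]])
product = A @ B             # matrix multiplication
transpose = A.T             # transpose

print(dot, product.tolist())
```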
manipulating_word_embeddings.ipynb
- shows how word vectors work and how to find relations between words; requires uploading the model file word_embeddings_subset.p.
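The classic relation-finding trick is vector arithmetic plus cosine similarity. The toy 2-d "embeddings" below are invented purely to illustrate it; the notebook does the same with real vectors from word_embeddings_subset.p:

```python
import numpy as np

# Hand-made toy embeddings (not real word vectors)
emb = {
    "king":  np.array([0.9, 0.8]),
    "queen": np.array([0.9, 0.2]),
    "man":   np.array([0.5, 0.8]),
    "woman": np.array([0.5, 0.2]),
    "apple": np.array([0.1, 0.9]),
}

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# king - man + woman should land closest to queen
target = emb["king"] - emb["man"] + emb["woman"]
best = max((w for w in emb if w not in ("king", "man", "woman")),
           key=lambda w: cosine(emb[w], target))
print(best)  # queen
```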
building_and_visualizing_word_frequencies.ipynb
- creates word frequencies for feature extraction; needs to import utils.py
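The core idea is a dictionary mapping (word, sentiment label) pairs to counts, which later becomes the basis for features. A minimal sketch with made-up tweets (utils.py provides the real helpers):

```python
from collections import defaultdict

# Made-up labelled tweets: 1 = positive, 0 = negative
tweets = ["happy happy day", "sad day", "happy mood"]
labels = [1, 0, 1]

# Count how often each word appears under each label
freqs = defaultdict(int)
for tweet, label in zip(tweets, labels):
    for word in tweet.split():
        freqs[(word, label)] += 1

print(dict(freqs))
# A tweet's features can then be e.g. the sums of its words'
# positive and negative counts.
```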
- PDF explaining PCA, based on the Singular Value Decomposition (SVD) of the covariance matrix of the original dataset, and its relation to eigenvalues and eigenvectors, which are used as the rotation matrix
- might need some images under the images directory for display in the notebook
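The PCA procedure described above can be sketched in a few lines of NumPy: center the data, take the covariance matrix, eigendecompose it, and use the eigenvector matrix as the rotation. The random dataset here is just for demonstration:

```python
import numpy as np

# Random 100x3 dataset, centered
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
X = X - X.mean(axis=0)

cov = np.cov(X, rowvar=False)            # 3x3 covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)   # symmetric matrix -> eigh
order = np.argsort(eigvals)[::-1]        # sort by decreasing variance
rotation = eigvecs[:, order]             # eigenvectors as rotation matrix

X_pca = X @ rotation[:, :2]              # project onto the top 2 components
print(X_pca.shape)  # (100, 2)
```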
logistic_regression_model.ipynb
- visualizing and interpreting logistic regression
- uses logistic_features.csv under the data directory
LogisticRegression_fromScratch.ipynb
- building and evaluating Logistic Regression from scratch: preprocessing, feature extraction, and predicting on new tweets
- includes implementing the loss function and the gradient descent learning algorithm from scratch; needs to import utils.py and w1_unittest.py
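The core pieces of such a from-scratch implementation can be sketched as below. The tiny dataset and hyperparameters are invented for illustration; the notebook's actual feature extraction and training loop differ in detail:

```python
import numpy as np

def sigmoid(z):
    """Logistic function: maps any real z to (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def loss(X, y, theta):
    """Binary cross-entropy loss."""
    h = sigmoid(X @ theta)
    return float(-np.mean(y * np.log(h) + (1 - y) * np.log(1 - h)))

def gradient_descent(X, y, theta, alpha=0.1, iters=500):
    """Repeatedly step opposite the gradient of the loss."""
    m = len(y)
    for _ in range(iters):
        h = sigmoid(X @ theta)
        theta = theta - (alpha / m) * (X.T @ (h - y))  # gradient step
    return theta

# Tiny separable dataset: bias column + one feature
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])
theta = gradient_descent(X, y, np.zeros(2))
preds = (sigmoid(X @ theta) > 0.5).astype(float)
print(preds)  # should match y on this toy set
```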
- interpreting Naive Bayes performance; requires uploading data/bayes_features.csv
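A quantity typically inspected when interpreting Naive Bayes for sentiment is each word's log-likelihood ratio between the classes. The counts below are made up for illustration (the real values come from the training data), with Laplace smoothing applied:

```python
import math

# Made-up per-class word counts
pos_counts = {"happy": 30, "sad": 2, "movie": 10}
neg_counts = {"happy": 3, "sad": 25, "movie": 12}
vocab = set(pos_counts) | set(neg_counts)

n_pos = sum(pos_counts.values())
n_neg = sum(neg_counts.values())

def log_ratio(word):
    # Laplace-smoothed class-conditional probabilities
    p_pos = (pos_counts.get(word, 0) + 1) / (n_pos + len(vocab))
    p_neg = (neg_counts.get(word, 0) + 1) / (n_neg + len(vocab))
    return math.log(p_pos / p_neg)

print({w: round(log_ratio(w), 2) for w in sorted(vocab)})
# Words with ratio > 0 lean positive; ratio < 0 leans negative.
```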
wikipedia_library.ipynb
- how to get data from Wikipedia
TensorFlow Subword Text Encoder, also known as a subword tokenizer
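The idea behind subword tokenization is to split rare words into known pieces so nothing is out-of-vocabulary. Here is a dependency-free sketch of greedy longest-match subword splitting; the hand-made vocabulary is illustrative, whereas TensorFlow's real encoder learns its vocabulary from a corpus:

```python
# Toy subword vocabulary (a real one is learned from data)
VOCAB = ["token", "izer", "ize", "un", "break", "able", "t", "o", "k"]

def subword_tokenize(word):
    """Greedily take the longest vocabulary piece at each position."""
    pieces, i = [], 0
    while i < len(word):
        match = max((v for v in VOCAB if word.startswith(v, i)),
                    key=len, default=None)
        if match is None:
            pieces.append(word[i])  # unknown character: fall back to chars
            i += 1
        else:
            pieces.append(match)
            i += len(match)
    return pieces

print(subword_tokenize("tokenizer"))    # ['token', 'izer']
print(subword_tokenize("unbreakable"))  # ['un', 'break', 'able']
```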