Skip to content

Udacity's Machine Learning Engineer Nanodegree Capstone Project

Notifications You must be signed in to change notification settings

jeffgerlach/udacity_ml_engineer_capstone

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Udacity Machine Learning NanoDegree Capstone Project

Jeff Gerlach
25 July 2019

Toxic Comment Classification Challenge

Identify and classify toxic online comments

https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge

Project Setup

This machine learning project uses relatively smaller datasets, so development was performed locally using a Jupyter .ipynb notebook on a Conda environment.

Jupyter Notebook file:

comment_toxicity-final.ipynb

Python libraries required:
  • numpy
  • pandas
  • matplotlib.pyplot
  • IPython.display
  • seaborn
  • sklearn
  • scipy
  • keras
  • skmultilearn
  • scikitplot
Data files:

Kaggle training and test data can be found here - (sample_submission.csv, test_labels.csv, test.csv, train.csv)

fastText word embedding can be found here - (crawl-300d-2M.vec)

About

Udacity's Machine Learning Engineer Nanodegree Capstone Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published