FakeJobPosting

Kaggle Notebook: https://www.kaggle.com/sean49/fake-job-posting-a-hybrid-nn-classifier
A hybrid neural network classifier with minority-class oversampling.

Oversample Minority Class

Because the two classes in this dataset are highly imbalanced, I decided to oversample the minority class, that is, to add more minority-class examples to the training data. Here I used imblearn's SMOTE (https://imbalanced-learn.readthedocs.io/en/stable/generated/imblearn.over_sampling.SMOTE.html), the Synthetic Minority Oversampling Technique. Rather than duplicating existing rows, SMOTE uses a nearest-neighbors algorithm to generate new synthetic samples by interpolating between a minority-class sample and one of its neighbors.
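To make the interpolation idea concrete, here is a minimal sketch in plain Python. It is a simplified illustration of the SMOTE idea, not imblearn's implementation and not the code from the notebook; the function name `smote_like_oversample` and the toy data are hypothetical.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority samples.

    Simplified illustration of the SMOTE idea; not imblearn's implementation.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # a point cannot have more neighbors than peers
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Find the k nearest minority neighbors of the base point (excluding itself).
        others = [p for p in minority if p is not base]
        neighbors = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbor = rng.choice(neighbors)
        # Pick a random point on the segment between base and neighbor.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic


# Hypothetical toy data: 8 majority (label 0) and 3 minority (label 1) points.
majority = [[float(i), float(i % 3)] for i in range(8)]
minority = [[10.0, 10.0], [11.0, 10.5], [10.5, 11.5]]

# Generate enough synthetic minority points to balance the two classes.
new_points = smote_like_oversample(minority, n_new=len(majority) - len(minority), k=2)
X = majority + minority + new_points
y = [0] * len(majority) + [1] * (len(minority) + len(new_points))

print(y.count(0), y.count(1))  # prints: 8 8
```

With imblearn itself, the equivalent step is a call to `SMOTE().fit_resample(X, y)`. Oversampling is normally applied to the training split only, so that the evaluation data keeps the original class balance.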

Model Structure

Because this dataset combines a large amount of free text with several categorical columns, I decided to build a hybrid deep learning model that takes both kinds of input. The structure of the model is as follows:

[Figure: model structure diagram]