Project for the class Advanced Methods in Applied Statistics 2022 at UCPH
PreProcessor----------- Cleans the data, including use of NLP mmethods, and creates the datafile parsed_data.csv
FeatureExtractor------- Extracts features sentiment score and readability from the data and creates the datafile parsed_data_with_features.csv
BoW-------------------- Uses bag of word approach and a LinearSVC model to produce predictions on test
TF-IDF----------------- Uses bag of word approach and a LinearSVC model to produce predictions on test