This project focuses on detecting fake content generated by language models. The domain is scientist biographies: real biographies were sourced from Wikipedia, and fake ones were generated with a fine-tuned GPT-2 model. Three model architectures were implemented and evaluated: a feed-forward neural network (FFNN), a long short-term memory (LSTM) network, and a Transformer-based model (BERT).
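For context, fake biographies of this kind can be sampled from a fine-tuned GPT-2 checkpoint via the Hugging Face `transformers` API. The sketch below is illustrative only; the checkpoint path, prompt, and sampling parameters are assumptions, not the project's actual generation pipeline.

```python
# Hypothetical sketch: sampling a fake biography from a fine-tuned GPT-2.
# The checkpoint path and prompt are assumptions for illustration.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("path/to/finetuned-gpt2")  # hypothetical path
model.eval()

prompt = "Marie Curie was a"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_length=200,
    do_sample=True,        # sample rather than greedy-decode for varied text
    top_k=50,
    top_p=0.95,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```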
Implemented a basic FFNN for binary classification of real and fake biographies, trained it on the labeled data, and evaluated its performance with a confusion matrix and learning curves for training and test perplexity.
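A minimal sketch of such an FFNN classifier is shown below, assuming a fixed-size feature vector per biography (e.g. bag-of-words); the layer sizes and training step are illustrative, not the project's exact configuration.

```python
# Minimal FFNN sketch for real/fake classification.
# Input dimension and hidden size are illustrative assumptions.
import torch
import torch.nn as nn

class FFNN(nn.Module):
    def __init__(self, input_dim: int, hidden_dim: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 2),  # two logits: real vs. fake
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)

model = FFNN(input_dim=5000)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One training step on a dummy batch of 16 feature vectors.
x = torch.randn(16, 5000)
y = torch.randint(0, 2, (16,))
loss = criterion(model(x), y)
loss.backward()
optimizer.step()
```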
Developed an LSTM network using PyTorch's `nn.LSTM` module, trained it on the provided datasets, and evaluated its performance with learning curves and a confusion matrix on the test set.
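A sketch of an LSTM classifier built on `nn.LSTM` follows; the vocabulary size, embedding dimension, and hidden size are assumptions chosen for illustration.

```python
# Sketch of an LSTM classifier using PyTorch's nn.LSTM.
# Vocabulary, embedding, and hidden sizes are illustrative assumptions.
import torch
import torch.nn as nn

class LSTMClassifier(nn.Module):
    def __init__(self, vocab_size: int, embed_dim: int = 100, hidden_dim: int = 256):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, 2)  # two logits: real vs. fake

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        embedded = self.embedding(token_ids)      # (batch, seq_len, embed_dim)
        _, (hidden, _) = self.lstm(embedded)      # hidden: (1, batch, hidden_dim)
        return self.fc(hidden[-1])                # classify from final hidden state

model = LSTMClassifier(vocab_size=20000)
logits = model(torch.randint(1, 20000, (8, 64)))  # dummy batch of 8 sequences
print(logits.shape)  # torch.Size([8, 2])
```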
Fine-tuned a pre-trained BERT model for the binary classification task. Evaluated the model's performance and discussed its advantages and limitations. Achieved the highest accuracy among the three models.
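Below is a hedged sketch of BERT fine-tuning for this binary task via Hugging Face `transformers`; the model name, learning rate, label convention, and the single training step are assumptions for illustration.

```python
# Sketch of fine-tuning BERT for real/fake classification (one training step).
# Model name, hyperparameters, and label convention are assumptions.
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

texts = ["Ada Lovelace was an English mathematician...", "A generated biography..."]
labels = torch.tensor([0, 1])  # assumed convention: 0 = real, 1 = fake

batch = tokenizer(texts, padding=True, truncation=True, max_length=256, return_tensors="pt")
outputs = model(**batch, labels=labels)  # returns loss and logits
outputs.loss.backward()
optimizer.step()
```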
- The FFNN achieved an accuracy of 56.6%.
- The LSTM model improved the accuracy to 79.4%.
- The BERT-based Transformer model achieved the highest accuracy of 82.8%.
- Code: Python scripts for FFNN, LSTM, and Transformer models.
- Models: Saved model parameters and checkpoints.
- Results: Confusion matrices, learning curves, and performance metrics.

This project demonstrates the application of different neural network architectures to the task of fake content detection, highlighting the strengths and weaknesses of each approach. The code and results can be found in the repository.
For further details and instructions, see the accompanying PDF file.