Twitter Sentiment Analysis

Introduction

This project focuses on analyzing the sentiment of tweets using machine learning techniques. The dataset used for this project is sourced from Kaggle. The objective is to classify tweets into positive, negative, or neutral sentiments.

Dataset

The dataset is sourced from Kaggle and contains tweets with their corresponding sentiment labels. The dataset includes the following columns:

id: Unique identifier for the tweet
text: The text of the tweet
target: The sentiment label (positive, negative)

Installation

To run this project, you'll need to have Python installed along with several libraries. You can install the required libraries using the following command: pip install pandas numpy nltk scikit-learn

Data Preprocessing

The data preprocessing steps involve the following:

Loading the dataset: The dataset is loaded into a pandas DataFrame.
Text cleaning: The text is cleaned by removing special characters, URLs, and stop words.
Tokenization: The text is tokenized into individual words.
Stemming: Words are reduced to their root forms.
Vectorization: The text data is converted into numerical format using techniques like TF-IDF.

Modeling

The modeling phase involves training machine learning models to classify the tweets' sentiments. The following models are used:

Logistic Regression
Support Vector Machine (SVM)
Random Forest

The steps include:

Splitting the data: The dataset is split into training and testing sets.
Training the models: Each model is trained on the training data.

Evaluation

The models are evaluated using the following metrics:

Accuracy
Precision
Recall
F1 Score

Results

The results of the models are compared, and the best-performing model is selected. The performance metrics are displayed in a tabular format.

Model Accuracy Precision Recall F1 Score Logistic Regression 0.85 0.84 0.85 0.84 SVM 0.88 0.87 0.88 0.87 Random Forest 0.86 0.85 0.86 0.85

Conclusion

The SVM model performed the best in terms of accuracy, precision, recall, and F1 score. Future improvements could include using deep learning techniques and experimenting with different feature extraction methods.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Twitter_Sentiment_Analysis.ipynb		Twitter_Sentiment_Analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Sentiment Analysis

Introduction

Table of Contents

Dataset

Installation

Data Preprocessing

Modeling

Evaluation

Results

Conclusion

About

Releases

Packages

Languages

AYUSHI-SHA/Twitter_Sentiment_Analysis

Folders and files

Latest commit

History

Repository files navigation

Twitter Sentiment Analysis

Introduction

Table of Contents

Dataset

Installation

Data Preprocessing

Modeling

Evaluation

Results

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages