Project Text Sentiment Classification

Abstract:

The aim of this project is to classify tweets into a 'happy' and a 'sad' class. To fulfill that aim, we implemented different machine learning algorithms and finally choose a Recurrent Neural Network as the best method. We achieved an accuracy of about 86% which is close to the state of the art.

Requirements:

Python 3
Tensorflow
Keras
ScikitLearn

How to reproduce:

Download the datasets on CrowdAI
Put them into a directory called data/
Download twitter GloVe embeddings over here
Put them into a directory called data/glove.twitter.27B
Create an empty directory called output/ and put into it the trained model you downloaded over here
Run the file run.py with python 3. It should produce a file prediction.csv that you can upload directly on CrowdAI

Content:

run.py, a simple script that load the saved model and predict the classes of the test dataset
mode_selection.py, a simple script that we used to cross-validate the accuracy of different classification algorithms
rnn.py, the implementation of our Recurrent Neural Network
plot.py, different functions that we used to make the plot of the report

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
__pycache__		__pycache__
.DS_Store		.DS_Store
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build_vocab.sh		build_vocab.sh
clustering.py		clustering.py
cooc.py		cooc.py
cross_validation.py		cross_validation.py
cut_vocab.sh		cut_vocab.sh
debug.py		debug.py
feature_expansion.py		feature_expansion.py
glove_solution.py		glove_solution.py
glove_template.py		glove_template.py
loader.py		loader.py
model_selection.py		model_selection.py
nn.py		nn.py
pickle_vocab.py		pickle_vocab.py
plot.py		plot.py
requirements.txt		requirements.txt
rnn.py		rnn.py
rnn_lstm.py		rnn_lstm.py
run.py		run.py
tda.py		tda.py
tmp.py		tmp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Text Sentiment Classification

Abstract:

Requirements:

How to reproduce:

Content:

About

Releases

Packages

Contributors 2

Languages

Eagleseb/project_text_classification

Folders and files

Latest commit

History

Repository files navigation

Project Text Sentiment Classification

Abstract:

Requirements:

How to reproduce:

Content:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages