Toxic comment classification.

Problem Statement

Every day, while browsing social media, we encounter comments, reviews, tweets, and other posts that may hurt the sentiments of a particular group or community. Such comments are considered toxic. This project addresses the problem of classifying social media comments into six categories of toxicity: Toxic, Severe-toxic, Obscene, Threat, Insult, and Identity_hate. This is a multi-label classification problem, meaning that a given comment may belong to more than one category at the same time.
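To make the multi-label setup concrete, here is a minimal sketch of how one comment's categories can be stored as a 0/1 indicator vector, one position per category. The lowercase identifiers in `LABELS` and the helper names `encode_labels` and `decode_labels` are assumptions made for illustration; they are not taken from the notebook.

```python
# Assumed label identifiers, one per toxicity category listed above.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def encode_labels(active):
    """Turn a set of label names into a 0/1 indicator vector ordered like LABELS."""
    unknown = set(active) - set(LABELS)
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    return [1 if name in active else 0 for name in LABELS]


def decode_labels(vector):
    """Turn an indicator vector back into the list of active label names."""
    return [name for name, flag in zip(LABELS, vector) if flag]


# A comment can carry several labels at once, or none at all.
both = encode_labels({"toxic", "insult"})   # [1, 0, 0, 0, 1, 0]
clean = encode_labels(set())                # [0, 0, 0, 0, 0, 0]
```

Because each position is independent, a comment with no active labels is simply the all-zero vector rather than a separate "clean" class.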

Language and Libraries used.

Python 3.7
Numpy
Pandas
Matplotlib
NLTK
Seaborn

Steps involved

Getting the dataset.
Getting insights from dataset using visualisation tools.
Preprocessing the data using NLTK.
Applying Multi Label classification algorithms.
Comparing the results and choosing the best among them.
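The preprocessing and multi-label steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the notebook's code: the regex tokenizer and tiny `STOPWORDS` set stand in for NLTK's tokenizers and stop-word lists, and `KeywordClassifier` is a toy base learner standing in for the SVM mentioned under Results. All names and the training examples are hypothetical. What the sketch does show faithfully is the Binary Relevance idea: train one independent binary classifier per label and stack their predictions.

```python
import re
from collections import Counter

# Assumed label order; the identifiers are illustrative.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]

# Tiny stand-in for a real stop-word list such as the one NLTK provides.
STOPWORDS = {"a", "an", "the", "is", "are", "you", "i", "will",
             "to", "and", "of", "this", "for"}


def preprocess(text):
    """Lowercase, keep alphabetic tokens, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


class KeywordClassifier:
    """Toy binary classifier: flags tokens seen only in positive examples."""

    def fit(self, docs, targets):
        pos, neg = Counter(), Counter()
        for tokens, y in zip(docs, targets):
            (pos if y else neg).update(set(tokens))
        self.keywords = {t for t in pos if t not in neg}
        return self

    def predict(self, docs):
        return [1 if self.keywords & set(tokens) else 0 for tokens in docs]


class BinaryRelevance:
    """One independent binary classifier per label."""

    def __init__(self, make_classifier, n_labels):
        self.models = [make_classifier() for _ in range(n_labels)]

    def fit(self, docs, Y):
        for j, model in enumerate(self.models):
            # Column j of Y is the binary target for label j.
            model.fit(docs, [row[j] for row in Y])
        return self

    def predict(self, docs):
        columns = [model.predict(docs) for model in self.models]
        # Transpose per-label columns back into per-comment rows.
        return [list(row) for row in zip(*columns)]


# Hypothetical training data, for illustration only.
train_texts = [
    "you are an idiot",
    "thanks for the helpful edit",
    "i will hurt you, idiot",
    "this article is great",
]
train_Y = [
    [1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

train_docs = [preprocess(t) for t in train_texts]
model = BinaryRelevance(KeywordClassifier, len(LABELS)).fit(train_docs, train_Y)
prediction = model.predict([preprocess("what an idiot")])  # [[1, 0, 0, 0, 1, 0]]
```

Swapping in a different base learner only requires passing another factory to `BinaryRelevance`, which is what makes it straightforward to compare several classifiers as described in the last step.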

Results

Achieved an accuracy score of 88.16% using the Binary Relevance method with an SVM classifier.
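For multi-label problems, "accuracy" can be computed in more than one way, and the README does not state which definition produced the figure above. The sketch below contrasts two common choices: subset accuracy, where a comment counts as correct only if all six labels match, and per-label accuracy, the fraction of individual label decisions that are correct. The function names and the sample vectors are hypothetical and unrelated to the project's data.

```python
def subset_accuracy(Y_true, Y_pred):
    """Fraction of comments whose full label vector is predicted exactly."""
    exact = sum(1 for t, p in zip(Y_true, Y_pred) if list(t) == list(p))
    return exact / len(Y_true)


def label_accuracy(Y_true, Y_pred):
    """Fraction of individual label decisions that are correct."""
    total = sum(len(t) for t in Y_true)
    correct = sum(a == b for t, p in zip(Y_true, Y_pred) for a, b in zip(t, p))
    return correct / total


# Made-up vectors, for illustration only.
Y_true = [
    [1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 0, 0, 1, 1, 0],
    [0, 0, 1, 0, 0, 0],
]
Y_pred = [
    [1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

strict = subset_accuracy(Y_true, Y_pred)   # 2 of 4 rows match exactly -> 0.5
lenient = label_accuracy(Y_true, Y_pred)   # 22 of 24 label decisions match
```

The two numbers can differ substantially on the same predictions, so it helps to name the metric when comparing classifiers.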

kesari007/Toxic-Comment-Classification
