This is my initial submission for the Jigsaw Multilingual Toxic Comment Classification Kaggle competition. I will keep modifying it to try to improve my score. If you're attempting this competition for the first time, feel free to fork this repo and modify the code, and do let me know if you manage to improve the score. Running this on Kaggle gives an accuracy of roughly 91%.
This competition is based on Conversation AI, an initiative by Jigsaw and Google. Its main focus is building machine learning models that can identify toxicity in online conversations, where toxicity is defined as anything rude, disrespectful, or otherwise likely to make someone leave a discussion.
You can access all datasets and details of each file from here
All the files included in src are sufficient to train and test the model.
The Jupyter notebooks Jigsaw-multilingual-nikhiljohn.ipynb and jigsaw-inference-nikhiljohn.ipynb are my Kaggle notebooks, for training and inference respectively. Feel free to use them too. If you do, make sure to use the TPUs provided by Kaggle. If you need a guide on how to work with TPUs, use this link. It's a video tutorial by Abhishek Thakur, a data scientist I really admire.
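To use Kaggle's TPUs, the notebook needs to detect the TPU and build a distribution strategy before creating the model. Below is a minimal sketch of the standard TensorFlow pattern (the helper name `get_strategy` is my own; it is not taken from the notebooks above). It falls back to the default strategy when no TPU is attached, so the same code also runs on CPU/GPU sessions:

```python
import tensorflow as tf

def get_strategy():
    """Return a TPUStrategy on Kaggle TPU sessions, else the default strategy."""
    try:
        # On Kaggle, TPUClusterResolver.connect() finds the attached TPU,
        # connects to it, and initializes the TPU system in one call.
        tpu = tf.distribute.cluster_resolver.TPUClusterResolver.connect()
        strategy = tf.distribute.TPUStrategy(tpu)
    except ValueError:
        # No TPU available: fall back to the default (CPU/GPU) strategy.
        strategy = tf.distribute.get_strategy()
    return strategy

strategy = get_strategy()
print("Replicas in sync:", strategy.num_replicas_in_sync)

# The model must be built inside the strategy scope so its variables
# are placed on the TPU cores (or on the fallback device).
# with strategy.scope():
#     model = build_model()
```

On a Kaggle TPU v3-8 session this reports 8 replicas; elsewhere it reports 1 and training simply runs on the local device.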