Skip to content

State-of-the-art NLP principles and methods for toxic comment classification

Notifications You must be signed in to change notification settings

TheWoops/Natural-Language-Processing

Repository files navigation

Natural-Language-Processing

State-of-the-art NLP principles and methods for toxic comment classification
The links below display the above notebooks via nbviewer, because Github sometimes fails to display .ipynb-files properly.

Data preparation:

Modeling:

Resources

Natural Language Processing

Libraries for working with human languages.

  • General
    • gensim - Topic Modeling for Humans.
    • langid.py - Stand-alone language identification system.
    • nltk - A leading platform for building Python programs to work with human language data.
    • pattern - A web mining module for the Python.
    • polyglot - Natural language pipeline supporting hundreds of languages.
    • pytext - A natural language modeling framework based on PyTorch.
    • PyTorch-NLP - A toolkit enabling rapid deep learning NLP prototyping for research.
    • spacy - A library for industrial-strength natural language processing in Python and Cython.
    • stanfordnlp - The Stanford NLP Group's official Python library, supporting 50+
    • flair - library for state-of-the-art NLP by zalandoresearch
    • fastai - well documented NLP library for transfer learning

About

State-of-the-art NLP principles and methods for toxic comment classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published