Multiclass text classification using n-gram TF-IDF and Naive Bayes

dogukankse/Multiclass-text-classification

Multiclass Text Classification

Summary

  • Written in Python 3.6.
  • The requirements are listed in requirements.txt.
  • Install them by running pip install -r requirements.txt in the project directory.
  • The project uses Zemberek-NLP 0.16. (Link: [Zemberek-NLP](https://github.com/ahmetaa/zemberek-nlp/))
  • The Zemberek library requires [JDK8](https://www.oracle.com/technetwork/java/javase/downloads/index.html).
  • The Zemberek library is accessed through gRPC.
  • To start the gRPC server, run java -jar zemberek-full.jar StartGrpcServer --dataRoot .\zemberek_data\ in the project directory.

Requirements

  • Java 8
  • Python 3.6
    • grpcio==1.20.1
    • numpy==1.16.3
    • zemberek_grpc==0.16.1
    • nltk==3.4.1
    • pandas==0.24.2
    • grpc==0.3-19
    • scikit_learn==0.21.1
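
Example

The approach named in the project description (n-gram TF-IDF features fed to a Naive Bayes classifier) can be sketched with scikit-learn roughly as follows. This is a minimal illustration, not the repository's actual code: the toy sentences, the labels, and the choice of ngram_range=(1, 2) are assumptions made for the example, and the Zemberek preprocessing step is left out entirely.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy corpus; the project's real dataset is not shown in this README.
train_texts = [
    "the team won the football match",
    "the striker scored a goal in the final",
    "the central bank raised interest rates",
    "inflation and stock market prices fell",
    "the new phone has a faster processor",
    "the software update fixes security bugs",
]
train_labels = [
    "sports", "sports",
    "economy", "economy",
    "technology", "technology",
]

# Unigram + bigram TF-IDF features (ngram_range is an assumed setting),
# followed by a multinomial Naive Bayes classifier.
model = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2)),
    MultinomialNB(),
)
model.fit(train_texts, train_labels)

# Classify an unseen sentence into one of the three classes.
predicted = model.predict(["football match goal"])
print(predicted[0])
```

In a real run, the training texts would presumably first be normalized or stemmed through the Zemberek gRPC server before vectorization, which this sketch does not attempt.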
