Skip to content

A text classification library using the microtc approach

License

Notifications You must be signed in to change notification settings

sadit/TextClassification.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stable Dev Build Status Coverage

TextClassification.jl

This package provides methods to create fast and simple text classifiers, based on the same idea behind MicroTC. The main idea is to perform a model selection among a large space of configurations, including preprocessing steps, weighting schemes, tokenizers (combinations), and classifiers. Moreover, TextClassification.jl also includes support for different classifiers and fine-tune them in the search stage; additional support for weighthing shcmes, and a better support for distributed computing thanks to Julia. As the original implementation, this package is designed to be both domain and language independent.