Natural Language Processing for Text Classification and Machine Learning
Text classification is a supervised machine learning task in which a labelled dataset, containing text documents and their labels, is used to train a classifier.
Dataset Preparation step, which includes loading a dataset and performing basic pre-processing. The dataset is then split into training and validation sets.
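A minimal sketch of this step using scikit-learn's `train_test_split`; the toy texts and labels below are invented for illustration (in practice you would load your own documents, e.g. from a CSV file):

```python
from sklearn.model_selection import train_test_split

# Toy labelled dataset (invented for illustration); in practice,
# load your own documents and labels from disk
texts = ["free offer click now", "meeting at noon", "win a prize today",
         "lunch with the team", "claim your free reward", "project status update"]
labels = ["spam", "ham", "spam", "ham", "spam", "ham"]

# Hold out 25% of the documents for validation; stratify preserves
# the label distribution in both splits
train_x, valid_x, train_y, valid_y = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=42)
```

The `random_state` argument makes the split reproducible across runs.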
In this step, raw text data will be transformed into feature vectors, and new features will be created using the existing dataset. We will implement the following ideas to obtain relevant features from our dataset.
2.1 Count Vectors as features
2.2 TF-IDF Vectors as features
-Word level
-N-Gram level
-Character level
2.3 Word Embeddings as features
2.4 Text / NLP based features
2.5 Topic Models as features
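The first two ideas can be sketched directly with scikit-learn; word embeddings (2.3) and topic models (2.5) need pretrained vectors or a fitted topic model, so only 2.1 and 2.2 are shown here on an invented two-document corpus:

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

corpus = ["the cat sat on the mat", "the dog sat on the log"]

# 2.1 Count vectors: one column per vocabulary term, raw counts as values
count_vec = CountVectorizer()
counts = count_vec.fit_transform(corpus)

# 2.2 TF-IDF vectors at the three granularities listed above
word_tfidf = TfidfVectorizer(analyzer="word")                       # word level
ngram_tfidf = TfidfVectorizer(analyzer="word", ngram_range=(2, 3))  # n-gram level
char_tfidf = TfidfVectorizer(analyzer="char", ngram_range=(2, 3))   # character level

word_features = word_tfidf.fit_transform(corpus)
```

Each vectorizer learns its vocabulary from the corpus with `fit_transform` and returns a sparse document-term matrix.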
Model Building step, in which a machine learning model is trained on the labelled dataset. We will try the following classifiers:
-Naive Bayes Classifier
-Linear Classifier
-Support Vector Machine
-Bagging Models
-Boosting Models
-Shallow Neural Networks
-Deep Neural Networks
-Convolutional Neural Network (CNN)
-Long Short-Term Memory (LSTM)
-Gated Recurrent Unit (GRU)
-Bidirectional RNN
-Recurrent Convolutional Neural Network (RCNN)
-Other Variants of Deep Neural Networks
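As a minimal sketch of the model-building step, the first classifier in the list (Naive Bayes) can be trained on TF-IDF features in a few lines; the training texts below are invented for illustration:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Invented toy training data for illustration
train_texts = ["free offer click now", "win a prize today", "claim your free reward",
               "meeting at noon", "lunch with the team", "project status update"]
train_labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

# Chain the feature-extraction step and the classifier into one pipeline,
# so raw text can be passed straight to fit() and predict()
model = make_pipeline(TfidfVectorizer(), MultinomialNB())
model.fit(train_texts, train_labels)

prediction = model.predict(["free prize offer"])
```

Any of the other classifiers in the list (e.g. `LinearSVC` or `LogisticRegression`) can be swapped in for `MultinomialNB` without changing the rest of the pipeline.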
Model Evaluation step, in which the trained classifier is assessed using the following metrics:
- accuracy: proportion of test results that are correct
- sensitivity: proportion of true +ves identified
- specificity: proportion of true -ves identified
- positive likelihood: increased probability of true +ve if test +ve
- negative likelihood: reduced probability of true +ve if test -ve
- false positive rate: proportion of false +ves in true -ve patients
- false negative rate: proportion of false -ves in true +ve patients
- positive predictive value: chance of true +ve if test +ve
- negative predictive value: chance of true -ve if test -ve
- precision = positive predictive value
- recall = sensitivity
- f1 = (2 * precision * recall) / (precision + recall)
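All of the metrics above can be derived from the four confusion-matrix counts (true/false positives and negatives); a sketch with invented counts:

```python
# Derive every metric listed above from the confusion-matrix counts
def classification_metrics(tp, fp, tn, fn):
    sensitivity = tp / (tp + fn)    # recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)      # positive predictive value
    fpr = fp / (fp + tn)            # false positive rate
    fnr = fn / (fn + tp)            # false negative rate
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "positive likelihood": sensitivity / fpr,
        "negative likelihood": fnr / specificity,
        "false positive rate": fpr,
        "false negative rate": fnr,
        "precision": precision,
        "negative predictive value": tn / (tn + fn),
        "recall": sensitivity,
        "f1": (2 * precision * sensitivity) / (precision + sensitivity),
    }

# Example confusion matrix: 40 true +ves, 10 false +ves,
# 45 true -ves, 5 false -ves (invented numbers)
metrics = classification_metrics(tp=40, fp=10, tn=45, fn=5)
```

Note this sketch assumes no count is zero; with a degenerate confusion matrix (e.g. no negatives at all) some ratios would divide by zero.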
Finally, we will use different ways to improve the performance of the text classifiers.
Using ELI5