I have extracted some of the code lines from the original repository to use it as an example of pre-processing datasets
the original dataset is in the dataset folder, the X_test.csv, X_train.csv, y_train.csv are the result, the the set of data ready to do machine learning