Name		Name	Last commit message	Last commit date
parent directory ..
Prediction_Project.Rmd		Prediction_Project.Rmd
Prediction_Project.html		Prediction_Project.html
Probabilities1.png		Probabilities1.png
Probabilities2.png		Probabilities2.png
README.md		README.md
pml-test.csv		pml-test.csv
pml-train.csv		pml-train.csv

README.md

Weightlifting Prediction Analysis Using PCA and Random Forests

This is the Practical Machine Learning Project Repository. If you want a web-based version of the project go to this RPubs page. The basic outline of my solution is:

Clean the training and testing datasets
- Remove non-relevant variables
- Remove variables with more than 95% NAs
Do a Principal Component Analysis (PCA) on the clean training dataset in order to reduce the number of variables
Select the PCs whose cumulative proportion of explained variance equals 95%
Predcit the new PCs on the training dataset
Use the predictions to build a Random Forest model (parallelization is required)
Do a confusion matrix to assess the performance on the training dataset
Predict the class outcome on the testing set, first calculating the PCs and then using the Random Forest model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Practical_Machine_Learning

Practical_Machine_Learning

README.md

Weightlifting Prediction Analysis Using PCA and Random Forests

Files

Practical_Machine_Learning

Directory actions

More options

Directory actions

More options

Latest commit

History

Practical_Machine_Learning

Folders and files

parent directory

README.md

Weightlifting Prediction Analysis Using PCA and Random Forests