The financial markets generally are unpredictable… The idea that you can actually predict what's going to happen contradicts my way of looking at the market. - George Soros
One of my mentors had a sequence classification project coming up with a client in a few weeks. He wanted me to demonstrate that I could do time series/sequence classification in TensorFlow/Keras. I had never built such a model before, nor had I worked on a time series problem. Since we both have an interest in cryptocurrency, we thought it would be fun to build a model to predict the price of Bitcoin.
Disclaimer: since people dedicate their lives to building financial trading models, I thought there was a close to zero percent chance I would build a profitable model. So, I treated this as a project to build my ML skills.
I completed the project much to my mentor's satisfaction (and then we completed the client's sequence classification project) and he left the following ⭐⭐⭐⭐⭐ review:
Note: I spent more than 8 hours on this but asked to bill it hourly to increase the number of hours billed on my Upwork profile.
The best results came from an LSTM with 5 layers, each smaller than the last. I ran multiple experiments on wandb; the lowest loss on the validation set was an RMSE of 0.01816.
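To make the architecture concrete, here is a minimal sketch of a 5-layer stacked LSTM with decreasing layer sizes in Keras. The unit counts, sequence length, and output head are assumptions for illustration, not the exact model from the repo.

```python
# Sketch of a 5-layer stacked LSTM with each layer smaller than the last.
# Unit counts and sequence length are placeholders, not the tuned values.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_stacked_lstm(seq_len: int, n_features: int = 1) -> tf.keras.Model:
    model = models.Sequential([
        layers.Input(shape=(seq_len, n_features)),
        # return_sequences=True so each LSTM passes full sequences to the next layer
        layers.LSTM(128, return_sequences=True),
        layers.LSTM(64, return_sequences=True),
        layers.LSTM(32, return_sequences=True),
        layers.LSTM(16, return_sequences=True),
        layers.LSTM(8),      # final LSTM returns only its last hidden state
        layers.Dense(1),     # single scaled-price prediction
    ])
    model.compile(
        optimizer="adam",
        loss="mse",
        metrics=[tf.keras.metrics.RootMeanSquaredError()],
    )
    return model
```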
X_val predictions vs. actuals for runs 421-423 (blue = actual, red = predictions)
Due to the stochastic nature of DL models, I ran each experiment at least 10 times. The best results were obtained on runs 413-424 (which you can find by searching the wandb project page with the regex `41[3-9]|42[0-4]`).
The best model was pretty-vortex-422.
I manually tuned the learning rate and implemented a custom learning rate scheduler. The optimal batch size was 168; the best scaling was to first apply a log transformation and then min/max scale to (0, 1); and the Adam optimizer outperformed the others.
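Below is a rough sketch of that preprocessing and optimizer setup: log-transform the prices, fit a (0, 1) min/max scaler on the training split only, and train with Adam plus a learning rate scheduler. The schedule function and initial learning rate are placeholders, not the actual tuned values.

```python
# Sketch of the scaling + optimizer setup described above.
# The LR schedule and starting learning rate are assumptions.
import numpy as np
import tensorflow as tf
from sklearn.preprocessing import MinMaxScaler

def scale_prices(train_prices: np.ndarray, val_prices: np.ndarray):
    """Log-transform, then fit a (0, 1) MinMaxScaler on the training data only."""
    log_train = np.log(train_prices).reshape(-1, 1)
    log_val = np.log(val_prices).reshape(-1, 1)
    scaler = MinMaxScaler(feature_range=(0, 1))
    return scaler.fit_transform(log_train), scaler.transform(log_val), scaler

def lr_schedule(epoch: int, lr: float) -> float:
    """Placeholder schedule: keep the LR for 10 epochs, then decay 5% per epoch."""
    return lr if epoch < 10 else lr * 0.95

callbacks = [tf.keras.callbacks.LearningRateScheduler(lr_schedule)]
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3)  # starting LR is an assumption

# model.fit(X_train, y_train, validation_data=(X_val, y_val),
#           batch_size=168, callbacks=callbacks)
```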
Pretty-vortex-422 X_train results - actuals vs. predictions
Pretty-vortex-422 X_val results - actuals vs. predictions - note the low RMSE of 0.01816 and how the red line tightly hugs the blue.

The vast majority of the code I used is in `price_predictor/helpers.py`. There are 29 functions that I have split into sections, plus one called `train_and_validate` that performs all the training and validation steps for each experiment. The functions that make up `train_and_validate` should make it clear what is happening at each step.
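For a sense of the structure, here is a hypothetical outline of a `train_and_validate`-style function. The helper names (other than `train_and_validate` itself) are placeholders, not the actual functions in `price_predictor/helpers.py`.

```python
# Hypothetical outline only: prepare_data, build_model, and plot_preds_vs_actuals
# are placeholder names standing in for the real helpers.
def train_and_validate(config: dict):
    # 1. Load, scale, and window the price series into (X, y) sequences
    X_train, y_train, X_val, y_val, scaler = prepare_data(config)

    # 2. Build and compile the model from the experiment config
    model = build_model(config)

    # 3. Train with the configured batch size and number of epochs
    model.fit(
        X_train, y_train,
        validation_data=(X_val, y_val),
        batch_size=config["batch_size"],
        epochs=config["epochs"],
    )

    # 4. Predict on the validation set and plot predictions vs. actuals
    val_preds = model.predict(X_val)
    plot_preds_vs_actuals(y_val, val_preds)
    return model, val_preds
```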
All model tuning for this project took place with Weights & Biases (wandb). You can see the results of all 540+ runs on the bitcoin_price_predictor wandb page. As such, the notebooks themselves are not that interesting - I just used them to run wandb experiments and saved everything to the cloud.
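The tracking pattern itself is the standard wandb one: initialise a run with the experiment config, stream training metrics via the Keras callback, and log the final validation score. The project name below matches the writeup; the config values are placeholders.

```python
# Minimal wandb tracking example; config values are placeholders.
import wandb
from wandb.keras import WandbCallback

config = {"batch_size": 168, "learning_rate": 1e-3, "epochs": 100}
run = wandb.init(project="bitcoin_price_predictor", config=config)

# model.fit(..., callbacks=[WandbCallback()])  # streams losses/metrics to the run page

wandb.log({"val_rmse": 0.01816})  # e.g. the best validation RMSE reported above
run.finish()
```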
This was my first time building such a model with TensorFlow/Keras. Since then I have used PyTorch Lightning and love the flexibility of its DataModules for encapsulating all the data processing code. I would like to encapsulate more of this code into easy-to-transport classes instead of the (rather large) collection of functions I wrote.
Since Bitcoin trades around the clock, new price data is always available, so it's easy to re-train the model and see how it performs on brand-new data. Because we only used the price of Bitcoin itself to make predictions, I doubt the model will perform well. But it would be great to get a measure of how it performs in production.
Some related courses whose material would not take long to implement here are:
- Train and Deploy a Serverless API to predict crypto prices by Pau Labarta Bajo
- The Real-World ML Tutorial by Pau Labarta Bajo
I used Python and the following libraries:
- TensorFlow (and Keras) 2.4
- NumPy
- pandas
- scikit-learn
- wandb
- Matplotlib
- seaborn
- tqdm
I finished this project in June 2021 and am in the process of tidying everything up so it can be presented to the world in a nice manner. You are one of the lucky souls who gets to see the repo in its raw form. But this means that not everything is as clean or orderly as it should be.