Crypto Clustering

This code clusters two hundred cryptocurrencies by their price time series. Hourly data was first downloaded using the cryptocompare Python library, which can also provide daily and minute data. To simplify downloading, a class was created that fetches the data at whichever granularity is needed. Note that only the closing price was downloaded.
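As a rough illustration of such a downloader class, here is a minimal sketch. The class name `CloseDownloader`, its parameters, and the default values are hypothetical and may differ from the class in the notebook. It assumes the `get_historical_price_day`, `get_historical_price_hour` and `get_historical_price_minute` functions of the cryptocompare package, each returning a list of candles with a `close` field.

```python
from typing import Callable, Dict, List, Optional


def _default_fetchers() -> Dict[str, Callable]:
    # Imported lazily so the class can be exercised without the library installed.
    import cryptocompare

    return {
        "day": cryptocompare.get_historical_price_day,
        "hour": cryptocompare.get_historical_price_hour,
        "minute": cryptocompare.get_historical_price_minute,
    }


class CloseDownloader:
    """Hypothetical helper: fetch closing prices at a chosen granularity."""

    def __init__(
        self,
        currency: str = "USD",  # assumed quote currency
        limit: int = 500,       # assumed number of candles per request
        fetchers: Optional[Dict[str, Callable]] = None,
    ) -> None:
        self.currency = currency
        self.limit = limit
        self._fetchers = fetchers

    def closes(self, ticker: str, frequency: str = "hour") -> List[float]:
        """Return the closing prices of one ticker, oldest first."""
        fetchers = self._fetchers if self._fetchers is not None else _default_fetchers()
        if frequency not in fetchers:
            raise ValueError(f"unknown frequency: {frequency!r}")
        # The fetcher may return None when the request fails.
        candles = fetchers[frequency](ticker, self.currency, limit=self.limit) or []
        return [candle["close"] for candle in candles]

    def closes_for(self, tickers: List[str], frequency: str = "hour") -> Dict[str, List[float]]:
        """Return closing prices for several tickers, keyed by ticker."""
        return {ticker: self.closes(ticker, frequency) for ticker in tickers}
```

With the library installed, `CloseDownloader().closes_for(["BTC", "ETH"], "hour")` would request hourly closes for both tickers. The optional `fetchers` argument exists only so the class can be tried out offline with stand-in functions.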

The variable 'names' stores the cryptocurrency tickers. You can exclude cryptocurrencies you do not need by removing their tickers from it. You can also add new tickers, provided that data for the added cryptocurrencies is available in the cryptocompare library.

The data was then preprocessed for convenience (for example, the column 'index' was renamed to 'ticker').

The tslearn and sklearn libraries were used to cluster the time series. Two methods were used: Kmeans and DTW (Dynamic Time Warping). The number of clusters was chosen using silhouette_score and distortion values (both were plotted for this purpose). As a result, 5 clusters were chosen for the Kmeans method and 7 for DTW.
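To make the two ingredients concrete, the sketch below shows a dependency-free DTW distance and a mean silhouette coefficient computed from it. This is an illustration only, not the tslearn or sklearn implementation used in the notebook. The function names are hypothetical, and the squared-difference local cost with a final square root is an assumption about the DTW variant.

```python
import math


def dtw_distance(a, b):
    """DTW distance between two 1-D sequences (squared-difference local cost)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # acc[i][j]: minimal accumulated cost of aligning a[:i] with b[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = (a[i - 1] - b[j - 1]) ** 2
            acc[i][j] = cost + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return math.sqrt(acc[n][m])


def mean_silhouette(series, labels, dist=dtw_distance):
    """Mean silhouette coefficient of a labelling (needs at least 2 clusters)."""
    labels = list(labels)
    n = len(series)
    # Pairwise distance matrix between all series.
    d = [[dist(series[i], series[j]) for j in range(n)] for i in range(n)]
    clusters = set(labels)
    scores = []
    for i in range(n):
        same = [d[i][j] for j in range(n) if labels[j] == labels[i] and j != i]
        if not same:
            scores.append(0.0)  # singleton cluster: silhouette is defined as 0
            continue
        # a: mean distance to the other members of the same cluster
        a = sum(same) / len(same)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(d[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
            for c in clusters
            if c != labels[i]
        )
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / n
```

For example, `dtw_distance([0, 0, 1], [0, 1, 1])` is zero because DTW can align the shifted step, whereas the Euclidean distance between the same two sequences is 1. Computing `mean_silhouette` for the labellings produced with different cluster counts and comparing the values is one way to compare candidate numbers of clusters.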

For convenience, the graph of each cryptocurrency was plotted, grouped by cluster. The results were also saved to the CSV files hour_conclusion_kmeans.csv and hour_conclusion_dtw.csv.
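A minimal sketch of how cluster assignments could be grouped and written out follows. The function names and the 'cluster' column name are assumptions (only 'ticker' is mentioned above), and the real layout of the CSV files may differ. It uses the standard csv module rather than whatever the notebook uses.

```python
import csv
import io
from collections import defaultdict


def group_by_cluster(tickers, labels):
    """Map each cluster label to the list of tickers assigned to it."""
    groups = defaultdict(list)
    for ticker, label in zip(tickers, labels):
        groups[int(label)].append(ticker)
    return dict(sorted(groups.items()))


def write_assignments(tickers, labels, handle):
    """Write one 'ticker,cluster' row per cryptocurrency to an open text file."""
    writer = csv.writer(handle, lineterminator="\n")
    writer.writerow(["ticker", "cluster"])  # 'cluster' is an assumed column name
    for ticker, label in zip(tickers, labels):
        writer.writerow([ticker, int(label)])


# Usage with made-up labels; an in-memory buffer stands in for a file on disk.
buffer = io.StringIO()
write_assignments(["BTC", "ETH", "BTCB"], [0, 0, 1], buffer)
```

To write an actual file, pass a handle opened with `open(path, "w", newline="")` instead of the buffer.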

Conclusion:

The DTW method coped better with clustering cryptocurrency time series. In particular, unlike Kmeans, it was able to place some cryptocurrencies with extreme graphs (for example, BTCB) in a separate cluster.

October, 18 (updated version)

October, 10

Despite this, some extreme series were still not clustered successfully.

October, 10

Neither method coped perfectly, but clear patterns are visible in how the cryptocurrencies were clustered, which is already a good result.
