Skip to content

Releases: cp2k/dbcsr

Version 1.1.0

09 Apr 03:44
Compare
Choose a tag to compare

This DBCSR version introduces predictive modeling for the CUDA generation of the kernels. A decision tree model is built from the data obtained by autotuning certain (m, n, k)-triplets and is used to predict optimal parameters for unseen (m, n, k)-triplets.

Developed as a part of the PASC proposal "Sparse Tensor Linear Algebra Library"

Thanks to @shoshijak for the hard work to implement it!

v1.1.0-rc.0

03 Apr 21:25
Compare
Choose a tag to compare
v1.1.0-rc.0 Pre-release
Pre-release
Version 1.1.0-rc.0

Version 1.0.0

03 Jan 11:00
Compare
Choose a tag to compare

This is the first official DBCSR release, after the extraction from CP2K repository. The main requirement for this release was to preserve the original API as it was in CP2K, while several changes were internaly made:

  1. On-the-fly generation of the GPU kernels (fully revisited code for the generation of the GPU kernels)
  2. Prototype of C API
  3. Improved algorithm for the multiplication of rectangular matrices (when M>>N and M>>K)
  4. CUBLAS support for the multiplication of dense matrices
  5. CMAKE support

v1.0.0-rc.4

03 Jan 10:18
Compare
Choose a tag to compare
v1.0.0-rc.4 Pre-release
Pre-release
Version v1.0.0-rc.4

v1.0.0-rc.3

03 Jan 10:18
Compare
Choose a tag to compare
v1.0.0-rc.3 Pre-release
Pre-release
Version v1.0.0-rc.3

v1.0.0-rc.2

03 Jan 10:17
Compare
Choose a tag to compare
v1.0.0-rc.2 Pre-release
Pre-release
Version 1.0.0-rc.2

v1.0.0-rc.1

03 Jan 10:15
Compare
Choose a tag to compare
v1.0.0-rc.1 Pre-release
Pre-release
Second release candidate for 1.0.0

v1.0.0-rc.0

03 Jan 10:17
Compare
Choose a tag to compare
v1.0.0-rc.0 Pre-release
Pre-release
Tagging release candidate 1 for 1.0.0