RSTSR: A Rust Tensor Toolkit

This is a crate in early development. It has not reached a version 0.1.

A possible roadmap listed below.

Current Features

Device Dependent: This crate allows multiple devices/backends. By design, this crate separates tensor API and algorithm implementation API; all devices/backends share the same tensor API, while different in algorithm implementation API. Currently, DeviceCpuSerial (as reference implementation) and DeviceFaer is created. Other devices/backends could be implemented in future.
n-Dimensional: This crate provides n-dimensional tensor support, similar data-structure as ndarray in rust, and numpy in python. Broadcasting is available. We will try to implement as many Python array API functions as we can. We also hope to make function APIs to be similar to that of python/numpy.
% (remainder) as Matmul: This crate is radical in adopting matrix-multiplication operator. Remainder operator % is rarely used in floating-point arithmetics and integer matrix/vector computations. We use % as matrix-multiplication operator, like @ in python PEP-465.
```
// `a` and `b` are tensor objects
let c = &a % &b;  // perform `a.matmul(&b)`
```
Fast Matmul by Faer: With DeviceFaer, efficiency of matrix-multiplication (of f32, f64, Complex<f32>, Complex<f64>) should be comparable to (some cases even faster than) highly-optimized BLAS. Further more, $C = A A^T$ can be further speeded-up by SYRK. Though matrix symmetrize is not fully optimized, current implementation in DeviceFaer will handle $C = A A^T$ faster than general GEMM, comparable to that of numpy.
Parallel in Complicated Layouts: For example, in most cases, tensor addition $C = A + B$ is fast enough in serial (one-thread). Compiler nowadays usually automatically generates vectorized assemblies for this kind of task, even for naive implementation. However, when layout is not match (something like $C = A + B^T$), it can be extremely inefficient due to cache miss. rstsr does not prefectly solves this problem (decreases cache miss by tiling tensor), but try to perform tensor addition by parallel. Given enough threads, parallel can give 2--8 times efficiency boost.

Roadmap to version 0.1

Some requirements in Python array API

Other utilities

(parallel) index by axis

About crate

Minimal user document
Minimal dev document
Minimal correctness validation (testing)
Github action

Roadmap to future version

BLAS (OpenBLAS, MKL) device
GPU device
enhanced linalg
- faster matrix congruence ($C = A^T B A$), which is used in ERI transformation
- einops (may implement in another crate), which is convenient for many post-HF algorithms
more numpy features and near-full support to Python array API
user/dev/api documentation, testing, coverage, benchmarking
user documentation for best/recommended practice
optimization for memory-bounded operations in 1-D, 2-D cases (transpose, symmetrize, triangular-pack/unpack)
more blas/lapack and linalg may be implemented in another crate, to support more requirements for chemistry applications

Miscellaneous

RSTSR actually refers to its relationship with REST Tensor (REST), instead of Rust Tensor. This crate was originally tried to developed a more dev-friendly experience for chemist programmer from numpy/scipy/pytorch. But that can be a tough task.

It is grateful if you share your views on how to further improve this crate. This project is still in early stage, and radical code factorization could occur; dev-documentation has not been prepared now.

This crate gets inspires from numpy, array-api, ndarray, candle, burn.

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
assets		assets
notes		notes
rstsr-core		rstsr-core
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
katex-header.html		katex-header.html
readme.md		readme.md
rust-toolchain.toml		rust-toolchain.toml
rustfmt.toml		rustfmt.toml
tarpaulin.toml		tarpaulin.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RSTSR: A Rust Tensor Toolkit

Current Features

Roadmap to version 0.1

Roadmap to future version

Miscellaneous

About

Releases

Packages

Languages

License

ajz34/rstsr

Folders and files

Latest commit

History

Repository files navigation

RSTSR: A Rust Tensor Toolkit

Current Features

Roadmap to version 0.1

Roadmap to future version

Miscellaneous

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages