Skip to content

🔶 Compressed bitvector/container supporting efficient random access and rank queries

License

Notifications You must be signed in to change notification settings

gvinciguerra/la_vector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LA-vector

LA-vector is a compressed bitvector/container supporting efficient random access and rank queries. It uses novel ways of compressing and accessing data by learning and adapting to data regularities, as described in this research paper.

GitHub Workflow Status License GitHub stars GitHub forks

This repo provides a reference C++17 implementation of LA-vector with the following features:

  • A container interface via the operator[] and lower_bound methods.
  • A succinct bitvector interface via the select and rank methods.
  • Navigation through iterators.
  • Serialisation capabilities.

Usage

LA-vector is implemented in the two header files inside the include directory, and it uses the sdsl header-only library.

To compile the tests and the example.cpp file, use the following commands:

git clone --recurse-submodules https://github.com/gvinciguerra/la_vector.git
cd la_vector
cmake . -DCMAKE_BUILD_TYPE=Release
make -j8

Contribute

Contributions are welcome. Some ideas:

  1. Using vector instructions (e.g. in la_vector::decode and ::lower_bound).
  2. Compressing segments (e.g. using a different encoding for slopes and intercepts).
  3. Improving the construction performance of la_vector_opt.

License

This project is licensed under the terms of the Apache License 2.0.

If you use this code for your research, please cite:

Antonio Boffa, Paolo Ferragina, and Giorgio Vinciguerra. A learned approach to design compressed rank/select data structures. ACM Transactions on Algorithms (2022).

Antonio Boffa, Paolo Ferragina, and Giorgio Vinciguerra. A “learned” approach to quicken and compress rank/select dictionaries. In Proceedings of the SIAM Symposium on Algorithm Engineering and Experiments (ALENEX), 2021.

@article{Boffa:2022talg,
	author = {Boffa, Antonio and Ferragina, Paolo and Vinciguerra, Giorgio},
	doi = {10.1145/3524060},
	issn = {1549-6325},
	journal = {ACM Transactions on Algorithms},
	title = {A Learned Approach to Design Compressed Rank/Select Data Structures},
	year = {2022}}

@inproceedings{Boffa:2021,
	author = {Boffa, Antonio and Ferragina, Paolo and Vinciguerra, Giorgio},
	booktitle = {Proceedings of the 23rd SIAM Symposium on Algorithm Engineering and Experiments (ALENEX)},
	doi = {10.1137/1.9781611976472.4},
	pages = {46--59},
	title = {A ``learned'' approach to quicken and compress rank/select dictionaries},
	year = {2021}}

The code to reproduce the experiments of the paper is available here.

About

🔶 Compressed bitvector/container supporting efficient random access and rank queries

Resources

License

Stars

Watchers

Forks

Releases

No releases published