Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spatial sorting or shuffling #66

Open
benbovy opened this issue Sep 27, 2024 · 0 comments
Open

Spatial sorting or shuffling #66

benbovy opened this issue Sep 27, 2024 · 0 comments

Comments

@benbovy
Copy link
Member

benbovy commented Sep 27, 2024

This will be more useful when we support partitioning (chunking) the DGGS cell ids coordinate and its index.

Spatial sorting or shuffling consists of re-ordering (grouping) cells such that adjacent cell ids (cell ids within the same chunk) are spatially close to each other. In general this makes operations like spatial indexing much more efficient. See, e.g., dask-geopandas' guide on spatial partitioning.

The good thing is that most (all?) DGGSs already provide ways to efficiently measure (sort by) spatial locality, e.g., H3's coordinate systems, S2 cells aligned on a (Hilbert) space filling curve, HEALPix's well defined ring vs. nested ordering schemes, etc.

Another good thing is that we could probably leverage dask.array.shuffle and also support for shuffle being added in Xarray pydata/xarray#9320.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant