Virtualizarr + Coiled Serverless Example Notebook #233

norlandrhagen · 2024-08-27T21:57:28Z

Inspired by @thodson-usgs's Lithops example, I've created a Virtualizarr example using coiled serverless functions.

1TB virtual dataset from 924 NetCDF files
9 minutes and ~$0.24 of cloud cost on coiled

Would love some feedback if anyone has thoughts.

Changes are documented in docs/releases.rst

TomNicholas · 2024-08-27T22:26:46Z

Awesome!!

I'm a bit unclear where the in-memory datasets live at each point in the computation when using coiled functions. You do the reference generation on a bunch of separate instances, but in order to to combine_by_coords they all have to be on the same instance. At what point does that transfer occur?

norlandrhagen · 2024-08-27T22:40:09Z

I'm a bit unclear where the in-memory datasets live at each point in the computation when using coiled functions. You do the reference generation on a bunch of separate instances, but in order to to combine_by_coords they all have to be on the same instance. At what point does that transfer occur?

I'm pretty sure the .map returns a generator of all the virtual datasets to my local laptop.
Coiled allows you to run notebooks in the cloud. I successfully tried running all the reference generation serverless functions from a coiled cloud notebook and that also worked great. It might be a good option for a larger dataset, but I think since the manifest arrays are so memory efficient, I didn't see any issues doing the reduce on a laptop.
I also tried starting the reference generation severless functions from a larger serverless function (that was meant to be the reduce machine) and it was kinda wacky.

It would be nice to find the limits to where this becomes a problem! At that point, maybe the lithops map-reduce executor or beam would be a better option.

examples/coiled/terraclimate.ipynb

terraclimate_coiled ex

3749270

norlandrhagen added the usage example Real world use case examples label Aug 27, 2024

norlandrhagen temporarily deployed to test-release August 27, 2024 21:58 — with GitHub Actions Inactive

norlandrhagen changed the title ~~terraclimate_coiled ex~~ Virtualizarr + Coiled Serverless Example Notebook Aug 27, 2024

adds example to releases.rst

2f59cc4

norlandrhagen temporarily deployed to test-release August 28, 2024 15:59 — with GitHub Actions Inactive

adds fastparquet to ex deps for writing refs

11ceb94

norlandrhagen temporarily deployed to test-release August 28, 2024 17:16 — with GitHub Actions Inactive

TomNicholas reviewed Aug 29, 2024

View reviewed changes

examples/coiled/terraclimate.ipynb Outdated Show resolved Hide resolved

TomNicholas approved these changes Aug 29, 2024

View reviewed changes

removed FileType

5930ff7

norlandrhagen temporarily deployed to test-release August 29, 2024 15:43 — with GitHub Actions Inactive

norlandrhagen merged commit 708d168 into main Aug 29, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Virtualizarr + Coiled Serverless Example Notebook #233

Virtualizarr + Coiled Serverless Example Notebook #233

norlandrhagen commented Aug 27, 2024 •

edited

Loading

TomNicholas commented Aug 27, 2024

norlandrhagen commented Aug 27, 2024 •

edited

Loading

Virtualizarr + Coiled Serverless Example Notebook #233

Virtualizarr + Coiled Serverless Example Notebook #233

Conversation

norlandrhagen commented Aug 27, 2024 • edited Loading

TomNicholas commented Aug 27, 2024

norlandrhagen commented Aug 27, 2024 • edited Loading

norlandrhagen commented Aug 27, 2024 •

edited

Loading

norlandrhagen commented Aug 27, 2024 •

edited

Loading