Use patch_size instead of chunk_size as base shape for sampling #4

fercer · 2024-03-29T19:02:07Z

This change allows zarrdataset to extract samples using as reference the patch_size instead of the input's chunk size.
Basically, chunks are now considered as multiples of patch_size and therefore patches can be extracted without separation.
This is helpful when using zarrdataset for inference on larger-than-memory inputs.

No changes are needed ImageBase class since it can handle loading adjacent chunks. This will increase the memory usage if the patch size is not multiple of the chunk size due to multiple chunks being loaded.

There is no impact on the multi-thread capability of zarrdataset to use multiple workers.
That is because each worker has its own handler to access the zarr file, and chunks can be read safely without collisions.

…nput image's chunk sizes

…maller than patch sizes

… the chunk size

…size

fercer · 2024-04-30T15:23:12Z

I'm resuming the conversation from PR #6 here @ClementCaporal.

I'll be working in the next step to solving #3 and #5, by adding a way to extract overlapped patches.
My first attempt would be adding a stride parameter to the PatchSampler class that would allow to extract overlapped patches when stride < patch size.

The following step would be to allow ImageBase objects to add padding to patches retrieved from the cache when the requested slice is bigger than the actual image size. This is the case of edge chunks that are commonly smaller than the rest of the chunks in the image.

Padding is necessary because torch's DataLoader expects all samples to be of the same shape to collate them.

…ow overlapping patches extraction

ClementCaporal · 2024-05-02T12:45:22Z

Hello @fercer.

Thank you for the explanations. I will try this new implementation as soon as the overlap sample is ready. (I have my own small patch meanwhile)

Have a good day,

Clément

…hes of the defined shape

fercer · 2024-05-02T20:14:53Z

Thanks for your contribution to improve ZarrDataset @ClementCaporal!
The fixes to allow sampling with padding for inference have been implemented in this PR.
I'll add this functionality into the documentation and create a notebook example before merging with main branch.

…scales than image scale

ClementCaporal · 2024-05-08T21:39:27Z

docs/source/examples/advanced_example_pytorch_inference.md

+patch_sampler = zds.PatchSampler(patch_size=patch_size, pad=pad, allow_incomplete_patches=True)
+```
+
+Create a dataset from the list of filenames. All those files should be stored within their respective group "0".


I think there is a typo here for group "0", should it be "4" in this example?

Thanks for noticing this @ClementCaporal! I considered this change and added it to a recent PR #8 that addresses an incorrect sampling of masked regions.

Oh Nice!
I was starting to use masked regions on friday and started noticing strange behavior so I just have to pull now thanks to you!

Have a good week,

Clément

Changed PatchSampler to take as base the patche size instead of the i…

8ca4afc

…nput image's chunk sizes

fercer self-assigned this Mar 29, 2024

fercer mentioned this pull request Mar 29, 2024

Add Inference example Sampler #3

Closed

fercer added 2 commits March 29, 2024 16:46

Reverted change in the computation when masks elements are relative s…

4720c76

…maller than patch sizes

Fixed spatial chunk size computation when patch sizes are grater than…

9e1e985

… the chunk size

fercer mentioned this pull request Apr 29, 2024

Missing patches when zarr-chunks is not "full" #5

Closed

Fixed missing patches from chunks smaller than the input image chunk …

d7fb5f5

…size

fercer mentioned this pull request Apr 30, 2024

Fix grid missing border patches #6

Closed

Padding and stride added to PatchSampler and ImageBase classes to all…

1826378

…ow overlapping patches extraction

fercer added 2 commits May 2, 2024 10:38

Added tests for stride and pad parameters of PatchSampler class

b894985

Fixed patch slices generation in PatchSampler to always retrieve patc…

aaa51f6

…hes of the defined shape

fercer added 2 commits May 7, 2024 16:32

Standardized patch sampling method to handle smaller and bigger mask …

7885749

…scales than image scale

Added example notebook to documentation

a5a4097

fercer merged commit 4afb0bd into main May 7, 2024
2 checks passed

fercer deleted the overlap_sampling branch May 7, 2024 21:11

ClementCaporal reviewed May 8, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use patch_size instead of chunk_size as base shape for sampling #4

Use patch_size instead of chunk_size as base shape for sampling #4

fercer commented Mar 29, 2024

fercer commented Apr 30, 2024

ClementCaporal commented May 2, 2024

fercer commented May 2, 2024

ClementCaporal May 8, 2024

fercer May 10, 2024

ClementCaporal May 13, 2024

Use patch_size instead of chunk_size as base shape for sampling #4

Use patch_size instead of chunk_size as base shape for sampling #4

Conversation

fercer commented Mar 29, 2024

fercer commented Apr 30, 2024

ClementCaporal commented May 2, 2024

fercer commented May 2, 2024

ClementCaporal May 8, 2024

Choose a reason for hiding this comment

fercer May 10, 2024

Choose a reason for hiding this comment

ClementCaporal May 13, 2024

Choose a reason for hiding this comment