Add support for webdataset #9

Open
faroit opened this issue Dec 13, 2020 · 1 comment
Labels
enhancement New feature or request torch

Comments

@faroit

faroit commented Dec 13, 2020

Using WebDataset is a great way to speed up the training pipeline. It also makes it convenient to share and download archives of datasets (e.g. by uploading them to Zenodo).

Addressing this issue should involve:

  • a method to write webdataset tar files using the gbif_dl.io method.
  • a torch dataset class/pipeline to parse the dataset.
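
To make the two items above concrete, here is a minimal sketch of the WebDataset tar convention, in which the files of one sample share a basename and differ only in extension. It uses only the Python standard library and is not the gbif_dl or webdataset API: `write_shard` and `read_shard` are hypothetical helper names, and the `.jpg`/`.json` pairing is an assumption about what a sample would contain.

```python
import io
import json
import tarfile
from collections import defaultdict


def write_shard(samples, shard_path):
    """Write (key, image_bytes, metadata_dict) triples to a WebDataset-style tar.

    Hypothetical helper: each sample becomes two tar members that share
    the same basename, e.g. 0001.jpg and 0001.json.
    """
    with tarfile.open(shard_path, "w") as tar:
        for key, image_bytes, metadata in samples:
            members = {
                f"{key}.jpg": image_bytes,
                f"{key}.json": json.dumps(metadata).encode("utf-8"),
            }
            for name, payload in members.items():
                info = tarfile.TarInfo(name=name)
                info.size = len(payload)
                tar.addfile(info, io.BytesIO(payload))


def read_shard(shard_path):
    """Group tar members back into samples keyed by basename.

    Hypothetical helper: returns {key: {extension: raw_bytes}}.
    Assumes keys contain no dots and no directory components.
    """
    samples = defaultdict(dict)
    with tarfile.open(shard_path, "r") as tar:
        for member in tar:
            if not member.isfile():
                continue
            key, _, ext = member.name.partition(".")
            samples[key][ext] = tar.extractfile(member).read()
    return dict(samples)
```

A torch dataset class could then wrap something like `read_shard` and decode the image bytes and JSON per sample; in practice the `webdataset` package would likely take over the reading side.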
@faroit

faroit commented Feb 4, 2022

Supporting https://github.com/rom1504/img2dataset for downloading could close this issue.
