Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Difficulty Downloading AudioSet from Pan Baidu Storage for Non-Chinese Users #63

Open
gevmin94 opened this issue Sep 29, 2023 · 1 comment

Comments

@gevmin94
Copy link

gevmin94 commented Sep 29, 2023

Hi @qiuqiangkong,

I have been encountering challenges while attempting to download data from Pan Baidu Storage, especially from outside of China. Unfortunately, I haven't been able to locate any client libraries or command-line utilities that would facilitate managing this storage platform. Additionally, it appears that even without an account, downloading data is not feasible, and the registration process for non-Chinese users is not straightforward.

In light of these issues, I would like to propose an alternative solution: transferring the data to more accessible services such as Zenodo. I am fully prepared to carry out this transfer and share the dataset link in the repository if you can provide an alternative method for accessing the data.

Please let me know if you have any other suggestions or if you would like to proceed with this plan.

@gevmin94
Copy link
Author

gevmin94 commented Oct 5, 2023

Update

I managed to download the dataset and upload a compressed version here: https://www.kaggle.com/datasets/gev1994/audioset-encodec-3k-bitrates/data, recent work called audioformer used discrete audio tokens from EnCodec instead of signal processing based audio features.
Hope this will be useful for the community.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant