-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use fsspec
instead of pyfilesystem2
#102
Comments
TBF, I think |
xref: #7 |
I've been talking to @martindurant about @dhirschfeld I don't have a huge quantity of bandwidth to work on jupyter-fs right now, and most of my effort is currently going towards the new |
It's an itch I'd like to scratch, but realistically won't have time to look at any time soon. I'm using Unfortunately, since it's a "nice-to-have" rather than a "can't live without" I won't be able to invest time into it in the medium term - I can't even keep up with my can't live without's :/ |
Is there any update on this? If not, I would like to work on this issue, to use fsspec for protocols not supported by pyfilesystem2.
Based on the above comments, I am considering either of the following policies, but would appreciate comments if you have a preference.
If there is no preference, I would like to proceed with 1 for future expansion. |
Note that fsspec instances generally need more configuration. Whilst it is possible to set the default values for any particular protocol, it is very conceivable to want different configurations for, e.g., an owned bucket, a public bucket and a requestor-pays bucket on S3. (or even different S3-compatible service) |
Thank you for your comment. (Not related to the issue, but I also find fsspec useful on a daily basis. |
I have started to implement the addition of fsspec. Since fsspec.core.url_to_fs() is used internally to create instances, I began to think that making 'kwargs' configurable in addition to 'backend' would solve the problem you mentioned. Of course, as an interface to JupyterLab's setting, this would be redundant. However, this is not a big problem because this function is only for users who want to do complicated things. |
Thanks @reoono , let me know if I can help. |
Is the effort here related to https://github.com/fsspec/jupyter-fsspec |
jupyter-fsspec is "inspired" by this repo, and is only in early stages so far. If you would like to port any functionality or otherwise help develop it, that would be cool. |
I do still plan on moving to |
Before
fsspec
existed I usedpyfilesystem2
and was very happy with it - it's a great library however it (apparently) didn't meet all the requirements fordask
sofsspec
was built, primarily to supportdask
, but it's also used inintake
and as a generic filesystem api. As such it has a robust community around it and is continually improving and maturing.Coming from the distributed computing world it has first-class support for cloud storage, and in particular (for my use-case) Azure Data Lake.
I haven't actually used the cloud storage plugins in
pyfilesystem2
but they don't seem to have a lot of development momentum behind them, unlikefsspec
.To better support cloud filesystems I think it would be great if
jupyter-fs
could make use offsspec
rather thanpyfilesystem2
The text was updated successfully, but these errors were encountered: