add option to download default models #200

ljchang · 2024-01-02T17:52:30Z

This PR adds an option when installing py-feat via pip to also download the default models.

pip install py-feat[default_models]

@ejolly Let's make sure this works first before merging as I havent really tested it yet.

ejolly · 2024-03-29T20:12:39Z

@ljchang Unfortunately this isn't going to work because you can only include package names in extras_require.
In fact pip doesn't seem to support any kind of post-install that isn't simply installing other packages due to security issues. That's also why they suggest including any package data within the package if you need it. I don't think that makes sense for us as our pip installs would be huge and would tie model weights to package versions.

I've added an alternative solution, which is a compromise, but still a little annoying:

User pip install py-feat
User runs feat_get_models command their terminal which will be automatically setup after they pip install

It's not that different that simply downloading the models on first run of Detector, so I'm torn about whether it's worth adding. What do you think?

ljchang · 2024-07-18T15:53:00Z

@ejolly, I've only scratched the surface of my deep dive into hugging face repositories, but I definitely think this is the way to go. I'm going to keep adding notes here as I learn more.

I've created an organization for the lab to host datasets or model repositories.
models and datasets can be public or private and can solicit community feedback or block it.
models and datasets can be versioned
webhooks are possible . One thing I've wanted for a long time is to build a benchmarking server, which I think will be possible with hugging face. We can post our test data as private to hugging face (our EULAs prevent us from making it public). Everytime a model is updated or a new one is added, we can add a webhook to run our benchmarking tests on that model or all of them. Honestly, I don't care if we have to pay for compute time on one of their spaces, this would be amazing and would enable a living benchmark for py-feat.
there is a python cli for working with repositories and model i/o.
models can be standalone and dowloaded, OR they can be integrated into a code repository . I think we would want to do this so you can download models from py-feat, just like you can from the transformers library.
jupyter notebooks can be rendered and linked to colab. This could be nice for demos or tutorials
We should do a deepdive into the possibility of porting py-feat to be an integrated library
There are widgets to create live demos for each model. Not sure this will work for us or not.

ljchang · 2024-08-05T04:04:46Z

this is addressed in issue #221

ejolly · 2024-10-19T05:12:15Z

Subsumed by #228

ljchang requested a review from ejolly January 2, 2024 17:53

ljchang and others added 2 commits March 29, 2024 15:41

add option to download default models

497e76d

alternative CLI-based model download

7bcad4a

ejolly force-pushed the updatesetup branch from 7f0f12d to 7bcad4a Compare March 29, 2024 20:08

ljchang self-assigned this Jul 18, 2024

ljchang added the investigate label Jul 18, 2024

ejolly closed this Oct 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add option to download default models #200

add option to download default models #200

ljchang commented Jan 2, 2024 •

edited

Loading

ejolly commented Mar 29, 2024

ljchang commented Jul 18, 2024 •

edited

Loading

ljchang commented Aug 5, 2024

ejolly commented Oct 19, 2024

add option to download default models #200

add option to download default models #200

Conversation

ljchang commented Jan 2, 2024 • edited Loading

ejolly commented Mar 29, 2024

ljchang commented Jul 18, 2024 • edited Loading

ljchang commented Aug 5, 2024

ejolly commented Oct 19, 2024

ljchang commented Jan 2, 2024 •

edited

Loading

ljchang commented Jul 18, 2024 •

edited

Loading