Non-normalised package name #73

Open
jayvdb opened this issue May 13, 2019 · 4 comments

Comments


jayvdb commented May 13, 2019

I would like to use the data to correlate with openSUSE package names, which use the 'real' name supplied in setup.py, i.e. the non-normalised name.

I've been doing a bit of research at hugovk/top-pypi-packages#4 and psincraian/pepy#128, and the raw data from BigQuery can include this, with a very small performance hit.

The query only needs to change from selecting file.project to selecting substr(max(file.filename),1,LENGTH(file.project)), or more likely selecting both.

Note this does depend on using standard SQL (#28).

Do we know the cost implications of those changes?
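
To make the substring idea above concrete, here is a minimal Python sketch of the same logic applied outside BigQuery. The filenames and the helper names `normalize` and `name_from_filename` are illustrative assumptions, not part of pypinfo. It also assumes the original name and the normalised name have the same length, which does not hold when a name contains a run of separators such as `foo__bar`.

```python
import re


def normalize(name: str) -> str:
    """PEP 503 normalisation: collapse runs of '-', '_', '.' into '-' and lowercase."""
    return re.sub(r"[-_.]+", "-", name).lower()


def name_from_filename(filename: str, project: str) -> str:
    """Mirror substr(filename, 1, LENGTH(project)): keep the leading characters."""
    return filename[: len(project)]


# Hypothetical filenames for one project, as they might appear in file.filename.
filenames = [
    "Flask-SQLAlchemy-2.4.0.tar.gz",
    "Flask_SQLAlchemy-2.4.0-py2.py3-none-any.whl",
]
project = "flask-sqlalchemy"  # the normalised name, as in file.project

# max() picks one filename deterministically, like MAX(file.filename) in the query.
recovered = name_from_filename(max(filenames), project)

# The recovered prefix should normalise back to the project name.
round_trips = normalize(recovered) == project
```

Note that wheel filenames replace `-` with `_`, so the recovered prefix reflects the filename's spelling and may differ from the name in setup.py in its separators.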

Owner

ofek commented May 13, 2019

Hello there!

I'm unsure of the cost implications of this. Though, I'll approve whatever @hugovk thinks is best 🙂

suv27 pushed a commit to suv27/pypinfo that referenced this issue Oct 5, 2019
Owner

ofek commented Oct 20, 2019

@hugovk

Collaborator

hugovk commented Oct 20, 2019

I don't know the cost implications; I guess the best way is to test it out.

If it costs more, I'd suggest adding a switch to include the change. I'm nearly always out of quota.
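
One way such a switch could look, as a rough sketch only: the function name `select_fields` and the flag `include_raw_name` are hypothetical and do not reflect pypinfo's actual query builder. The extra expression is the one proposed earlier in this issue, and it would need a query grouped by file.project because it uses MAX.

```python
from typing import List


def select_fields(include_raw_name: bool = False) -> List[str]:
    """Return the SELECT expressions, adding the raw name only when requested."""
    fields = ["file.project"]
    if include_raw_name:
        # Expression taken from the proposal in this issue; assumes standard SQL
        # and a GROUP BY on file.project in the surrounding query.
        fields.append("SUBSTR(MAX(file.filename), 1, LENGTH(file.project))")
    return fields


default_fields = select_fields()
extended_fields = select_fields(include_raw_name=True)
```

Keeping the default unchanged means users who are short on quota pay nothing extra unless they opt in.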

jayvdb mentioned this issue Oct 20, 2019

jaraco commented Aug 23, 2024

I'd also be interested in this feature. In coherent-oss/coherent.build#3, I'm building a database that keeps track of the normalized and non-normalized names (along with the Python root names it exposes). It currently resolves the package name from the wheel that it downloads, but there are situations where it would be advantageous to have the original name available prior to resolving the wheel (or when a wheel is unavailable).
