Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EG 06.01 + 06.02 and start Listing 6.2 (Polars) #141

Merged
merged 7 commits into from
Dec 18, 2024
Merged

Conversation

nerdai
Copy link
Owner

@nerdai nerdai commented Dec 18, 2024

This PR adds the Polars crate to work with its DataFrames.

  • added download_smsspam_parquet fn to download parquet file from HF (unfortunately loading a .tsv into a DataFrame didn't work for me -- submitted issue to polars here)
  • modified EG 06.02 to be the one that loads the sms spam file and produces a df and computes value counts for the label
  • starting listing 6.2 (but will complete it in a separate PR)

@nerdai nerdai linked an issue Dec 18, 2024 that may be closed by this pull request
@nerdai nerdai enabled auto-merge (squash) December 18, 2024 06:18
@nerdai nerdai removed a link to an issue Dec 18, 2024
@nerdai nerdai linked an issue Dec 18, 2024 that may be closed by this pull request
@nerdai nerdai merged commit 2ce7560 into main Dec 18, 2024
2 checks passed
@nerdai nerdai deleted the nerdai/listing-6.2 branch December 18, 2024 06:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Enhance EG 06.01 to load in dataframe from spam .tsv file
1 participant