-
Notifications
You must be signed in to change notification settings - Fork 134
Issues: IBM/data-prep-kit
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] Add discord link on front page (README.md)
enhancement
New feature or request
#820
opened Nov 20, 2024 by
sujee
2 tasks done
[Feature] Some subdirs not cleaning up venv on make clean
enhancement
New feature or request
#819
opened Nov 20, 2024 by
daw3rd
2 tasks done
[Bug] parquet files with columns containing large list of byte arrays can not be read by pyarrow.
bug
Something isn't working
#816
opened Nov 20, 2024 by
daw3rd
2 tasks done
[Bug] pdf2parquet: identical PDF files have different Something isn't working
contents
bug
#812
opened Nov 19, 2024 by
sujee
1 of 2 tasks
[Feature] Extend New feature or request
doc_quality
to include stop words annotation
enhancement
#811
opened Nov 18, 2024 by
Harmedox
1 of 2 tasks
[Bug] Cannot run KFP pipeline for fuzzy dedup with more than 100 actors
bug
Something isn't working
#803
opened Nov 16, 2024 by
cmadam
2 tasks done
[Feature] Create a 'User Feedback' section in discussions
enhancement
New feature or request
#802
opened Nov 14, 2024 by
sujee
1 of 2 tasks
[Feature] RAG: when saving DPK processed data into vector database, optionally save it in llama-index format
enhancement
New feature or request
#795
opened Nov 12, 2024 by
sujee
2 tasks done
[Feature] Modify pdf2parquet to accept a parquet file with the payload in the content column
enhancement
New feature or request
#792
opened Nov 11, 2024 by
touma-I
1 of 2 tasks
[Feature] add an example of html2pq in the documentation
documentation
Improvements or additions to documentation
#788
opened Nov 8, 2024 by
sujee
2 tasks done
Enable DPK on native windows and then add info to readme
med priority
simplify-DPK
#783
opened Nov 6, 2024 by
Bytes-Explorer
Rename the "Intro" notebooks to call out specific functionality it supports (PDF to Embedings)
#782
opened Nov 6, 2024 by
Bytes-Explorer
Pass parameters to modules in a way familiar to Python users/developers
enhancement
New feature or request
simplify-DPK
#776
opened Nov 5, 2024 by
shahrokhDaijavad
1 of 2 tasks
[Feature] Restructure transforms as their own modules
enhancement
New feature or request
simplify-DPK
#774
opened Nov 5, 2024 by
touma-I
2 tasks done
[Bug] Python launcher error when the child process dies
bug
Something isn't working
fixed
Marks an issues as fixed in the dev branch
#770
opened Nov 5, 2024 by
sujee
1 of 2 tasks
fdedup ( fuzzy dedup ) is not available to install with new install method
bug
Something isn't working
#768
opened Nov 4, 2024 by
santoshborse
1 of 2 tasks
[Bug] CICD Workflow test-image fails on repo_level_order
bug
Something isn't working
#764
opened Oct 31, 2024 by
touma-I
1 of 2 tasks
Update RAG and Intro examples to use release 0.2.2.dev2 (after the pypi release)
enhancement
New feature or request
simplify-DPK
#763
opened Oct 31, 2024 by
shahrokhDaijavad
1 of 2 tasks
Template for single transform notebook examples
enhancement
New feature or request
simplify-DPK
#754
opened Oct 29, 2024 by
shahrokhDaijavad
1 of 2 tasks
Uniform documentation and example Notebooks for all transforms!
enhancement
New feature or request
simplify-DPK
#753
opened Oct 29, 2024 by
shahrokhDaijavad
1 of 2 tasks
[Bug] Add data-connector-lib to the make directory
bug
Something isn't working
#736
opened Oct 23, 2024 by
touma-I
1 of 2 tasks
[Bug] HAP kfp test failing
bug
Something isn't working
#734
opened Oct 23, 2024 by
touma-I
1 of 2 tasks
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.