-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pod5 merge hangs indefinitely at 99-100%(the last 20 pod5 have not been merged) #131
Comments
Interesting. Are you able to merge the remaining 20 files into the ip_merge.pod5 file? |
Hi, @HalfPhoton
I run this code in conda environment.
I don't know how pod5 merge handles the order of files. Best wishes, |
ah - I see. In this case please create a list of missing read ids from the first merged output and all inputs using pod5 view. # get read ids
pod5 view -IH input_data/ -o input.ids
pod5 view -IH merged.pod5 -o merged.ids
# Sort the files (comm requires sorted files)
sort input.ids > input.ids.sorted
sort merged.ids > merged.ids.sorted
# Find ids in input that are not in merged file
comm -23 input.ids.sorted merged.ids.sorted > missing.ids
# Get a pod5 file of only missing ids
pod5 filter input_data/ --ids missing.ids -o missing.pod5
# Merge in missing ids
pod5 merge merged.pod5 missing.pod5 -o merged.final.pod5 |
I recommend using a python virtual environment instead of a conda environment: python3.10 -m venv venv --prompt=pod5
source venv/bin/activate
pip install -U pip pod5
pod5 --version |
Just for the record, the same thing happens to me, but all the files are actually processed and there's no missing reads. So it's probably something with the progress bar. |
Issue Description
Logs
This is input group.
This is ip group.
Here is my pod5 merge code:
Here is the size of merge_pod5 and multi_pod5:
It seems that the last 20 pod5 have not been merged.
Specifications
The text was updated successfully, but these errors were encountered: