Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

toil-vg wastes time filtering down input VCFs when only asked for the --primary graph #610

Open
adamnovak opened this issue Aug 20, 2018 · 2 comments

Comments

@adamnovak
Copy link
Member

I just started a toil-vg run with --primary, but not --pangenome or any of the other graph construction options. But I did pass it in a bunch of VCFs, and instructions to filter out NA12878 and the other ceph samples.

It ought to know that it doesn't actually need those VCFs to make the primary graph, and not use them.

But instead, it not only imports the VCFs, but wastes time filtering them down, only to not use them later.

This is inefficient and should be fixed.

@adamnovak adamnovak added the HAMP Haplotype-Aware Mapping Paper label Aug 20, 2018
adamnovak added a commit to vgteam/haplotype-aware-mapping-paper that referenced this issue Aug 20, 2018
@ghost ghost assigned adamnovak Aug 21, 2018
@ghost ghost added the in progress label Aug 21, 2018
@adamnovak adamnovak removed their assignment Aug 23, 2018
@adamnovak adamnovak added enhancement help wanted and removed HAMP Haplotype-Aware Mapping Paper labels Aug 23, 2018
@adamnovak
Copy link
Member Author

I'm going to leave this alone for now; just not passing in the VCFs when they aren't needed seems to be a good enough solution for my pipeline, but this is still a thing we probably want to fix eventually.

@glennhickey
Copy link
Collaborator

glennhickey commented Aug 23, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants