Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for pointing to the correct VCF files for comparison #4

Open
tnnandi opened this issue Jan 7, 2023 · 0 comments
Open

Request for pointing to the correct VCF files for comparison #4

tnnandi opened this issue Jan 7, 2023 · 0 comments

Comments

@tnnandi
Copy link

tnnandi commented Jan 7, 2023

Hi,

I'm running a validation study to compare Parliament2 SV calls with the GIAB v0.6 truth set using 60X hg002.

I was looking at the VCF files at https://github.com/slzarate/parliament2/tree/master/benchmarking_data/hg002_benchmarks, specifically the HG002-NA24385-50x.70_percent.markdup.realigned.combined.genotyped.formatted.vcf file and wondering if all the DEL calls in the file were used to create Fig 1 of the Zarate et al. (2020) paper or were they again filtered (to keep only those calls corresponding to the Tier1 high confidence regions of the GIAB v0.6 truth set) for making the plots?

Also, can you please confirm if the VCF and BED files at https://ftp-trace.ncbi.nlm.nih.gov/ReferenceSamples/giab/release/AshkenazimTrio/HG002_NA24385_son/NIST_SV_v0.6/ were used as the truth set, and to denote the Tier1 high confidence regions?

Based on the above two GIAB VCF and BED files, I'm finding that Parliament2 predicts significantly large number of deletions in the 400-1000 bp range, but Fig 1(b) in the Zarate et al. (2020) paper appears to show reasonably high precision and recall for that range. I'm finding around 21000 calls in the Tier1 high confidence regions, among which there are around 6000 DELs in the 400-1000 bp range (while the GIAB truth set has approximately 1300 DELs within that range).

I'd appreciate it if you can confirm if I am looking at the correct files for comparison and if I'm interpreting them correctly.

Thank you very much.

@tnnandi tnnandi changed the title Request for pointing to the VCF files for comparison Request for pointing to the correct VCF files for comparison Jan 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant