-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: adding data specific p-value filters #788
Changes from 3 commits
73b97a3
0077370
0f5b669
f629e63
9d66127
88e4dd6
996fe3a
5c57ab4
3b95cfd
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -37,6 +37,10 @@ def __init__( | |
finngen_susie_finemapping_cs_summary_files=finngen_susie_finemapping_cs_summary_files, | ||
) | ||
|
||
finngen_finemapping_df = finngen_finemapping_df.validate_lead_pvalue( | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Minor stylistic comment, and it is absolutely my preference, but I like to use as few variables as possible: (
# Reading Finngen finemapped dataset and convert it to study locus:
FinnGenFinemapping.from_finngen_susie_finemapping(
spark=session.spark,
finngen_susie_finemapping_snp_files=finngen_susie_finemapping_snp_files,
finngen_susie_finemapping_cs_summary_files=finngen_susie_finemapping_cs_summary_files,
)
# Flagging sub-significnat loci:
.validate_lead_pvalue(
pvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold
)
# Write the output.
.df.write.mode(session.write_mode).parquet(
finngen_finemapping_out
)
) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Please carefully check, I removed the variables but not sure |
||
pvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold | ||
) | ||
|
||
# Write the output. | ||
finngen_finemapping_df.df.write.mode(session.write_mode).parquet( | ||
finngen_finemapping_out | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's one thing I'm not sure about. You have added
finngen_finemapping_lead_pvalue_threshold
to the relevant config, and refer to aspvalue_cutoff=FinngenFinemappingConfig().finngen_finemapping_lead_pvalue_threshold
in the step. However,finngen_finemapping_lead_pvalue_threshold
it not an argument forFinnGenFinemappingIngestionStep
. Is it OK? Would all parameters in the config passed to the step? Would that cause any problem? @project-defiant , what do you think?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK, double-checked with @project-defiant and @d0choa and all parameters in the setepConfig classes needs to be parameters in the init function of the step.