New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

feat: add the step class for fine-mapping #554

Merged

addramir merged 8 commits into dev from ytdc_susie_finemapper_2

Apr 2, 2024

Contributor

addramir commented Mar 21, 2024 •

edited

Loading

✨ Context

This PR introduced the fine-mapper step and has a function that converts susie output to study-locus.

🛠 What does this PR implement

See above.

🙈 Missing

Documentation will be updated later.

🚦 Before submitting

Do these changes cover one single feature (one change at a time)?
Did you read the contributor guideline?
Did you make sure to update the documentation with your changes?
Did you make sure there is no commented out code in this PR?
Did you follow conventional commits standards in PR title and commit messages?
Did you make sure the branch is up-to-date with the dev branch?
Did you write any new necessary tests?
Did you make sure the changes pass local tests (make test)?
Did you make sure the changes pass pre-commit rules (e.g poetry run pre-commit run --all-files)?

addramir added 2 commits

March 21, 2024 13:06


          feat: add the step class for fine-mapping

8f165e9


          Merge branch 'dev' into ytdc_susie_finemapper_2

e5c87ac

github-actions bot added size-M Step Feature labels

addramir marked this pull request as ready for review

March 21, 2024 13:39


          test: adding test for susie to studylocus converter

a1dc3a8

addramir requested review from Daniel-Considine and d0choa

March 21, 2024 14:10


          chore: fix the class description

3c1e11d

addramir requested a review from DSuveges

March 22, 2024 13:58

DSuveges and others added 3 commits

March 22, 2024 14:32


          Merge branch 'dev' into ytdc_susie_finemapper_2

bbc0bc9


          Merge branch 'dev' into ytdc_susie_finemapper_2

caf4d7b


          Merge branch 'dev' into ytdc_susie_finemapper_2

ba1f39f

DSuveges approved these changes

View reviewed changes

Contributor

DSuveges left a comment

I only have a few small comments that doesn't affect the logic, so not a deal breaker for merging. My comments are stylisting, if you want to address them, you can before merge.

src/gentropy/susie_finemapper.py

+                                  cred_set.tolist(),
+                                  ["variantId", "posteriorProbability", "logBF", "beta"],
+                              )
+                              .join(

Contributor

DSuveges Apr 2, 2024

At this join the mode is inner by default. Is it expected?

Contributor Author

addramir Apr 2, 2024

Yes, it should be 1 to 1 the same size and order

src/gentropy/susie_finemapper.py Outdated

+                          session (Session): Spark session
+                          _studyId (str): study ID
+                          _region (str): region
+                          _join (DataFrame): DataFrame with variant information

Contributor

DSuveges Apr 2, 2024

I would rename the _join variable to something more intuitive.

Contributor Author

addramir Apr 2, 2024

Done

src/gentropy/susie_finemapper.py Outdated

+                      order_creds.sort(key=lambda x: x[1], reverse=True)
+                      cred_sets = None
+                      counter = 0
+                      for i, value in order_creds:

Contributor

DSuveges Apr 2, 2024

i is fine, this is a scanonical way of calling an index variable, however value is not very telling. At row 110 it is very complicated what does this valuerefer to.

Contributor Author

addramir Apr 2, 2024

renamed it for cs_lbf_value

src/gentropy/susie_finemapper.py

+                          susie_output (dict[str, Any]): SuSiE-inf output dictionary
+                          session (Session): Spark session
+                          _studyId (str): study ID
+                          _region (str): region

Contributor

DSuveges Apr 2, 2024

Do we have a consisted way of representing regions? If so, in the args description could be written eg.:

_region (str): finemapped region in chr:start-end format

Contributor Author

addramir Apr 2, 2024

No, we don't have it now. But agree, we need to think about the standard.

src/gentropy/susie_finemapper.py Outdated

+                          if cred_sets is None:
+                              cred_sets = cred_set
+                          else:
+                              cred_sets = cred_sets.union(cred_set)

Contributor

DSuveges Apr 2, 2024

I would use unionByName rather union, because union concatenates columns by positions instead of names. I think in this case it is fine, as the order of columns are defined in rwo 120, but still.

Contributor Author

addramir Apr 2, 2024

Done.

tests/gentropy/method/test_susie_inf.py

+                          _join=gwas_df,
+                          cs_lbf_thr=2,
+                      )
+                      assert isinstance(L1, StudyLocus), "L1 is not an instance of StudyLocus"

Contributor

DSuveges Apr 2, 2024

I'm not sure how does the test dataset look like, it would be great to assert that the number of credible set is what you are expecting, and validate if the locus object is healthy. However I understand if that is not a high priority for now.

Contributor Author

addramir Apr 2, 2024

I think we can create more meaningful test for bigger function that will use this convertor on later stages.


          chore: answering comments

60d482f

addramir merged commit d76ebbe into dev

4 checks passed

addramir deleted the ytdc_susie_finemapper_2 branch

April 2, 2024 13:52

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Feature size-M Step