-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
calculate % of features unique to a sample #9
Comments
@jesudk2, it looks like we had this issue with a sample in one of our analyses that we could test this on, correct? |
yes there is one sample that has features that are sample specific. I can check against that sample and see what percentage there is. |
The idea is that I'll write a function in this pkg to do the calculations. I just wanted to confirm that we had an example in a data set to test this with. If you want to do an initial calculation to see if this metric might even be useful, that would be good. |
I will work on that today. I think it will be extremely useful though On Wed, Apr 13, 2016 at 3:46 PM, Robert M Flight [email protected]
|
Yes, I think so. I'm also interested to see how this sample truly compares to the others, and how useful this metric will actually be. We have an indication of it given how many features disappear when we remove that sample from consideration at the beginning, but this calculation actually gives us a firm number / fraction, and we can see if it really is an outlier with respect to this value. |
It seems that another indicator of problems would be the percentage of all of the features that are sample specific.
For example, if a sample has a large number of features that are only in that sample and no other, then we expect there could be a problem.
The text was updated successfully, but these errors were encountered: