-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
docs need to explain "aliases" better #254
Comments
Sorry to hear the explanation of this behavior isn't clear! If there are specific situations that you've run into that lead to diving deeply into this behavior, that might help with improving the docs. If it's just curiosity, then that's something I understand well :) We're always open to suggestions for how to improve the docs. From your questions I think there are a few points of clarification that would help:
To be honest, it took me quite a bit of head scratching, testing things in R, and reading the MASS book to figure out what R was doing clearly enough to implement the similar functionality here. We went back and forth quite a bit on how to explain things, and it's hard to strike a balance between being concise and clear without being overly technical and still providing enough information to make sense of behavior that can be sometimes counterintuitive. I'm afraid that there's no amount of explanation that will make this stuff be super clear without a similar amount of head scratching. |
Thank you for your detailed response. I haven't fully processed it, but I can tell you some more about why I am digging into this. I want to implement an extension of |
Ah yeah that makes sense. StatsModels SHOULD just work if you emit the necessary interaction terms before calling apply schema. Also I think in RegressionFormulae.jl we implement the R-style |
Thanks for the pointer; RegressionFormula.jl looks like just what I need. At least until things get more complicated. I'm still not following all the details, but perhaps the docs could just note that if expanding a categorical variable to full rank in a particular term would create a linear dependency with columns to the left, For comparison, With any elimination behavior, there are 3 approaches to documentation:
|
https://juliastats.org/StatsModels.jl/latest/contrasts/#Further-details says
I can't figure out what this means, even after looking at the examples following that quote. Are there some words missing?
For example:
~a+b+c
referring to the term that remains after droppinga
is meaningless since there are 2.a
alias bothb
andc
in that case?a
aliases the intercept1
. But if there is no explicit1
then what? (A bit further down it says ify~0+a
thena
aliases nothing. But what if the intercept is implicit?)~a&b + a&b&c
the first expressiona&b
is completely redundant. It is unclear how that's handled byStatsModels
or how it relates to the discussion of aliases.~a&b + a&c
also unclear.The text was updated successfully, but these errors were encountered: