Permutation invariance - site ordering and data augmentation #75
Comments
Related stats.SE question and a related manuscript:
See |
Sort a list by multiple attributes?
Answer source: https://stackoverflow.com/a/4233482/13697228
This seems like a pretty reasonable implementation and should be directly compatible with
For example, sort by electronegativity, then by Wyckoff number, then by Wyckoff letter (with letters converted to integers)?
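A minimal sketch of the multi-attribute sort suggested above, using the tuple-key idiom from the linked Stack Overflow answer. The `SiteRecord` class, its field names, and the example values are hypothetical and purely illustrative; they are not part of xtal2png or pymatgen.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SiteRecord:
    """Hypothetical stand-in for a crystal site (not a pymatgen object)."""
    element: str
    electronegativity: float
    wyckoff_multiplicity: int
    wyckoff_letter: str


def site_sort_key(site):
    # Convert the Wyckoff letter to an integer ("a" -> 0, "b" -> 1, ...).
    letter_index = ord(site.wyckoff_letter) - ord("a")
    # Tuples compare element by element, so this sorts by electronegativity,
    # then by Wyckoff multiplicity, then by Wyckoff letter.
    return (site.electronegativity, site.wyckoff_multiplicity, letter_index)


# Illustrative values only (assumed for the example).
sites = [
    SiteRecord("O", 3.44, 3, "c"),
    SiteRecord("Ti", 1.54, 1, "b"),
    SiteRecord("Sr", 0.95, 1, "a"),
]

sorted_sites = sorted(sites, key=site_sort_key)
sorted_elements = [site.element for site in sorted_sites]
```

Because the key depends only on site attributes, any input ordering of the same sites gives the same sorted order, provided no two sites tie on all three attributes. Ties would need a further tiebreaker, such as fractional coordinates.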
Based on Wyckoff positions; maybe there is something to be adapted or learned from here: Goodall, R. E. A.; Parackal, A. S.; Faber, F. A.; Armiento, R.; Lee, A. A. Rapid Discovery of Stable Materials by Coordinate-Free Coarse Graining. Sci. Adv. 2022, 8 (30), eabn4117. https://doi.org/10.1126/sciadv.abn4117
xref: sparks-baird/matbench-genmetrics#77
If the sites aren't already sorted, it's best to sort them, perhaps using `s.copy(sanitize=True)`. This can be added as a hyperparameter. Swapping the order of the sites shouldn't affect the xtal2png encoding and decoding process.

Data augmentation is something I've considered, but with 52 sites the combinatorial space is enormous and probably intractable. In the worst case with 52 sites, and if I'm thinking about this correctly, that's nPr = $_{52}P_{52} = 52! \approx 8.07 \times 10^{67}$ orderings. It may be possible to do partial data augmentation, where only sites that share an element are permuted among themselves, but even that might be intractable.
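As a rough check on the counts discussed above, the sketch below computes the worst-case number of site orderings and the smaller number that arises when only sites sharing an element are permuted among themselves. The function names and the example composition are hypothetical; `math.perm` and `math.prod` require Python 3.8 or later.

```python
import math
from collections import Counter


def full_permutation_count(n_sites):
    # nPr with r == n, i.e. n! orderings of n distinguishable sites.
    return math.perm(n_sites, n_sites)


def within_element_permutation_count(elements):
    # Only sites sharing an element are shuffled among themselves,
    # so the count is the product of factorials of the per-element counts.
    counts = Counter(elements)
    return math.prod(math.factorial(c) for c in counts.values())


worst_case = full_permutation_count(52)
print(f"{worst_case:.2e}")  # 8.07e+67

# Hypothetical 5-site cell: one Sr, one Ti, three O.
example = ["Sr", "Ti", "O", "O", "O"]
print(within_element_permutation_count(example))  # 1! * 1! * 3! = 6
```

The within-element count grows quickly as well: a single element occupying 20 sites already gives 20! (about 2.4E18) orderings, so some form of random sampling of permutations would likely be needed rather than exhaustive enumeration.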