CCA/PLS Toolkit

This is a MATLAB toolkit to incorporate Canonical Correlation Analysis (CCA), Partial Least Squares (PLS) and their different variants to investigate multivariate associations between multiple modalities of data, e.g., brain imaging and behaviour. These models find pairs of weights (one weight for each data modality) such that the linear combination of the brain and behavioural variables maximise correlation (CCA) or covariance (PLS).

The toolkit includes various options for CCA/PLS models (e.g., standard CCA, standard PLS, regularized CCA, sparse PLS) and analysis frameworks (e.g., statistical framework, machine learning framework). It can also perform Principal Component Analysis (PCA) to reduce the dimensionality of the data before entering them into standard CCA analysis (PCA-CCA).

Although there are methods to estimate all weights (or associative effects) for most CCA/PLS models at once, the toolkit uses an interative solution to be able to optimize the hyperparameters of the model (i.e., number of principal components or regularization parameters) for each associative effect independently. In such iterative solution, the CCA/PLS model estimates one pair of weights (one weight for each data modality) at a time. These associative effects are then removed from the data (by a process called deflation) and the same process is repeated multiple times. The iterative solution also allows to estimate different PLS variants by choosing a specific deflation.

For a short theoretical introduction to the CCA/PLS models, analytic frameworks and deflation methods used in the toolkit, see the link to the online documentation below. For further reading, see:

Shawe-Taylor J, Cristianini N (2004) Kernel Methods for Pattern Analysis. Cambridge: Cambridge University Press.
Rosipal R, Kramer N (2006) Overview and Recent Advances in Partial Least Squares. In: Saunders, C., Grobelnik, M.m Gunn, S., Shawe-Taylor, J. (eds) Subspace, Latent Struct Featur Sel. Berlin, Heidelberg: Springer Berlin Heidelberg, pp 34-51.
Krishnan A, Williams LJ, McIntosh AR, Abdi H (2011) Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review. Neuroimage. 56: 455-475.
Monteiro JM, Rao A, Shawe-Taylor J & Mourao-Miranda J (2016) A multiple hold-out framework for Sparse Partial Least Squares. J. Neurosci. Methods 271, 182-194.
Mihalik A, Ferreira FS, Moutoussis M et al. (2020) Multiple Holdouts With Stability: Improving the Generalizability of Machine Learning Analyses of Brain-Behavior Relationships. Biol. Psychiatry 87, 368-376.
Winkler AM, Renaud O, Smith SM, Nichols TE (2020) Permutation inference for canonical correlation analysis. Neuroimage 220, 117065
Mihalik A, Chapman J, Adams RA et al. (2022) Canonical Correlation Analysis and Partial Least Squares for identifying brain-behaviour associations: a tutorial and a comparative study. Biol. Psychiatry Cogn. Neurosci. Neuroimaging doi: https://doi.org/10.1016/j.bpsc.2022.07.012

Documentation

Our detailed documentation can be found here.

Contributors

Agoston Mihalik - main developer (former at UCL, now at University of Cambridge, UK)
Nils Winter (University of Münster, Germany)
Fabio Ferreira (former at UCL, now at Imperial College London, UK)
James Chapman (UCL, UK)
Janaina Mourao-Miranda - Principal Investigator (UCL, UK)

Some of the code used in the toolkit was developed by Joao Monteiro who was a PhD student at UCL (currently a data scientist at Heni). We wish to thank members and collaborators of the Machine Learning & Neuroimaging Laboratory for testing the toolkit and providing invaluable feedback. We would particularly like to acknowledge Eliana Nicolaisen, Cemre Zor, Konstantinos Tsirlis, Taiane Ramos and Richard Nguyen.

Feel free to report any bugs under [email protected] or by creating an issue here. Pull requests are also welcome to https://github.com/anaston/cca_pls_toolkit, however, unfortunately we don't have the resources to provide general user support.

Acknowledgements

The CCA/PLS toolkit was developed at the Machine Learning & Neuroimaging Laboratory (MLNL), Centre for Medical Imaging Computing, Computer Science Department, University College London, UK. The development of the toolkit was supported by the Wellcome Trust (grant number WT102845/Z/13/Z).

License

This project is licensed under the terms of the GNU General Public License v3.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
demo		demo
documentation		documentation
examples		examples
fileio		fileio
machines		machines
misc		misc
plot		plot
test		test
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cfg_defaults.m		cfg_defaults.m
res_defaults.m		res_defaults.m
set_path.m		set_path.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CCA/PLS Toolkit

Documentation

Contributors

Acknowledgements

License

About

Releases

Contributors 3

Languages

License

anaston/cca_pls_toolkit

Folders and files

Latest commit

History

Repository files navigation

CCA/PLS Toolkit

Documentation

Contributors

Acknowledgements

License

About

Resources

License

Stars

Watchers

Forks

Releases

Contributors 3

Languages