SAE Lens

SAE Lens

SAELens exists to help researchers:

Train sparse autoencoders.
Analyse sparse autoencoders / research mechanistic interpretability.
Generate insights which make it easier to create safe and aligned AI systems.

Please refer to the documentation for information on how to:

Download and Analyse pre-trained sparse autoencoders.
Train your own sparse autoencoders.
Generate feature dashboards with the SAE-Vis Library.

SAE Lens is the result of many contributors working collectively to improve humanity's understanding of neural networks, many of whom are motivated by a desire to safeguard humanity from risks posed by artificial intelligence.

This library is maintained by Joseph Bloom and David Chanin.

Loading Pre-trained SAEs.

Pre-trained SAEs for various models can be imported via SAE Lens. See this page in the readme for a list of all SAEs.

Tutorials

Join the Slack!

Feel free to join the Open Source Mechanistic Interpretability Slack for support!

Citation

Please cite the package as follows:

@misc{bloom2024saetrainingcodebase,
   title = {SAELens
   author = {Joseph Bloom, David Chanin},
   year = {2024},
   howpublished = {\url{https://github.com/jbloomAus/SAELens}}
}}

Name		Name	Last commit message	Last commit date
Latest commit History 585 Commits
.github		.github
.vscode		.vscode
content		content
docs		docs
sae_lens		sae_lens
scripts		scripts
tests		tests
tutorials		tutorials
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
check_open_ai_sae_metrics.ipynb		check_open_ai_sae_metrics.ipynb
eval_metrics_resid_mid_oai.csv		eval_metrics_resid_mid_oai.csv
make_hf_repo.sh		make_hf_repo.sh
makefile		makefile
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAE Lens

Loading Pre-trained SAEs.

Tutorials

Join the Slack!

Citation

About

Releases

Packages

Languages

License

AlignmentResearch/lp_sae

Folders and files

Latest commit

History

Repository files navigation

SAE Lens

Loading Pre-trained SAEs.

Tutorials

Join the Slack!

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages