Neural-steering is a Python package for performing activation space interventions in neural networks. It provides tools and utilities for steering the behavior of language models through targeted modifications of their internal representations.
You can install the package locally using:
pip install .
For usage examples, please refer to the examples
directory in this repository. The examples demonstrate various techniques for neural steering and provide practical implementations using this package.