This library is intended to be a reimplementation of the 'TransformerLens' library[^1] using JAX and the Flax module system.
The main distinguishing features of this library are as follows.
- No dependencies on PyTorch; all numerical operations are performed using JAX.
- The Flax module system is used for defining networks and storing activations.
- Simplified low-level module implementations are provided in a 'single batch' style.
The following prerequisites are required to use the library.
- A Python 3.7+ installation.
- A working installation of `jax` and `jaxlib` (either CPU or GPU).
- Other module requirements (see `requirements.txt`).
Assuming you have a working Python 3.7+ installation, you should first clone this project into a new directory `<project_dir>`, and then create and activate a virtual environment in `<env_dir>`, upgrading `pip` within it.
```sh
git clone https://github.com/alexjackson1/tx.git <project_dir>
cd <project_dir>
python -m venv <env_dir>
source <env_dir>/bin/activate
pip install --upgrade pip
```
To install a version of JAX that is compatible with your hardware, please refer to the installation instructions in the JAX project README. Installation via the `pip` wheel(s) is highly recommended.
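For example, a CPU-only build can typically be installed as shown below. The wheel names for GPU (CUDA) builds are hardware-specific, so copy the exact command from the JAX README; the `jax[cpu]` extra here is the conventional CPU-only install target, not something specific to this project.

```sh
# CPU-only example; see the JAX README for GPU/TPU variants.
pip install --upgrade "jax[cpu]"
```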
Once you have installed a compatible version of JAX, you can install the remaining requirements as follows. This includes Flax, the module system used by this library for defining networks.
```sh
pip install -r requirements.txt
```
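At this point you can sanity-check the environment by confirming that JAX and Flax import correctly and that JAX can see your devices (these are plain JAX/Flax calls, nothing specific to this library).

```python
import jax
import flax

print(jax.__version__, flax.__version__)  # installed versions
print(jax.devices())  # e.g. [CpuDevice(id=0)], or the available GPUs
```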
This library is still in development and is not yet ready for use.
The notebook(s) in the `examples` directory follow the tutorials on mechanistic interpretability provided here.
The API of this library is intended to (eventually) expose the same functionality as the original 'TransformerLens' library, making some changes where appropriate.
- The library seeks to model Transformer architectures and enable users to inspect intermediate activations and other hidden information (e.g. attention weights).
- Modules are written 'from scratch', attempting to eliminate abstractions that obfuscate the underlying mathematics.
- GPU acceleration is supported as a first-class feature.
- The transformer architecture and related algorithms use JAX instead of PyTorch for better performance and hardware acceleration.
- In keeping with the functional paradigm of JAX, the library and API are designed to be more functional in nature and embrace the Flax philosophy.
- Module definitions use a 'single batch' style made possible by `jax.vmap`, reducing cognitive load and improving readability (see the sketch after this list).
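To make the last point (and the goal of inspecting intermediate activations) concrete, here is a minimal sketch using plain Flax rather than this library's actual API: a module written for a single example records a hidden state with `sow`, and `jax.vmap` lifts it over a batch. The names `MLP` and `pre_activation` are illustrative, not part of this library.

```python
import jax
import jax.numpy as jnp
import flax.linen as nn


class MLP(nn.Module):
    """A toy single-example module: its input has no batch axis."""
    features: int

    @nn.compact
    def __call__(self, x):  # x: (d_in,)
        h = nn.Dense(self.features)(x)
        self.sow("intermediates", "pre_activation", h)  # record hidden state
        return nn.relu(h)


model = MLP(features=4)
xs = jnp.ones((3, 8))  # a batch of 3 examples, each of width 8

# Initialise with a single example, matching the single-batch style.
variables = model.init(jax.random.PRNGKey(0), xs[0])


def apply_one(x):
    # Returns (output, captured intermediates) for one example.
    return model.apply(variables, x, mutable=["intermediates"])


# vmap lifts the single-example function over the batch axis; the
# parameters are closed over and therefore shared across the batch.
ys, state = jax.vmap(apply_one)(xs)
print(ys.shape)  # (3, 4)
print(state["intermediates"]["pre_activation"][0].shape)  # (3, 4)
```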
This project is licensed under the terms of the MIT license.
The full license text can be found in the `LICENSE` file.
The original 'TransformerLens' library is also licensed under the terms of the MIT license. The full license text can be found here. Additionally, the original library can be cited as shown below.
```bibtex
@misc{nandatransformerlens2022,
  title = {TransformerLens},
  author = {Nanda, Neel and Bloom, Joseph},
  url = {https://github.com/neelnanda-io/TransformerLens},
  year = {2022}
}
```
[^1]: Formerly 'EasyTransformer', 'TransformerLens' is maintained by Joseph Bloom and was created by Neel Nanda.