Experiments for running toy models of superposition as in Anthropic's 2022 paper. These experiments focus on superposition of composed features.
First blog post from this repo: https://www.lesswrong.com/posts/a5wwqza2cY3W7L9cj/sparse-autoencoders-find-composed-features-in-small-toy