Add model card for SAEBench models (#1)
Browse files- Add model card for SAEBench models (9ba849930a2e26d91119dbbb5ad307f8907148aa)
Co-authored-by: Niels Rogge <[email protected]>
README.md
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
library_name: transformers
|
| 4 |
+
pipeline_tag: feature-extraction
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
# SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability
|
| 8 |
+
|
| 9 |
+
This repository contains model checkpoints for Sparse Autoencoders (SAEs), as described in the paper [SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability](https://huggingface.co/papers/2503.09532).
|
| 10 |
+
|
| 11 |
+
Project page: https://saebench.xyz.
|
| 12 |
+
For code, please see the [SAEBench repository](https://github.com/adamkarvonen/SAEBench).
|