Commit f880fde

Update README.md
1 parent b0d48bd commit f880fde

1 file changed: +2 -0 lines changed

README.md

Lines changed: 2 additions & 0 deletions
@@ -1,3 +1,5 @@
+[![Verify Numbers](https://github.com/nulone/sae-consciousness-steering-pitfalls/actions/workflows/verify.yml/badge.svg)](https://github.com/nulone/sae-consciousness-steering-pitfalls/actions/workflows/verify.yml)
+
# SAE Consciousness Steering: A Multi-Model Null Result

I tried to use contrastive SAE discovery to find features that control how language models answer consciousness-related questions. After 9 experiments on Gemma 3 4B and Gemma 3 12B (plus a qualitative Neuronpedia label search on Llama 3.3 70B), I found no evidence of causal consciousness features with this pipeline. The contrastive method finds punctuation, Japanese grammar, and self-referential discourse markers — not consciousness.
