Steering #15053

shakedzy · 2025-08-03T08:17:29Z

shakedzy
Aug 3, 2025

It seems many recent papers and researches are dealing with a new technique named "Steering", where layers attentions are manipulated using a provided vector. On a more technical level, this involves another multiplication operation given an artifact of the vector/matrix.
At the moment, the only available tool to use this is transformers, as it allows full interaction and manipulation with the model weights.
I believe future versions of llama.cpp should support this functionality too. What do others here think?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Steering #15053

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Steering #15053

Uh oh!

shakedzy Aug 3, 2025

Replies: 0 comments

shakedzy
Aug 3, 2025