Commit 6958888
committed
fix: Use per-layer sizes in granite build_attention_layer
Also no need to pass in kv cache since it's already in the inp_attn
Branch: GraniteFour
Signed-off-by: Gabe Goodhart <[email protected]>1 parent 52cd6d1 commit 6958888
1 file changed
+3
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12749 | 12749 | | |
12750 | 12750 | | |
12751 | 12751 | | |
12752 | | - | |
12753 | | - | |
12754 | | - | |
| 12752 | + | |
| 12753 | + | |
| 12754 | + | |
12755 | 12755 | | |
12756 | 12756 | | |
12757 | 12757 | | |
| |||
0 commit comments