Skip to content

Commit 385a516

Browse files
committed
Fix: Correctly read query_pre_attn_scalar from text_config (Gemma3)
1 parent 17762c1 commit 385a516

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

exllamav2/config.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -296,8 +296,8 @@ def prepare(self, no_tensors: bool = False):
296296
self.num_key_value_groups = self.num_attention_heads // self.num_key_value_heads
297297
self.use_qk_norm = read(read_config, bool, ["use_qk_norm"], self.arch.lm.default_use_qk_norm)
298298

299-
self.query_pre_attn_scalar = read(read_config, float, "query_pre_attn_scalar", None)
300-
self.attention_multiplier = read(read_config, float, "attention_multiplier", None)
299+
self.query_pre_attn_scalar = read(read_config, float, ["query_pre_attn_scalar"], None, opt_subkey = "text_config")
300+
self.attention_multiplier = read(read_config, float, ["attention_multiplier"], None, opt_subkey = "text_config")
301301

302302
# MLP params
303303

0 commit comments

Comments
 (0)