Commit b0b3266
committed
fix: normalize logits to probabilities before computing adaptive temperature
In AdaptiveTeacherModel.ComputeAdaptiveTemperature, difficulty was computed
from raw logits which broke bounds. Now:
- Convert logits to probabilities using softmax before difficulty computation
- Clamp resulting difficulty to [0,1] for all strategies
- Ensures temperature stays within [_minTemperature, _maxTemperature] bounds1 parent 8beeed2 commit b0b3266
1 file changed
+8
-2
lines changedLines changed: 8 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
178 | 178 | | |
179 | 179 | | |
180 | 180 | | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
181 | 184 | | |
182 | 185 | | |
183 | 186 | | |
184 | 187 | | |
185 | 188 | | |
186 | 189 | | |
187 | | - | |
| 190 | + | |
188 | 191 | | |
189 | 192 | | |
190 | 193 | | |
191 | 194 | | |
192 | | - | |
| 195 | + | |
193 | 196 | | |
194 | 197 | | |
195 | 198 | | |
| |||
198 | 201 | | |
199 | 202 | | |
200 | 203 | | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
201 | 207 | | |
202 | 208 | | |
203 | 209 | | |
| |||
0 commit comments