Commit 21a696b
[None][feat] Optimize the q3n decode kernel with IO read (NVIDIA#11344)
Signed-off-by: jiant <107457950+JadoTu@users.noreply.github.com>
1 parent 959306c
1 file changed: 1 addition & 1 deletion (+1, -1)
- tensorrt_llm/_torch/modules/fla
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 177 | 177 | |
| 178 | 178 | |
| 179 | 179 | |
| 180 | | - |
| | 180 | + |
| 181 | 181 | |
| 182 | 182 | |
| 183 | 183 | |