Commit 098b9ff
authored
[NVIDIA#9147][feat] AutoDeploy: Draft Target Speculative Decoding (NVIDIA#9275)
Signed-off-by: Govind Ramnarayan <[email protected]>1 parent a1964bc commit 098b9ff
File tree
9 files changed
+750
-197
lines changed- tensorrt_llm
- _torch
- auto_deploy
- shim
- pyexecutor
- llmapi
- tests
- integration
- defs/examples
- test_lists/test-db
- unittest/_torch/auto_deploy/unit/singlegpu
- shim
9 files changed
+750
-197
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
185 | 185 | | |
186 | 186 | | |
187 | 187 | | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
188 | 193 | | |
189 | 194 | | |
190 | 195 | | |
| |||
0 commit comments