File tree
873 files changed (+48394, -26643 lines changed)
- .buildkite
- lm-eval-harness
- configs
- nightly-benchmarks
- scripts
- tests
- scripts
- hardware_ci
- tpu
- .github
- scripts
- workflows
- benchmarks
- disagg_benchmarks
- kernels
- kv_cache
- multi_turn
- cmake
- external_projects
- csrc
- attention/mla
- core
- cpu
- moe
- marlin_moe_wna16
- prepare_inputs
- quantization
- aqlm
- cutlass_w4a8
- cutlass_w8a8
- moe
- gptq_marlin
- machete
- marlin
- dense
- common
- qqq
- docker
- docs
- api
- cli
- bench
- community
- configuration
- contributing
- model
- deployment/frameworks
- design
- examples
- features
- quantization
- getting_started
- installation
- cpu
- mkdocs
- hooks
- stylesheets
- models
- extensions
- usage
- examples
- offline_inference
- basic
- online_serving/openai_embedding_long_text
- others/lmcache/disagg_prefill_lmcache_v1
- requirements
- tests
- async_engine
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- core
- block/e2e
- detokenizer
- distributed
- engine
- entrypoints
- llm
- offline_mode
- openai
- correctness
- tool_parsers
- evals/gsm8k
- configs
- kernels
- attention
- core
- mamba
- moe
- modular_kernel_tools
- quantization
- lora
- metrics
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- processing
- quantization
- mq_llm_engine
- multi_step
- multimodal
- plugins/vllm_add_dummy_platform/vllm_add_dummy_platform
- prefix_caching
- quantization
- samplers
- speculative_decoding/speculators
- standalone_tests
- tensorizer_loader
- tool_use
- tpu/lora
- utils_
- v1
- attention
- core
- cudagraph
- e2e
- engine
- entrypoints
- llm
- openai
- responses
- executor
- kv_connector/unit
- logits_processors
- sample
- spec_decode
- tpu
- worker
- worker
- weight_loading
- worker
- tools
- ep_kernels
- profiler/nsys_profile_tools
- images
- vllm
- attention
- backends
- mla
- layers
- ops
- benchmarks
- lib
- compilation
- config
- core
- device_allocator
- distributed
- device_communicators
- eplb
- kv_transfer/kv_connector
- v1
- p2p
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- openai
- tool_parsers
- executor
- inputs
- lora
- model_executor
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- utils
- configs
- rotary_embedding
- model_loader
- models
- warmup
- multimodal
- platforms
- reasoning
- transformers_utils
- configs
- processors
- tokenizers
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- pool
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker
Lines changed: 21 additions & 2 deletions
Diff content not captured. Removed: original lines 11 and 26. Added: new lines 11-12, 25-37, and 40-45.
Lines changed: 0 additions & 12 deletions
This file was deleted.
Lines changed: 0 additions & 1 deletion
Diff content not captured. Removed: original line 6.
Lines changed: 1 addition & 1 deletion
Diff content not captured. Original line 5 replaced by new line 5.
Lines changed: 1 addition & 1 deletion
Diff content not captured. Original line 6 replaced by new line 6.
Lines changed: 12 additions & 20 deletions
Diff content not captured. Removed: original lines 10, 141, 143-153, and 156-162. Added: new lines 10, 141-142, 144, and 147-154.
Lines changed: 1 addition & 1 deletion
Diff content not captured. Original line 20 replaced by new line 20.