Commit 9b6c2be
authored
File tree
1,921 files changed
+310497
-50645
lines changed
- .buildkite
- lm-eval-harness
- configs
- nightly-benchmarks
- scripts
- tests
- .github
- ISSUE_TEMPLATE
- scripts
- workflows
- matchers
- scripts
- benchmarks
- cutlass_benchmarks
- disagg_benchmarks
- fused_kernels
- kernels
- overheads
- structured_schemas
- cmake
- csrc
- attention
- core
- cpu
- cutlass_extensions
- epilogue
- gemm
- collective
- mamba
- causal_conv1d
- mamba_ssm
- moe
- marlin_kernels
- prepare_inputs
- punica
- bgmv
- quantization
- aqlm
- awq
- compressed_tensors
- cutlass_w8a8
- c3x
- fp4
- fp8
- amd
- nvidia
- fused_kernels
- gguf
- gptq_marlin
- machete
- marlin
- dense
- common
- qqq
- sparse
- common
- squeezellm
- rocm
- sparse/cutlass
- docs
- source
- _static
- _templates/sections
- api
- engine
- model
- multimodal
- offline_inference
- assets
- contributing
- deployment
- design
- arch_overview
- v1/prefix_caching
- features/disagg_prefill
- logos
- automatic_prefix_caching
- community
- contributing
- dockerfile
- model
- profiling
- deployment
- frameworks
- integrations
- design
- kernel
- v1
- dev
- dockerfile
- engine
- input_processing
- kernel
- multimodal
- offline_inference
- features
- quantization
- getting_started
- examples
- installation
- ai_accelerator
- cpu
- gpu
- models
- extensions
- performance
- quantization
- serving
- integrations
- examples
- fp8
- quantizer
- offline_inference
- basic
- openai
- profiling_tpu
- online_serving
- chart-helm
- templates
- opentelemetry
- prometheus_grafana
- other
- production_monitoring
- rocm_patch
- tests
- async_engine
- basic_correctness
- compile
- piecewise
- core
- block
- e2e
- data
- distributed
- encoder_decoder
- engine
- output_processor
- entrypoints
- llm
- offline_mode
- openai
- correctness
- reasoning_parsers
- tool_parsers
- fp8_kv
- llama2-70b-fp8-kv
- llama2-7b-fp8-kv
- kernels
- kv_transfer
- lora
- data
- metrics
- mistral_tool_use
- model_executor
- models
- decoder_only
- audio_language
- language
- vision_language
- vlm_utils
- embedding
- language
- vision_language
- encoder_decoder
- audio_language
- language
- vision_language
- fixtures
- multimodal
- processing
- mq_llm_engine
- multi_step
- multimodal
- neuron
- plugins_tests
- plugins
- vllm_add_dummy_model
- vllm_add_dummy_model
- vllm_add_dummy_platform
- vllm_add_dummy_platform
- prefix_caching
- prompt_adapter
- quantization
- runai_model_streamer_test
- samplers
- spec_decode
- e2e
- standalone_tests
- system_messages
- tensorizer_loader
- tokenization
- tool_use
- tpu
- tracing
- v1
- core
- e2e
- engine
- entrypoints
- openai
- sample
- spec_decode
- worker
- vllm_test_utils
- vllm_test_utils
- weight_loading
- worker
- tools
- profiler
- vllm
- adapter_commons
- assets
- attention
- backends
- mla
- ops
- blocksparse_attention
- compilation
- core
- block
- device_allocator
- distributed
- device_communicators
- kv_transfer
- kv_connector
- kv_lookup_buffer
- kv_pipe
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- openai
- reasoning_parsers
- tool_parsers
- executor
- inputs
- logging_utils
- logging
- lora
- ops
- torch_ops
- triton_ops
- punica_wrapper
- model_executor
- guided_decoding
- layers
- fused_moe
- configs
- mamba
- ops
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- configs
- model_loader
- models
- multimodal
- platforms
- plugins
- profiler
- prompt_adapter
- spec_decode
- third_party
- transformers_utils
- configs
- processors
- tokenizer_group
- tokenizers
- triton_utils
- usage
- v1
- attention
- backends
- core
- engine
- executor
- metrics
- sample
- ops
- spec_decode
- stats
- worker
- vllm_flash_attn
- worker
This file was deleted.
Lines changed: 12 additions & 0 deletions
Lines changed: 11 additions & 0 deletions
Lines changed: 11 additions & 0 deletions
Lines changed: 11 additions & 0 deletions
Lines changed: 11 additions & 0 deletions
Lines changed: 4 additions & 4 deletions
Lines changed: 11 additions & 0 deletions