Commit 45c9ce4
Merge branch 'main' into patryk/offline-eplb
Signed-off-by: PatrykSaffer <[email protected]>
1,384 files changed, +83742 -30063 lines changed
File tree
- .buildkite
- lm-eval-harness
- configs
- nightly-benchmarks
- scripts
- tests
- performance-benchmarks
- scripts
- tests
- scripts
- hardware_ci
- scheduled_integration_test
- .github
- benchmarks
- cutlass_benchmarks
- kernels
- multi_turn
- overheads
- cmake
- external_projects
- csrc
- attention
- cpu
- mamba/mamba_ssm
- moe
- quantization
- fp4
- fused_kernels
- gptq_marlin
- gptq
- w8a8
- cutlass/c3x
- int8
- docker
- docs
- assets
- contributing
- design/debug_vllm_compile
- features/disagg_encoder
- cli
- bench/sweep
- community
- configuration
- contributing
- ci
- model
- deployment
- frameworks
- design
- features
- quantization
- getting_started
- installation
- mkdocs/hooks
- models
- extensions
- hardware_supported_models
- serving
- training
- usage
- examples
- offline_inference
- basic
- logits_processor
- pooling
- profiling_tpu
- qwen2_5_omni
- online_serving
- chart-helm
- templates
- tests
- dashboards/perses
- disaggregated_encoder
- pooling
- prometheus_grafana
- structured_outputs
- others
- lmcache
- requirements
- tests
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- distributed
- engine
- entrypoints
- llm
- openai
- tool_parsers
- pooling
- llm
- openai
- sagemaker
- evals/gsm8k
- configs
- kernels
- attention
- core
- mamba
- moe
- modular_kernel_tools
- quantization
- lora
- model_executor
- model_loader
- fastsafetensors_loader
- runai_model_streamer
- tensorizer_loader
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- plugins_tests
- plugins
- prithvi_io_processor_plugin/prithvi_io_processor
- vllm_add_dummy_platform/vllm_add_dummy_platform
- vllm_add_dummy_stat_logger
- dummy_stat_logger
- quantization
- reasoning
- samplers
- standalone_tests
- tokenization
- tool_use
- tools
- transformers_utils
- utils_
- v1
- attention
- core
- cudagraph
- distributed
- e2e
- ec_connector
- integration
- unit
- engine
- entrypoints
- llm
- openai
- serving_responses
- executor
- generation
- kv_connector
- nixl_integration
- unit
- kv_offload
- logits_processors
- metrics
- sample
- shutdown
- spec_decode
- structured_output
- tpu/worker
- worker
- tools
- ep_kernels
- elastic_ep
- pre_commit
- profiler
- vllm-tpu
- vllm
- assets
- attention
- backends
- layers
- ops
- utils
- benchmarks
- lib
- sweep
- compilation
- config
- device_allocator
- distributed
- device_communicators
- ec_transfer
- ec_connector
- eplb
- kv_transfer
- kv_connector
- v1
- lmcache_integration
- p2p
- kv_pipe
- engine
- entrypoints
- anthropic
- cli
- benchmark
- openai
- tool_parsers
- sagemaker
- executor
- inputs
- lora
- layers
- ops/triton_ops
- punica_wrapper
- model_executor
- layers
- fla/ops
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- rotary_embedding
- model_loader
- models
- transformers
- warmup
- multimodal
- platforms
- plugins/io_processors
- profiler
- reasoning
- transformers_utils
- chat_templates
- configs
- processors
- tokenizers
- usage
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- kv_offload
- worker
- metrics
- pool
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
Lines changed: 0 additions & 12 deletions
This file was deleted.
Lines changed: 14 additions & 0 deletions
This file was deleted.
Lines changed: 1 addition & 0 deletions
Lines changed: 11 additions & 3 deletions
This file was deleted.
This file was deleted.
This file was deleted.