Skip to content

feat: add Kimi-K2.5 (moonshotai/Kimi-K2.5) model support in HYBRID mode#403

Draft
jasonqinzhou wants to merge 7 commits intomainfrom
jasonzho/kimi-k2.5
Draft

feat: add Kimi-K2.5 (moonshotai/Kimi-K2.5) model support in HYBRID mode#403
jasonqinzhou wants to merge 7 commits intomainfrom
jasonzho/kimi-k2.5

Conversation

@jasonqinzhou
Copy link
Contributor

@jasonqinzhou jasonqinzhou commented Feb 19, 2026

  • Add model config for Kimi-K2.5 (MLA-based MoE, 61 layers, 384 routed experts, 64 attention heads, 262k context)
  • Register KimiK25ForConditionalGeneration architecture under the DEEPSEEK model family and add moonshotai/Kimi-K2.5 to DefaultHFModels
  • Fix _parse_hf_config_json to fall back to top-level config when model params are nested under "text_config" (required for VLM-style HF configs like Kimi-K2.5)
  • Extend MLA collector test cases and TRT-LLM collect_mla n_list to cover num_heads=64 (Kimi-K2.5) in addition to the existing 128 (DeepSeek-V3)

Fix DeepSeekModel / TrtllmWideEPDeepSeekModel hardcoded 128-head ops:
DeepSeekModel and TrtllmWideEPDeepSeekModel hardcoded DeepSeek-V3's
128 attention heads in several MLA GEMM / attention ops, making them
produce incorrect weight-size and latency estimates for any DEEPSEEK
model with a different head count (e.g. Kimi-K2.5 with 64 heads).
Replace every affected hardcode with self._num_heads:
- context/generation q_b_proj_gemm n = num_heads * 192 // tp
- context kv_b_proj_gemm n = num_heads * 256 // tp
- context/generation_attention n_heads = num_heads // tp
- context_proj_gemm k = num_heads * 128 // tp

Fix nextn (MTP) auto-assigned to all DEEPSEEK models (task.py):
nextn was unconditionally set to 1 for every DEEPSEEK model, adding a
spurious (nextn+1) activation-memory multiplier and incorrect MTP
latency scaling for models without Multi-Token Prediction support.
Now reads num_nextn_predict_layers from the raw model config (default 0),
so DeepSeek-V3/V3.1 still get nextn=1 while Kimi-K2.5 gets nextn=0.

Fix IndexError in get_worker_candidates() when all configs OOM (inference_session.py):
Same exceptions[-1]-on-empty-list crash fixed in agg_pareto() by #378
now also fixed in DisaggInferenceSession.get_worker_candidates().

Fix disagg per-worker GPU search space not scaling with --total-gpus (task.py):
_finalize_disagg used total_gpus only to cap max_gpu_per_replica (replica
scaling), but never updated num_gpu_per_worker / tp_list / dp_list /
moe_ep_list in the prefill and decode worker configs. Those lists were
hardcoded to [1,2,4,8], so large MoE models like Kimi-K2.5 (needing
EP=32+ to avoid OOM) were never explored regardless of --total-gpus.
_finalize_disagg now extends each non-singleton parallel list with
powers-of-2 up to total_gpus so that configurations like EP=32/64/128
are included in the sweep when sufficient GPUs are available.

Overview:

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

Example:

********************************************************************************
*                     Dynamo aiconfigurator Final Results                      *
********************************************************************************
  ----------------------------------------------------------------------------
  Input Configuration & SLA Target:
    Model: moonshotai/Kimi-K2.5 (is_moe: True)
    Total GPUs: 128
    Best Experiment Chosen: agg at 525.51 tokens/s/gpu (disagg 0.65x better)
  ----------------------------------------------------------------------------
  Overall Best Configuration:
    - Best Throughput: 67,265.92 tokens/s
    - Per-GPU Throughput: 525.51 tokens/s/gpu
    - Per-User Throughput: 34.46 tokens/s/user
    - TTFT: 1112.75ms
    - TPOT: 29.02ms
    - Request Latency: 30103.02ms
  ----------------------------------------------------------------------------
  Pareto Frontier:
                                                                                
      ┌────────────────────────────────────────────────────────────────────────┐
2950.0┤ •• agg                                                                 │
      │ ff disagg                                                              │
      │ xx agg best                                                            │
2458.3┤           •••                                                          │
      │              ••••                                                      │
      │                  ••••                                                  │
1966.7┤                      •••••                                             │
      │                           ••                                           │
1475.0┤                             •••••• fffff                               │
      │                                   ••••  ff                             │
      │                                       ••••fff                          │
 983.3┤                                             •fff                       │
      │                                                •fff••                  │
      │                                                    fff••••             │
 491.7┤                                                       fff ••x••        │
      │                                                          fff   •       │
      │                                                                        │
   0.0┤                                                                        │
      └┬─────────────────┬─────────────────┬────────────────┬─────────────────┬┘
       0                10                20               30                40 
tokens/s/gpu_cluster                 tokens/s/user                              

  ----------------------------------------------------------------------------
  Deployment Details:
    (p) stands for prefill, (d) stands for decode, bs stands for batch size, a replica stands for the smallest scalable unit xPyD of the disagg system
    Some math: total gpus used = replicas * gpus/replica
               gpus/replica = (p)gpus/worker * (p)workers + (d)gpus/worker * (d)workers; for Agg, gpus/replica = gpus/worker
               gpus/worker = tp * pp * dp = etp * ep * pp for MoE models; tp * pp for dense models (underlined numbers are the actual values in math)

agg Top Configurations: (Sorted by tokens/s/gpu)
+------+---------+--------------+---------------+---------+-----------------+----------------+-------------------+----------+--------------+--------------+--------------------+----+
| Rank | backend | tokens/s/gpu | tokens/s/user |   TTFT  | request_latency |  concurrency   | total_gpus (used) | replicas | gpus/replica | gpus/worker  |      parallel      | bs |
+------+---------+--------------+---------------+---------+-----------------+----------------+-------------------+----------+--------------+--------------+--------------------+----+
|  1   |  trtllm |    525.51    |     34.46     | 1112.75 |     30103.02    | 2048 (=256x8)  |   128 (128=8x16)  |    8     |      16      | 16 (=1x1x16) | tp1pp1dp16etp1ep16 | 16 |
|  2   |  trtllm |    525.51    |     34.46     | 1112.75 |     30103.02    | 2048 (=512x4)  |   128 (128=4x32)  |    4     |      32      | 32 (=1x1x32) | tp1pp1dp32etp1ep32 | 16 |
|  3   |  trtllm |    525.51    |     34.46     | 1112.75 |     30103.02    | 2048 (=1024x2) |   128 (128=2x64)  |    2     |      64      | 64 (=1x1x64) | tp1pp1dp64etp1ep64 | 16 |
+------+---------+--------------+---------------+---------+-----------------+----------------+-------------------+----------+--------------+--------------+--------------------+----+

disagg Top Configurations: (Sorted by tokens/s/gpu)
+------+---------+--------------+---------------+--------+-----------------+----------------+-------------------+----------+-----------------+------------+----------------+--------------------+-------+------------+----------------+--------------------+-------+
| Rank | backend | tokens/s/gpu | tokens/s/user |  TTFT  | request_latency |  concurrency   | total_gpus (used) | replicas |   gpus/replica  | (p)workers | (p)gpus/worker |    (p)parallel     | (p)bs | (d)workers | (d)gpus/worker |    (d)parallel     | (d)bs |
+------+---------+--------------+---------------+--------+-----------------+----------------+-------------------+----------+-----------------+------------+----------------+--------------------+-------+------------+----------------+--------------------+-------+
|  1   |  trtllm |    339.83    |     33.55     | 514.14 |     30295.33    | 1408 (=1408x1) |   128 (80=1x80)   |    1     | 80 (=1x16+4x16) |     1      |  16 (=1x1x16)  | tp1pp1dp16etp1ep16 |   1   |     4      |  16 (=1x1x16)  | tp1pp1dp16etp1ep16 |   22  |
|  2   |  trtllm |    339.81    |     33.55     | 514.14 |     30295.33    | 1408 (=1408x1) |   128 (80=1x80)   |    1     | 80 (=1x16+2x32) |     1      |  16 (=1x1x16)  | tp1pp1dp16etp1ep16 |   1   |     2      |  32 (=1x1x32)  | tp1pp1dp32etp1ep32 |   22  |
|  3   |  trtllm |    339.81    |     33.55     | 514.14 |     30295.33    | 1408 (=1408x1) |   128 (80=1x80)   |    1     | 80 (=1x16+1x64) |     1      |  16 (=1x1x16)  | tp1pp1dp16etp1ep16 |   1   |     1      |  64 (=1x1x64)  | tp1pp1dp64etp1ep64 |   22  |
+------+---------+--------------+---------------+--------+-----------------+----------------+-------------------+----------+-----------------+------------+----------------+--------------------+-------+------------+----------------+--------------------+-------+
********************************************************************************
2026-02-20 10:00:02,921 - aiconfigurator.cli.main - INFO - All experiments completed in 6.36 seconds
(aic_venv) jasonzho@NV-25010035:~/repo/repo5/aiconfigurator$ aiconfigurator cli default --model-path moonshotai/Kimi-K2.5 --total-gpus 128 --system gb200 --database-mode HYBRID --enable-wideep

Summary by CodeRabbit

  • New Features

    • Added --enable-wideep CLI flag for enhanced model configuration support.
    • Added support for three new Kimi model variants: K2-Instruct, K2-Thinking, and K2.5.
  • Improvements

    • Enhanced multimodal model support (text + image configurations).
    • Expanded MLA test case coverage with additional parameter configurations.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Feb 19, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-actions github-actions bot added the feat label Feb 19, 2026
@github-actions
Copy link
Contributor

filepath function $$\textcolor{#23d18b}{\tt{passed}}$$ $$\textcolor{#f14c4c}{\tt{failed}}$$ SUBTOTAL
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_default\_mode\_with\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{3}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{3}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_exp\_mode\_required\_args}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_database\_mode\_values\_parse\_successfully}}$$ $$\textcolor{#23d18b}{\tt{4}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{4}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_mode\_choices}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_generate\_mode\_required\_args}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_exp\_mode\_with\_database\_mode\_in\_yaml}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_generate\_mode\_defaults}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_database\_mode\_invalid\_value\_raises}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_generate\_mode\_model\_path}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_database\_mode\_choices\_validation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestBuildDefaultTaskConfigs.test\_skips\_disagg\_when\_total\_gpus\_less\_than\_2}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_backend\_choices\_validation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestBuildDefaultTaskConfigs.test\_includes\_disagg\_when\_total\_gpus\_at\_least\_2}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_supported\_systems\_parse\_successfully}}$$ $$\textcolor{#23d18b}{\tt{3}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{3}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_default\_values\_are\_set}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_debug\_mode\_flag}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_save\_directory\_argument}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_optional\_parameters}}$$ $$\textcolor{#23d18b}{\tt{6}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{6}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_generate\_combinations.py}}$$ $$\textcolor{#23d18b}{\tt{test\_cli\_generate\_combinations}}$$ $$\textcolor{#23d18b}{\tt{18}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{18}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_decode\_system\_defaults\_to\_system}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_model\_path\_accepts\_huggingface\_id}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeExperimentResultsByMode.test\_merge\_with\_top\_n\_limiting}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeExperimentResultsByMode.test\_merge\_with\_mixed\_agg\_disagg}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeExperimentResultsByMode.test\_merge\_preserves\_backend\_information}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeExperimentResultsByMode.test\_merge\_with\_empty\_experiments}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/generator/test\_aggregators.py}}$$ $$\textcolor{#23d18b}{\tt{TestK8sHfHomeDefaulting.test\_k8s\_hf\_home\_behavior}}$$ $$\textcolor{#23d18b}{\tt{5}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{5}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/generator/test\_aggregators.py}}$$ $$\textcolor{#23d18b}{\tt{TestK8sHfHomeDefaulting.test\_k8s\_hf\_home\_consistent\_across\_backends}}$$ $$\textcolor{#23d18b}{\tt{3}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{3}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_gemm\_empirical\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_custom\_allreduce\_database\_mode\_calculation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_custom\_allreduce\_sol\_full\_returns\_full\_tuple}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_custom\_allreduce\_non\_database\_mode\_uses\_custom\_latency}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_nccl\_database\_mode\_all\_gather}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_nccl\_database\_mode\_alltoall\_and\_reduce\_scatter}}$$ $$\textcolor{#23d18b}{\tt{2}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{2}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_context\_attention\_hybrid\_fallback}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_p2p\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_system\_spec\_was\_loaded\_correctly}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_database\_with\_yaml\_and\_data\_path}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_all\_databases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_database\_uses\_default\_systems\_paths}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_database\_conflict\_returns\_first}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_all\_databases\_system\_config\_conflict}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_all\_databases\_conflicting\_backend\_version\_keeps\_first}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_custom\_allreduce\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_custom\_allreduce\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_nccl\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_nccl\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_gemm\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_gemm\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_moe\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_moe\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_context\_attention\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_context\_attention\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_generation\_attention\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_generation\_attention\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_context\_mla\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_context\_mla\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_generation\_mla\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_generation\_mla\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_mla\_bmm\_data\_nonexistent}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_mla\_bmm\_data\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_data\_loaders.py}}$$ $$\textcolor{#23d18b}{\tt{test\_load\_wideep\_moe\_compute\_data}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_supported\_databases\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_supported\_databases\_empty\_dir}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_supported\_databases\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_supported\_databases\_multiple\_paths}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_build\_no\_databases\_message\_with\_missing\_path}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_build\_no\_databases\_message\_with\_existing\_empty\_path}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_set\_systems\_paths\_invalid\_entry\_raises}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_latest\_database\_version\_prefers\_stable\_over\_rc}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_latest\_database\_version\_rc\_only}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_latest\_database\_version\_nonexistent\_returns\_none}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_latest\_database\_version\_unparseable\_versions}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_database\_helpers.py}}$$ $$\textcolor{#23d18b}{\tt{test\_get\_latest\_database\_version\_major\_version\_rc\_is\_newer}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_backend\_any.py}}$$ $$\textcolor{#23d18b}{\tt{TestBackendAny.test\_build\_default\_task\_configs\_single\_backend}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestNcclEdgeCases.test\_query\_nccl\_silicon\_single\_gpu}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestNcclEdgeCases.test\_query\_nccl\_silicon\_interpolation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_non\_database\_mode\_mha}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_backend\_any.py}}$$ $$\textcolor{#23d18b}{\tt{TestBackendAny.test\_build\_default\_task\_configs\_any\_backend}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_backend\_any.py}}$$ $$\textcolor{#23d18b}{\tt{TestBackendAny.test\_build\_default\_task\_configs\_any\_backend\_parameters}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_api.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIExpUnit.test\_cli\_exp\_dict\_config\_equivalent\_to\_example\_yaml}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_api.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIGenerateEquivalence.test\_cli\_generate\_api\_vs\_command}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestNcclEdgeCases.test\_query\_nccl\_silicon\_large\_gpu\_count}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_non\_database\_mode\_xqa}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_api.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLISupportEquivalence.test\_cli\_support\_api\_vs\_command}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_main\_success\_flow}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_main\_success\_flow\_exp\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_main\_build\_dispatch}}$$ $$\textcolor{#23d18b}{\tt{2}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{2}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_main\_unsupported\_mode\_raises}}$$ $$\textcolor{#23d18b}{\tt{2}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{2}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_cli\_workflow.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIIntegration.test\_cli\_main\_runtime\_failure}}$$ $$\textcolor{#23d18b}{\tt{2}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{2}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_with\_tpot\_constraint}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_with\_request\_latency\_constraint}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_with\_empty\_pareto\_df}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_with\_none\_pareto\_df}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_disagg\_mode\_uses\_correct\_group\_by}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_top\_n\_limiting}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestProcessExperimentResult.test\_process\_result\_computes\_tokens\_per\_gpu\_cluster}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeIntoTopN.test\_merge\_multiple\_backends}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeIntoTopN.test\_merge\_with\_top\_n\_limiting}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeIntoTopN.test\_merge\_with\_empty\_dataframes}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeIntoTopN.test\_merge\_with\_missing\_pareto\_fronts}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeIntoTopN.test\_merge\_recomputes\_pareto\_frontier}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_utils.py}}$$ $$\textcolor{#23d18b}{\tt{TestMergeExperimentResultsByMode.test\_merge\_six\_backends\_into\_two\_modes}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestDatabaseCache.test\_get\_database\_caching}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestDatabaseCache.test\_get\_database\_no\_data\_path}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestNcclEdgeCases.test\_query\_nccl\_edge\_message\_sizes}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_non\_sol\_mode\_small\_s}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedQuantModes.test\_supported\_quant\_modes\_structure}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestAllreduceEdgeCases.test\_query\_custom\_allreduce\_single\_gpu}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextAttention.test\_query\_context\_attention\_assertion\_error}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedQuantModes.test\_supported\_quant\_modes\_values}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestAllreduceEdgeCases.test\_query\_custom\_allreduce\_large\_tp\_scaling}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationAttention.test\_query\_generation\_attention\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_fp8\_static.py}}$$ $$\textcolor{#23d18b}{\tt{test\_trtllm\_supported\_quant\_modes\_include\_fp8\_static}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestAllreduceEdgeCases.test\_query\_custom\_allreduce\_extrapolation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestInitializationEdgeCases.test\_extrapolation\_during\_init}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationAttention.test\_query\_generation\_attention\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_fp8\_static.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_gemm\_fp8\_static\_reuses\_fp8\_table}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestGemmInterpolation.test\_query\_gemm\_interpolation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationAttention.test\_query\_generation\_attention\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_fp8\_static.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_compute\_scale\_fp8\_static\_reuses\_fp8\_table}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_edge\_cases.py}}$$ $$\textcolor{#23d18b}{\tt{TestGemmInterpolation.test\_query\_gemm\_extrapolation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationAttention.test\_query\_generation\_attention\_non\_database\_mode\_mha}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_fp8\_static.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_scale\_matrix\_fp8\_static\_reuses\_fp8\_table}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_fp8\_static.py}}$$ $$\textcolor{#23d18b}{\tt{test\_gemm\_query\_subtracts\_overheads\_only\_for\_fp8\_static\_and\_qwen\_proj\_fc2}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/e2e/cli/test\_cli\_build\_default.py}}$$ $$\textcolor{#23d18b}{\tt{test\_cli\_default\_build\_subset}}$$ $$\textcolor{#23d18b}{\tt{6}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{6}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_nearest\_1d\_point\_helper\_outer}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationAttention.test\_query\_generation\_attention\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_nearest\_1d\_point\_helper\_inner}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_nearest\_1d\_point\_helper\_errors}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextMLA.test\_query\_context\_mla\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMoE.test\_query\_moe\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_validate}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextMLA.test\_query\_context\_mla\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMoE.test\_query\_moe\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestContextMLA.test\_query\_context\_mla\_different\_tp\_sizes}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_interp\_1d}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMoE.test\_query\_moe\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationMLA.test\_query\_generation\_mla\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_bilinear\_interpolation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMoE.test\_query\_moe\_different\_workload\_distributions}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationMLA.test\_query\_generation\_mla\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_interp\_3d\_linear}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMoE.test\_query\_moe\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{TestGenerationMLA.test\_query\_generation\_mla\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_interp\_2d\_1d}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_database\_mode\_pre}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_attention.py}}$$ $$\textcolor{#23d18b}{\tt{test\_default\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_base\_queries.py}}$$ $$\textcolor{#23d18b}{\tt{test\_query\_gemm\_exact\_match}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestInterpolationMethods.test\_interp\_3d}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_database\_mode\_post}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMemoryOperations.test\_query\_mem\_op\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestExtrapolateDataGrid.test\_extrapolate\_data\_grid\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestP2P.test\_query\_p2p\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestExtrapolateDataGrid.test\_extrapolate\_data\_grid\_with\_sqrt}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_non\_database\_mode\_pre}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestP2P.test\_query\_p2p\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestExtrapolateDataGrid.test\_extrapolate\_data\_grid\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_non\_database\_mode\_post}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestP2P.test\_query\_p2p\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestExtrapolateDataGrid.test\_extrapolate\_data\_grid\_boundary\_extension}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMLABMM.test\_query\_mla\_bmm\_different\_configs}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestP2P.test\_query\_p2p\_edge\_cases}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_initialization\_with\_default\_num\_slots}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_initialization\_with\_custom\_num\_slots}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_weight\_calculation\_gated}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_weight\_calculation\_non\_gated}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_basic}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_with\_attention\_dp\_scaling}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_with\_scale\_factor}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_with\_quant\_mode\_override}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_with\_custom\_num\_slots}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoE.test\_query\_debug\_logging}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_initialization\_pre\_dispatch}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_initialization\_post\_dispatch\_with\_options}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_get\_weights}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_pre\_dispatch}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_post\_dispatch\_standard}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_post\_dispatch\_low\_precision}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_with\_scale\_factor}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_with\_custom\_node\_num}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_debug\_logging\_pre\_dispatch}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_query\_debug\_logging\_post\_dispatch\_low\_precision}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_trtllm\_wideep\_moe\_dispatch.py}}$$ $$\textcolor{#23d18b}{\tt{TestTrtLLMWideEPMoEDispatch.test\_different\_quant\_modes}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedModels.test\_get\_default\_models\_function\_exists}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedModels.test\_specific\_models\_are\_in\_default\_list}}$$ $$\textcolor{#23d18b}{\tt{4}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{4}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedModels.test\_model\_configs\_have\_correct\_structure}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestSupportedModels.test\_model\_moe\_detection}}$$ $$\textcolor{#23d18b}{\tt{6}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{6}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestHFModelSupport.test\_default\_hf\_models\_exists}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestHFModelSupport.test\_hf\_models\_have\_valid\_architecture}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestHFModelSupport.test\_hf\_id\_resolves\_to\_correct\_model\_family}}$$ $$\textcolor{#23d18b}{\tt{6}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{6}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestHFModelSupport.test\_hf\_id\_moe\_detection}}$$ $$\textcolor{#23d18b}{\tt{6}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{6}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestBackendConfiguration.test\_backend\_enum\_exists}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestBackendConfiguration.test\_default\_backend\_is\_trtllm}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestQuantizationModes.test\_gemm\_quant\_modes\_exist}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestQuantizationModes.test\_attention\_quant\_modes\_exist}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestQuantizationModes.test\_moe\_quant\_modes\_exist}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/models/test\_model\_config.py}}$$ $$\textcolor{#23d18b}{\tt{TestMOEModelFP8BlockQuantizationValidation.test\_fp8\_block\_quantization\_validation}}$$ $$\textcolor{#23d18b}{\tt{5}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{5}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_agg\_default}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_default}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_profile\_application}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_fp8\_static\_requires\_trtllm\_backend}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_total\_gpus\_limits\_agg\_workers}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_agg\_yaml\_patch\_overrides}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_yaml\_file\_profiles\_and\_patch}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_profile\_patch\_expands\_replica}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_total\_gpus\_caps\_replica}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_total\_gpus\_with\_patch}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_wideep\_expands\_lists}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_decode\_system\_name\_override}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_agg\_total\_gpus\_negative\_rejected}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_disagg\_total\_gpus\_small\_rejected}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_yaml\_replace\_uses\_full\_config}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_rejects\_unsupported\_quant\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_quant\_merge\_uses\_model\_info\_when\_missing}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_quant\_merge\_preserves\_explicit\_values}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskconfig\_quant\_merge\_deepseek\_fmha\_fallback}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#23d18b}{\tt{test\_taskrunner\_runs\_agg\_and\_disagg}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestCorrectData.test\_correct\_gemm\_data}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMemoryOperations.test\_query\_mem\_op\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMemoryOperations.test\_query\_mem\_op\_sol\_full\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_moe\_mla.py}}$$ $$\textcolor{#23d18b}{\tt{TestMemoryOperations.test\_query\_mem\_op\_non\_database\_mode}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestCorrectData.test\_correct\_generation\_attention\_data}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/sdk/database/test\_interpolation.py}}$$ $$\textcolor{#23d18b}{\tt{TestUpdateSupportMatrix.test\_support\_matrix\_creation}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/e2e/cli/test\_cli\_experiments.py}}$$ $$\textcolor{#23d18b}{\tt{TestExps.test\_exps}}$$ $$\textcolor{#23d18b}{\tt{9}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{9}}$$
$$\textcolor{#23d18b}{\tt{tests/e2e/cli/test\_cli\_request\_latency.py}}$$ $$\textcolor{#23d18b}{\tt{TestRequestLatency.test\_request\_latency}}$$ $$\textcolor{#23d18b}{\tt{3}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{3}}$$
$$\textcolor{#23d18b}{\tt{tests/e2e/tools/test\_sanity\_check.py}}$$ $$\textcolor{#23d18b}{\tt{test\_validate\_database}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{tests/unit/cli/test\_argument\_parsing.py}}$$ $$\textcolor{#23d18b}{\tt{TestCLIArgumentParsing.test\_default\_mode\_required\_args}}$$ $$\textcolor{#23d18b}{\tt{1}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#f14c4c}{\tt{tests/unit/sdk/task/test\_task.py}}$$ $$\textcolor{#f14c4c}{\tt{test\_sglang\_moe\_configs}}$$ $$\textcolor{#666666}{\tt{0}}$$ $$\textcolor{#f14c4c}{\tt{1}}$$ $$\textcolor{#f14c4c}{\tt{1}}$$
$$\textcolor{#f14c4c}{\tt{TOTAL}}$$ $$\textcolor{#23d18b}{\tt{306}}$$ $$\textcolor{#f14c4c}{\tt{1}}$$ $$\textcolor{#f14c4c}{\tt{307}}$$

@jasonqinzhou jasonqinzhou marked this pull request as ready for review February 20, 2026 02:41
@jasonqinzhou jasonqinzhou changed the title feat: add Kimi-K2.5 (moonshotai/Kimi-K2.5) model support feat: add Kimi-K2.5 (moonshotai/Kimi-K2.5) model support in HYBRID mode Feb 20, 2026
dtype_list = [tensorrt_llm.bindings.DataType.BF16] # not support f8 for trt < v1.1
test_cases = []
n_list = [128]
n_list = [64, 128]
Copy link
Contributor

@tianhaox tianhaox Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the latency measurement is based on local num heads=n/tp. then you will have a lot of duplicate measurements. Remove 64 is a previous fix here. You don't need to modify this. i will suggest fixing that in sglang, to add tp 128 to collect_mla.py for sglang

dtype_list = [tensorrt_llm.bindings.DataType.BF16] # not support f8 for trt < v1.1
test_cases = []
n_list = [128]
n_list = [64, 128]
Copy link
Contributor

@tianhaox tianhaox Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same as above.

@@ -0,0 +1,26 @@
{
"architectures": ["KimiK25ForConditionalGeneration"],
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

how we handle the vision encoder?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here, no quantization, needs a full copy past

# num_heads, q_lora_rank, kv_lora_rank, qk_nope_head_dim, qk_rope_head_dim, v_head_dim
model_config_list = [
[128, 1536, 512, 128, 64, 128, "deepseek-ai/DeepSeek-V3"],
[64, 1536, 512, 128, 64, 128, "moonshotai/Kimi-K2.5"],
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think we don't need this. same for the previous explanation. 128/tp_list[1,2,4,...,128] naturally covers 64/tp_list[1,2,4,...,64]

jasonqinzhou and others added 6 commits February 27, 2026 13:38
- Add model config for Kimi-K2.5 (MLA-based MoE, 61 layers, 384 routed
  experts, 64 attention heads, 262k context)
- Register KimiK25ForConditionalGeneration architecture under the DEEPSEEK
  model family and add moonshotai/Kimi-K2.5 to DefaultHFModels
- Fix _parse_hf_config_json to fall back to top-level config when model
  params are nested under "text_config" (required for VLM-style HF configs
  like Kimi-K2.5)
- Extend MLA collector test cases and TRT-LLM collect_mla n_list to cover
  num_heads=64 (Kimi-K2.5) in addition to the existing 128 (DeepSeek-V3)

Fix DeepSeekModel / TrtllmWideEPDeepSeekModel hardcoded 128-head ops:
  DeepSeekModel and TrtllmWideEPDeepSeekModel hardcoded DeepSeek-V3's
  128 attention heads in several MLA GEMM / attention ops, making them
  produce incorrect weight-size and latency estimates for any DEEPSEEK
  model with a different head count (e.g. Kimi-K2.5 with 64 heads).
  Replace every affected hardcode with self._num_heads:
    - context/generation q_b_proj_gemm  n = num_heads * 192 // tp
    - context kv_b_proj_gemm            n = num_heads * 256 // tp
    - context/generation_attention      n_heads = num_heads // tp
    - context_proj_gemm                 k = num_heads * 128 // tp

Fix nextn (MTP) auto-assigned to all DEEPSEEK models (task.py):
  nextn was unconditionally set to 1 for every DEEPSEEK model, adding a
  spurious (nextn+1) activation-memory multiplier and incorrect MTP
  latency scaling for models without Multi-Token Prediction support.
  Now reads num_nextn_predict_layers from the raw model config (default 0),
  so DeepSeek-V3/V3.1 still get nextn=1 while Kimi-K2.5 gets nextn=0.

Fix IndexError in get_worker_candidates() when all configs OOM (inference_session.py):
  Same exceptions[-1]-on-empty-list crash fixed in agg_pareto() by #378
  now also fixed in DisaggInferenceSession.get_worker_candidates().

Fix disagg per-worker GPU search space not scaling with --total-gpus (task.py):
  _finalize_disagg used total_gpus only to cap max_gpu_per_replica (replica
  scaling), but never updated num_gpu_per_worker / tp_list / dp_list /
  moe_ep_list in the prefill and decode worker configs. Those lists were
  hardcoded to [1,2,4,8], so large MoE models like Kimi-K2.5 (needing
  EP=32+ to avoid OOM) were never explored regardless of --total-gpus.
  _finalize_disagg now extends each non-singleton parallel list with
  powers-of-2 up to total_gpus so that configurations like EP=32/64/128
  are included in the sweep when sufficient GPUs are available.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@coderabbitai
Copy link

coderabbitai bot commented Feb 27, 2026

Walkthrough

The PR adds support for three new Kimi model variants with corresponding JSON configurations, expands MLA test case generation parameters, introduces a new --enable-wideep CLI flag, refactors dimension calculations to use dynamic per-head values, and enhances configuration parsing for multimodal HuggingFace models.

Changes

Cohort / File(s) Summary
Kimi Model Support
src/aiconfigurator/model_configs/moonshotai--Kimi-K2-Instruct_config.json, src/aiconfigurator/model_configs/moonshotai--Kimi-K2-Thinking_config.json, src/aiconfigurator/model_configs/moonshotai--Kimi-K2.5_config.json, src/aiconfigurator/sdk/common.py
Added three new Kimi model configuration files (K2-Instruct, K2-Thinking, K2.5) and registered them in default HF models. Updated architecture-to-model-family mapping to classify KimiK25ForConditionalGeneration as DEEPSEEK.
MLA Test Case Expansion
collector/common_test_cases.py, collector/trtllm/collect_mla.py
Expanded MLA test case parameter space by adding n = 64 configurations alongside existing n = 128 entries, increasing test coverage for various model configurations.
CLI Enhancement
src/aiconfigurator/cli/main.py
Added new --enable-wideep CLI flag with boolean toggle (default false). Extended build_default_task_configs function signature to accept and propagate enable_wideep parameter. Included the flag in reserved keys for YAML/config merge logic.
Configuration Parsing & Task Management
src/aiconfigurator/sdk/utils.py, src/aiconfigurator/sdk/task.py
Enhanced _parse_hf_config_json to support multimodal HuggingFace models with fallback to top-level config when text_config is absent. Updated TaskConfigFactory._base_common_layer to derive nextn from model configuration rather than model family hardcoding.
Core Refactoring
src/aiconfigurator/sdk/models.py, src/aiconfigurator/sdk/inference_session.py
Replaced hard-coded GEMM and ContextMLA dimensions with dynamic per-head calculations using self._num_heads and self._head_size. Refactored exception handling in get_worker_candidates to store exception in variable before reuse instead of repeated list indexing.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

🐰 Three new Kimi models hop into view,
With dynamic dimensions that sum per-head true,
Test cases expand with a sixty-four cheer,
Configs now multimodal, far and near,
Wideep flags flutter on the command-line tier! ✨

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 25.00% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (2 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly and specifically describes adding Kimi-K2.5 model support in HYBRID mode, matching the primary objective of this PR.
Description check ✅ Passed The description provides comprehensive detail on all major changes: model config, architecture registration, config parsing fixes, test case extensions, hardcoded value replacements, nextn fixes, error handling, and GPU scaling improvements. However, the template structure is incomplete with unfilled placeholder sections.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

Tip

Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs).
Share your feedback on Discord.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (2)
src/aiconfigurator/sdk/task.py (2)

306-307: Logic correctly derives nextn from model config.

This change properly reads num_nextn_predict_layers directly from the model's configuration instead of inferring it from the model family. This is more accurate for models like Kimi-K2.5 that may share architecture characteristics but have different MTP settings.

Consider adding defensive error handling similar to what's done in validate() (lines 773-776) to provide graceful fallback if model config loading fails:

🛡️ Optional defensive handling
 `@staticmethod`
 def _base_common_layer(ctx: TaskContext) -> dict:
-    raw_config = get_model_config_from_model_path(ctx.model_path).get("raw_config", {})
+    try:
+        raw_config = get_model_config_from_model_path(ctx.model_path).get("raw_config", {})
+    except Exception:
+        logger.warning("Could not load model config for %s; defaulting nextn to 0", ctx.model_path)
+        raw_config = {}
     nextn = raw_config.get("num_nextn_predict_layers", 0)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/aiconfigurator/sdk/task.py` around lines 306 - 307, The code now reads
num_nextn_predict_layers via get_model_config_from_model_path(ctx.model_path)
into raw_config/nextn but lacks defensive error handling; wrap the model config
load in a try/except (or equivalent error check) around
get_model_config_from_model_path to catch failures, log or surface the error
consistently (similar to validate()'s handling), and fallback to a safe default
(e.g., 0) for nextn so downstream logic won’t crash if loading the model config
fails; reference get_model_config_from_model_path, ctx.model_path, raw_config,
nextn and mirror the validate() pattern for logging/fallback.

773-777: Consider caching model config to avoid duplicate loading.

get_model_config_from_model_path is called here in validate() and also earlier in _base_common_layer() (line 306). For remote HuggingFace models, this results in redundant network requests. Consider caching the result on the TaskConfig instance or passing it through the context to avoid the duplicate load.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/aiconfigurator/sdk/task.py` around lines 773 - 777, The call to
get_model_config_from_model_path is duplicated in validate() and
_base_common_layer(), causing redundant remote loads for HuggingFace models;
cache the result on the TaskConfig instance (e.g., add an attribute like
self._cached_model_config) or pass the fetched value through the shared context
so subsequent calls reuse it: update _base_common_layer() (or the earlier
loader) to set self._cached_model_config = model_info and change validate() to
check self._cached_model_config before calling get_model_config_from_model_path
again (falling back to fetching and caching if absent).
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@src/aiconfigurator/sdk/task.py`:
- Around line 306-307: The code now reads num_nextn_predict_layers via
get_model_config_from_model_path(ctx.model_path) into raw_config/nextn but lacks
defensive error handling; wrap the model config load in a try/except (or
equivalent error check) around get_model_config_from_model_path to catch
failures, log or surface the error consistently (similar to validate()'s
handling), and fallback to a safe default (e.g., 0) for nextn so downstream
logic won’t crash if loading the model config fails; reference
get_model_config_from_model_path, ctx.model_path, raw_config, nextn and mirror
the validate() pattern for logging/fallback.
- Around line 773-777: The call to get_model_config_from_model_path is
duplicated in validate() and _base_common_layer(), causing redundant remote
loads for HuggingFace models; cache the result on the TaskConfig instance (e.g.,
add an attribute like self._cached_model_config) or pass the fetched value
through the shared context so subsequent calls reuse it: update
_base_common_layer() (or the earlier loader) to set self._cached_model_config =
model_info and change validate() to check self._cached_model_config before
calling get_model_config_from_model_path again (falling back to fetching and
caching if absent).

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 90ad032 and 3270e64.

📒 Files selected for processing (11)
  • collector/common_test_cases.py
  • collector/trtllm/collect_mla.py
  • src/aiconfigurator/cli/main.py
  • src/aiconfigurator/model_configs/moonshotai--Kimi-K2-Instruct_config.json
  • src/aiconfigurator/model_configs/moonshotai--Kimi-K2-Thinking_config.json
  • src/aiconfigurator/model_configs/moonshotai--Kimi-K2.5_config.json
  • src/aiconfigurator/sdk/common.py
  • src/aiconfigurator/sdk/inference_session.py
  • src/aiconfigurator/sdk/models.py
  • src/aiconfigurator/sdk/task.py
  • src/aiconfigurator/sdk/utils.py

Model the ViT vision encoder (27-layer, 1152-dim), patch merger, and
projector as GEMM/ElementWise ops prepended to context_ops. Each vision
op carries _vision_num_tokens so the backend uses the correct token
count (4096 pre-merge, 1024 post-merge) instead of isl.

Also reverts collector changes per review comments.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@@ -0,0 +1,26 @@
{
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we need to do a full copy paste. this is missing quant field. https://huggingface.co/moonshotai/Kimi-K2-Instruct/blob/main/config.json

@@ -0,0 +1,26 @@
{
"architectures": ["DeepseekV3ForCausalLM"],
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks liek this model employs a 4bit quant,

https://huggingface.co/moonshotai/Kimi-K2-Thinking/blob/main/config.json
Are all these fields generated by claude? I think we need a full copy past to avoid misalignment.

help="Optional end-to-end request latency target (ms). Enables request-latency optimization mode.",
)
parser.add_argument("--prefix", type=int, default=0, help="Prefix cache length. Default to 0.")
parser.add_argument(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we remove wideep and design in a seperate PR?

@jasonqinzhou jasonqinzhou marked this pull request as draft March 12, 2026 00:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants