Commit cd578d6

[docs] update v0.13.0 docs
Signed-off-by: Hank <hcc.mayday@gmail.com>
1 parent 23dd592 commit cd578d6

File tree

5 files changed: +200 −72 lines

README.md

Lines changed: 18 additions & 12 deletions
@@ -17,7 +17,9 @@ vLLM MetaX Plugin
---

*Latest News* 🔥

- [2026/2] Released vllm-metax **v0.13.0** 🧨 — aligned with vLLM *v0.13.0*, bringing you the latest features and models in v0.13.0!
- [2026/1] Released vllm-metax **v0.12.0** 😎 — aligned with vLLM *v0.12.0*, supported more models and improved performance.
- [2026/1] Released vllm-metax **v0.11.2** 👻 — aligned with vLLM *v0.11.2*, supported more models and improved performance.
- [2025/11] Released vllm-metax **v0.10.2** 🎉 — aligned with vLLM *v0.10.2*, improved model performance, and fixed key decoding bugs.
- [2025/11] We hosted [vLLM Beijing Meetup](https://mp.weixin.qq.com/s/xSrYXjNgr1HbCP4ExYNG1w) focusing on distributed inference and diverse accelerator support with vLLM! Please find the meetup slides [here](https://drive.google.com/drive/folders/1nQJ8ZkLSjKxvu36sSHaceVXtttbLvvu-?usp=drive_link).
- [2025/08] We hosted [vLLM Shanghai Meetup](https://mp.weixin.qq.com/s/pDmAXHcN7Iqc8sUKgJgGtg) focusing on building, developing, and integrating with vLLM! Please find the meetup slides [here](https://drive.google.com/drive/folders/1OvLx39wnCGy_WKq8SiVKf7YcxxYI3WCH).
@@ -46,29 +48,33 @@ Which ensured the hardware features and functionality support on integration of
## Getting Started

vLLM MetaX currently only supports starting from the docker images released by the [MetaX develop community](https://developer.metax-tech.com/softnova/docker?chip_name=%E6%9B%A6%E4%BA%91C500%E7%B3%BB%E5%88%97&package_kind=AI&dimension=docker&deliver_type=%E5%88%86%E5%B1%82%E5%8C%85&ai_frame=vllm-metax), which work out of the box. (Dockerfiles for other OSes are under testing.)

If you want to develop, debug, or test the newest features of vllm-metax, you may need to build from scratch by following this [*source build tutorial*](https://vllm-metax.readthedocs.io/en/v0.13.0/getting_started/installation/maca.html).
## Branch
vllm-metax has three kinds of branches:

- **master**: main branch, catching up with the main branch of vLLM upstream.
- **releases/vX.Y.Z**: release branch, created when a new version of vLLM is released. For example, `releases/v0.13.0` is the release branch for vLLM `v0.13.0`. (A tag with the same name is also created.)
- **vX.Y.Z-dev**: development branch, created alongside new releases of vLLM. For example, `v0.14.0-dev` is the dev branch for vLLM `v0.14.0`.
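The naming scheme above is mechanical, so a small shell sketch (branch names taken from the examples above; nothing beyond the stated conventions is assumed) can derive what each branch tracks:

```shell
# Map a vllm-metax branch name to the vLLM version it tracks,
# following the releases/vX.Y.Z and vX.Y.Z-dev conventions above.
for branch in "releases/v0.13.0" "v0.14.0-dev" "master"; do
  case "$branch" in
    releases/v*) echo "$branch -> release branch for vLLM ${branch#releases/}" ;;
    v*-dev)      echo "$branch -> dev branch for vLLM ${branch%-dev}" ;;
    master)      echo "$branch -> tracks vLLM main" ;;
  esac
done
# -> releases/v0.13.0 -> release branch for vLLM v0.13.0
# -> v0.14.0-dev -> dev branch for vLLM v0.14.0
# -> master -> tracks vLLM main
```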

Below are the maintained branches:

| Branch | Status | Note |
|------------------|----------|-----------------------------------------------------------|
| master | N/A | tries to support vllm main; no guarantee of functionality |
| v0.15.0-dev | N/A | under testing |
| v0.14.0-dev | N/A | under testing |
| releases/v0.13.0 | Released | related to vllm release v0.13.0 |
| releases/v0.12.0 | Released | related to vllm release v0.12.0 |
| releases/v0.11.2 | Released | related to vllm release v0.11.2 |
| releases/v0.10.2 | Released | related to vllm release v0.10.2 |

Please check [here](https://vllm-metax.readthedocs.io/en/v0.13.0/getting_started/quickstart.html) for v0.13.0 details.

## License

Apache License 2.0, as found in the [LICENSE](./LICENSE) file.
Lines changed: 42 additions & 45 deletions
@@ -1,83 +1,80 @@
# Installation

!!! warning "Breaking Change Notice"
    After v0.11.2, vLLM-MetaX moved its `_C` and `_moe_C` kernels into a separate package named `mcoplib`.

    mcoplib is open-sourced at [MetaX-mcoplib](https://github.com/MetaX-MACA/mcoplib) and maintains its own release cycle. Each vllm-metax release relies on its corresponding version of mcoplib; check it on the [Release Page](../quickstart.md#releases).

    The *csrc* folder is still kept in this repo for development convenience, but there is no guarantee that the code is always in sync with mcoplib; both performance and correctness may differ from mcoplib.

    To build and use the vllm-metax csrc, you need to set:

    ```bash
    export USE_PRECOMPILED_KERNEL=0
    ```

    in both the *build* and *runtime* environments.

    **Please always use mcoplib for production usage.**

## Requirements

- OS: Linux
- Python: 3.10 -- 3.12

## Build from source

### Prepare environment

```bash
# set up the MACA path
export MACA_PATH="/opt/maca"

# cu-bridge
export CUCC_PATH="${MACA_PATH}/tools/cu-bridge"
export CUDA_PATH="${HOME}/cu-bridge/CUDA_DIR"
export CUCC_CMAKE_ENTRY=2

# update PATH
export PATH=${MACA_PATH}/mxgpu_llvm/bin:${MACA_PATH}/bin:${CUCC_PATH}/tools:${CUCC_PATH}/bin:${PATH}
export LD_LIBRARY_PATH=${MACA_PATH}/lib:${MACA_PATH}/ompi/lib:${MACA_PATH}/mxgpu_llvm/lib:${LD_LIBRARY_PATH}
```
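As a quick sanity check, you can confirm the toolchain directories actually landed on `PATH` after exporting the variables. This is a sketch; the directory layout under `/opt/maca` is taken from the exports above and is only re-created here so the snippet is self-contained:

```shell
# Re-create the exports from above, then verify PATH picked them up.
export MACA_PATH="/opt/maca"
export CUCC_PATH="${MACA_PATH}/tools/cu-bridge"
export PATH=${MACA_PATH}/mxgpu_llvm/bin:${MACA_PATH}/bin:${CUCC_PATH}/tools:${CUCC_PATH}/bin:${PATH}

for d in "${MACA_PATH}/bin" "${CUCC_PATH}/bin"; do
  case ":$PATH:" in
    *":$d:"*) echo "on PATH: $d" ;;
    *)        echo "MISSING from PATH: $d" ;;
  esac
done
# -> on PATH: /opt/maca/bin
# -> on PATH: /opt/maca/tools/cu-bridge/bin
```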

=== "PIP"
    --8<-- "docs/getting_started/installation/pip.inc.md:prepare-env"

=== "UV"
    --8<-- "docs/getting_started/installation/uv.inc.md:prepare-env"

### Build vllm

Clone the vllm project:

```bash
git clone --depth 1 --branch releases/v0.13.0 https://github.com/vllm-project/vllm
cd vllm
```

Build with *empty device*:

=== "PIP"
    --8<-- "docs/getting_started/installation/pip.inc.md:build-vllm"

=== "UV"
    --8<-- "docs/getting_started/installation/uv.inc.md:build-vllm"

### Build plugin

Clone the vllm-metax project:

```bash
git clone --branch releases/v0.13.0 https://github.com/MetaX-MACA/vLLM-metax
cd vLLM-metax
```

Build the plugin:

=== "PIP"
    --8<-- "docs/getting_started/installation/pip.inc.md:build-vllm-metax"

=== "UV"
    --8<-- "docs/getting_started/installation/uv.inc.md:build-vllm-metax"

## Extra information
Lines changed: 39 additions & 0 deletions
@@ -0,0 +1,39 @@
# --8<-- [start:prepare-env]
!!! note
    If using pip, all the build and installation steps are ***based on corresponding docker images***. You can find them on the [QuickStart page](../quickstart.md).
    We need to add the `--no-build-isolation` flag throughout package building, since all the requirements are already pre-installed in the released docker image.
# --8<-- [end:prepare-env]

# --8<-- [start:build-vllm-metax]
!!! note

    ```bash
    python use_existing_metax.py
    pip install -r requirements/build.txt
    pip install . --no-build-isolation
    ```

??? console "Additional installation options"
    If you want to develop vllm-metax, install it in **editable mode** instead.

    ```bash
    pip install -v -e . --no-build-isolation
    ```

    Optionally, build a portable wheel which you can then install elsewhere.

    ```bash
    python -m build -w -n
    pip install dist/*.whl
    ```
# --8<-- [end:build-vllm-metax]

# --8<-- [start:build-vllm]
!!! note "To build vllm-metax using an existing PyTorch installation"

    ```bash
    python use_existing_pytorch.py
    pip install -r requirements/build.txt
    VLLM_TARGET_DEVICE=empty pip install . --no-build-isolation
    ```
# --8<-- [end:build-vllm]
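The `VLLM_TARGET_DEVICE=empty` prefix above is an environment variable scoped to that single command; vLLM's build reads it and, with `empty`, skips compiling device kernels (the plugin supplies them separately, as we understand the setup). A minimal sketch of the scoping behavior, independent of vLLM itself:

```shell
# The VAR=value prefix sets the variable only for the one command that follows.
VLLM_TARGET_DEVICE=empty sh -c 'echo "target device: ${VLLM_TARGET_DEVICE:-unset}"'
echo "after the command: ${VLLM_TARGET_DEVICE:-unset}"
# -> target device: empty
# -> after the command: unset
```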
Lines changed: 77 additions & 0 deletions
@@ -0,0 +1,77 @@
# --8<-- [start:prepare-env]
!!! note

    UV **does not rely** on any pre-installed packages in the docker image, and installs all the dependencies in a virtual environment from scratch.

??? console "UV installation guide"
    We recommend installing uv with pip (this is not strictly required):

    ```bash
    pip install uv
    ```

    Then create the virtual environment with Python 3.10 or above:

    ```bash
    uv venv /opt/venv --python python3.10
    ```

    And activate the virtual environment:

    ```bash
    source /opt/venv/bin/activate
    ```

You need to manually set the MetaX PyPI repo to download maca-related dependencies:

```bash
export UV_EXTRA_INDEX_URL=https://repos.metax-tech.com/r/maca-pypi/simple
export UV_INDEX_STRATEGY=unsafe-best-match
```

??? console "Optional: Change the default PyPI mirror"
    You can set the Aliyun PyPI mirror as the default to speed up package downloading:

    ```bash
    export UV_INDEX_URL=https://mirrors.aliyun.com/pypi/simple
    ```
# --8<-- [end:prepare-env]

# --8<-- [start:build-vllm-metax]
!!! note

    ```bash
    uv pip install -r requirements/build.txt
    uv pip install .
    ```

??? console "Additional installation options"
    If you want to develop vllm-metax, install it in editable mode instead.

    ```bash
    uv pip install -v -e .
    ```

    Optionally, build a portable wheel which you can then install elsewhere.

    ```bash
    uv build --wheel
    ```
# --8<-- [end:build-vllm-metax]

# --8<-- [start:build-vllm]
!!! note "To build vLLM using a local uv environment"

    ```bash
    uv pip install -r requirements/build.txt
    VLLM_TARGET_DEVICE=empty uv pip install . --no-build-isolation
    ```

??? note "About isolation"
    `--no-build-isolation` is optional; we add it to speed up installation.
    uv would still try to download cuda-related packages during the build even if you set
    `VLLM_TARGET_DEVICE=empty`, which may take a long time.
# --8<-- [end:build-vllm]
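For context on the two `UV_*` exports in the prepare-env snippet: they are uv's environment-variable equivalents of the `--extra-index-url` and `--index-strategy` CLI flags. uv's default index strategy only consults the first index where a package name appears, so `unsafe-best-match` is needed here to let uv pick the best version across both the default PyPI index and the MetaX index:

```shell
# The UV_* variables are ordinary environment variables that uv reads;
# values mirror the ones from the prepare-env snippet above.
export UV_EXTRA_INDEX_URL=https://repos.metax-tech.com/r/maca-pypi/simple
export UV_INDEX_STRATEGY=unsafe-best-match
echo "extra index: ${UV_EXTRA_INDEX_URL}"
echo "strategy: ${UV_INDEX_STRATEGY}"
# -> extra index: https://repos.metax-tech.com/r/maca-pypi/simple
# -> strategy: unsafe-best-match
```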

docs/getting_started/quickstart.md

Lines changed: 24 additions & 15 deletions
@@ -2,26 +2,35 @@

Currently the recommended way to start ***vLLM-MetaX*** is via *docker*.

You can get the docker image at the [MetaX develop community](https://developer.metax-tech.com/softnova/docker?chip_name=%E6%9B%A6%E4%BA%91C500%E7%B3%BB%E5%88%97&package_kind=AI&dimension=docker&deliver_type=%E5%88%86%E5%B1%82%E5%8C%85&ai_frame=vllm-metax&arch=amd64&system=ubuntu).

!!! note
    After v0.11.2, vllm-metax moved its `_C` and `_moe_C` kernels into a separate package named `mcoplib`.

    **mcoplib** is open-sourced at [MetaX-mcoplib](https://github.com/MetaX-MACA/mcoplib) and maintains its own release cycle. Please always install the corresponding version of mcoplib when using vLLM-MetaX.

    The *csrc* folder is still kept in this repo for development convenience, but there is no guarantee that the code is always in sync with mcoplib; both performance and correctness may differ from mcoplib.

If you want to build the latest vllm-metax, please refer to [installation](./installation/maca.md) to build from source.

**Please always use mcoplib for production usage.**

## Releases

*Below is the version mapping between the released plugin, mcoplib, and maca*:

| plugin version | maca version | mcoplib version | docker distribution tag |
|:--------------:|:------------:|:---------------:|:-----------------------:|
| v0.8.5  | maca2.33.1.13 | N/A     | vllm:maca.ai2.33.1.13-torch2.6-py310-ubuntu22.04-amd64 |
| v0.9.1  | maca3.0.0.5   | N/A     | vllm:maca.ai3.0.0.5-torch2.6-py310-ubuntu22.04-amd64 |
| v0.10.2 | maca3.2.1.7   | N/A     | vllm-metax:0.10.2-maca.ai3.2.1.7-torch2.6-py310-ubuntu22.04-amd64 |
| v0.11.0 | maca3.3.0.15  | 0.1.1   | vllm-metax:0.11.0-maca.ai3.3.0.11-torch2.6-py312-ubuntu22.04-amd64 |
| v0.11.2 | maca3.3.0.15  | 0.2.0   | vllm-metax:0.11.2-maca.ai3.3.0.103-torch2.8-py312-ubuntu22.04-amd64 |
| v0.12.0 | maca3.3.0.15  | 0.3.0   | vllm-metax:0.12.0-maca.ai3.3.0.204-torch2.8-py312-ubuntu22.04-amd64 |
| v0.13.0 | maca3.3.0.15  | 0.3.1   | vllm-metax:0.13.0-maca.ai3.3.0.303-torch2.8-py312-ubuntu22.04-amd64 |
| master  | maca3.3.0.15  | >=0.3.0 | not released |

!!! warning "Usage Warning"
    **vLLM-MetaX works out of the box via these docker images.**

    All the vllm tests are based on the related maca version; using a mismatched maca version with vllm may cause unexpected bugs or errors, and is not guaranteed to work.
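The plugin-to-mcoplib pairing in the table can be looked up mechanically; a small shell sketch with the pairs copied from the table above:

```shell
# Look up the mcoplib version required by a vllm-metax release
# (pairs taken from the version mapping table above).
plugin="v0.13.0"
case "$plugin" in
  v0.11.0) mcoplib="0.1.1" ;;
  v0.11.2) mcoplib="0.2.0" ;;
  v0.12.0) mcoplib="0.3.0" ;;
  v0.13.0) mcoplib="0.3.1" ;;
  *)       mcoplib="N/A" ;;
esac
echo "vllm-metax ${plugin} -> mcoplib ${mcoplib}"
# -> vllm-metax v0.13.0 -> mcoplib 0.3.1
```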
