Skip to content

Maxwilson1101/vLLM-metax

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

146 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vLLM

vLLM MetaX Plugin

| About MetaX | Documentation | #sig-maca |


Latest News 🔥

  • [2025/11] We hosted vLLM Beijing Meetup focusing on distributed inference and diverse accelerator support with vLLM! Please find the meetup slides here.
  • [2025/08] We hosted vLLM Shanghai Meetup focusing on building, developing, and integrating with vLLM! Please find the meetup slides here.

About

vLLM MetaX is a hardware plugin for running vLLM seamlessly on MetaX GPU, which is a cuda_alike backend and provided near-native CUDA experiences on MetaX Hardware with MACA.

It is the recommended approach for supporting the MetaX backend within the vLLM community.

The plugin follows the vLLM plugin RFCs by default:

Which ensured the hardware features and functionality support on integration of the MetaX GPU with vLLM.

Prerequisites

  • Hardware: MetaX C-series
  • OS: Linux
  • Software:
    • Python >= 3.9, < 3.12
    • vLLM (the same version as vllm-metax)
    • Docker support

Getting Started

vLLM MetaX currently only support starting on docker images release by MetaX develop community which is out of box. (DockerFile for other OS is undertesting)

If you want to develop, debug or test the newest feature on vllm-metax, you may need to build from scratch and follow this source build tutorial.

Branch

vllm-metax has master branch and dev branch.

  • master: main branch,catching up with main branch of vLLM upstream.
  • vX.Y.Z-dev: development branch, created with part of new releases of vLLM. For example, v0.10.2-dev is the dev branch for vLLM v0.10.2 version.

Below is maintained branches:

Branch Status Note
master Maintained trying to support vllm main, no gurantee on functionality
v0.11.0-dev Maintained under testing
v0.10.2-dev Maintained release on Nov.2025

Please check here for more details .

License

Apache License 2.0, as found in the LICENSE file.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 73.5%
  • Cuda 15.0%
  • C++ 8.4%
  • C 1.7%
  • CMake 0.9%
  • Shell 0.4%
  • Dockerfile 0.1%