vllm inference supports?

Hi, I want to ask whether your MLLM structure is support to use vLLM to improve the inference speed? Previous work, such as LLAMAGen, it is easily to support vLLM. I am not sure whether your MLLM structure support vLLM.

best wishes,