+This flexible hardware backend plugin mechanism would not have been possible without the efforts contributed by a lot of vLLM contributors. Thus we are deeply grateful to the vLLM maintainers, including [Kaichao You](https://github.com/youkaichao), [Simon Mo](https://github.com/simon-mo), [Cyrus Leung](https://github.com/DarkLight1337), [Robert Shaw](https://github.com/robertgshaw2-redhat), [Michael Goin](https://github.com/mgoin) and [Jee Jee Li](https://github.com/jeejeelee) for related refactor, deeply discuss and quickly review, [Xiyuan Wang](https://github.com/wangxiyuan), [Shanshan Shen](https://github.com/shen-shanshan), [Chenguang Li](https://github.com/noemotiovon) and [Mengqing Cao](https://github.com/MengqingCao) from vLLM Ascend team for mechanism design and implentment, [Joe Runde](https://github.com/joerunde) and [Yannick Schnider](https://github.com/yannicks1) from the vLLM Spyre team for pluggable scheduler design and implentment, and other contributors, including [yancong](https://github.com/ice-tong) for extendable quantization method design and implentment, [Aviv Keshet](https://github.com/akeshet) for extendable `SamplingParams`.
0 commit comments