Is your feature request related to a problem? Please describe
It can be slow on CPU mode but recently Intel introduced its NPU
Describe the solution you'd like
I suggest xinference support running on intel NPU
Describe alternatives you've considered
or it will have to run on CPU mode