**Is your feature request related to a problem? Please describe.** Thus we could use slice GPU for multi Model Hosting **Describe the solution you'd like** **Describe alternatives you've considered** **Additional context**