Skip to content

Optimize main.py for inference efficiency and GPU throughput (torch.compile, memory tuning, warp alignment)#253

Open
abdullatifcodes wants to merge 1 commit intomistralai:mainfrom
abdullatifcodes:perf/main-inference-optimization
Open

Optimize main.py for inference efficiency and GPU throughput (torch.compile, memory tuning, warp alignment)#253
abdullatifcodes wants to merge 1 commit intomistralai:mainfrom
abdullatifcodes:perf/main-inference-optimization

Commits