You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I used the exact same code to evaluate UI-Tars and InfiGUI 3B, and found that the inference time of InfiGUI is significantly longer than that of UI-Tars 2B, and is roughly the same with UI-Tars 7B. However, since they are both based on the Qwen model architecture, what might explain this unexpected discrepancy?