I'd like to see this compared against the other SOTA approaches to see how well it holds up: https://github.com/simular-ai/Agent-S https://os-world.github.io/