The vllmd sidecar is already using vllm-sim for local e2e testing. In order to test P/D the simulator must support - `max_completion_tokens` (alias for `max_tokens`) - P/D simulation. Details to follow as design is being worked out. @nilig @mayabar