The existing walk tests only check sites where references can appear, and they use models that do not test all the model features captured in the state machines.
The tests should use comprehensive models - so every transit is tested - and they should include visits to all named states.