- LLaMA is a base text-generation model, so it can't really be used for chat
- Vicuna is a chatbot model
- Alpaca is an instruction-following model
If the "mode" cannot be determined automatically, the user could specify it via a --mode CLI argument.
This would remove the existing chat/generate/file commands.
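
A rough sketch of how this could work, not a definitive implementation. It assumes the mode can be guessed from a keyword in the model name. The mode names (`generate`, `chat`, `instruct`), the `KNOWN_MODES` table, and the helper functions are all hypothetical, and letting an explicit --mode override the guess is a design assumption rather than something decided above.

```python
import argparse

# Hypothetical mapping from a model-name keyword to its default mode.
KNOWN_MODES = {
    "llama": "generate",   # base model: plain text generation
    "vicuna": "chat",      # chatbot model
    "alpaca": "instruct",  # instruction-following model
}


def infer_mode(model_name):
    """Return the mode implied by the model name, or None if unknown."""
    lowered = model_name.lower()
    for keyword, mode in KNOWN_MODES.items():
        if keyword in lowered:
            return mode
    return None


def resolve_mode(model_name, cli_mode=None):
    """Use an explicit --mode value if given, otherwise infer from the name."""
    if cli_mode is not None:
        return cli_mode
    mode = infer_mode(model_name)
    if mode is None:
        raise ValueError(
            f"cannot determine mode for {model_name!r}; pass --mode"
        )
    return mode


def build_parser():
    """Single entry point: one positional model argument plus optional --mode."""
    parser = argparse.ArgumentParser()
    parser.add_argument("model")
    parser.add_argument("--mode", choices=sorted(set(KNOWN_MODES.values())))
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(resolve_mode(args.model, args.mode))
```

With something like this, a single entry point could dispatch on the resolved mode instead of exposing separate chat/generate/file commands.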