#23 adds the definition of the countTokens RPC endpoint
This should be exposed as a new count_tokens method on model handles, so context length checks don't need to actually retrieve the entire token vector from the server when the client app is only interested in comparing it to the current model context length.