Feature: support server-side token counting

#23 adds the definition of the `countTokens` RPC endpoint

This should be exposed as a new `count_tokens` method on model handles, so context length checks don't need to actually retrieve the entire token vector from the server when the client app is only interested in comparing it to the current model context length.