The recently released OSS openai models come with a new vocabulary. The TiktokenTokenizer should be updated accordingly. https://github.com/openai/tiktoken/commit/3591ff175d6a80efbe4fcc7f0e219ddd4b8c52f1 cc: @tarekgh