Commit f0fb86b

Create tokens.md
1 parent c2bd661 commit f0fb86b

1 file changed, +13 -0 lines changed

articles/cosmos-db/gen-ai/tokens.md

Lines changed: 13 additions & 0 deletions
@@ -0,0 +1,13 @@
+---
+title: Tokens
+description: Overview of tokens in the context of large language models.
+author: wmwxwa
+ms.author: wangwilliam
+ms.service: cosmos-db
+ms.topic: conceptual
+ms.date: 07/01/2024
+---
+
+# What are tokens?
+
+Tokens are small chunks of text produced by splitting input text into segments. A token can be a whole word or a group of characters, ranging in length from a single character to an entire word. For instance, the word "hamburger" would be divided into tokens such as "ham", "bur", and "ger", while a short, common word like "pear" would be a single token. Large language models (LLMs) such as GPT-3.5 and GPT-4 break text into tokens before processing it.
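
To make the splitting described above concrete, here is a minimal sketch in Python. It is not the tokenizer used by GPT-3.5 or GPT-4: those models use byte-pair encoding with a vocabulary learned from data, whereas this sketch substitutes a simpler greedy longest-match lookup. The vocabulary `TOY_VOCAB` and the function name `tokenize` are hypothetical, chosen only so that the "hamburger" and "pear" examples come out as the article describes.

```python
# Hypothetical toy vocabulary, for illustration only. Real LLM vocabularies
# are learned from data and contain tens of thousands of entries.
TOY_VOCAB = {"ham", "bur", "ger", "pear"}


def tokenize(word, vocab=TOY_VOCAB):
    """Split a word into tokens by greedy longest-match against vocab.

    Falls back to a single character when no vocabulary entry matches.
    """
    tokens = []
    start = 0
    while start < len(word):
        # Try the longest candidate first, shrinking until one matches.
        for end in range(len(word), start, -1):
            piece = word[start:end]
            if piece in vocab:
                break
        else:
            # No vocabulary entry matched: emit one character as a token.
            end = start + 1
            piece = word[start]
        tokens.append(piece)
        start = end
    return tokens


print(tokenize("hamburger"))  # ['ham', 'bur', 'ger']
print(tokenize("pear"))       # ['pear']
print(tokenize("hat"))        # ['h', 'a', 't']
```

The last call shows the other end of the range mentioned in the text: a word with no matching vocabulary entries falls back to single-character tokens.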
