Commit 4cfb7f4
Add a rake task to generate an 'available model information' markdown guide. (#71)
Addresses issue #80.
This script illustrates how to access the various models and can be
used to produce a human-friendly, tabular markdown display of the models,
grouped by type. Here is the current output of the script:
# Model List
This is a list of all the models available in the RubyLLM library, by
type:
### Chat Models
| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| aqa | chat | Model that performs Attributed Question Answering. | gemini | 7168 | 1024 | aqa | 0.00 | 0.00 |
| babbage-002 | chat | Babbage 002 | openai | 16385 | 16384 | babbage | 0.50 | 1.50 |
| chat-bison-001 | chat | PaLM 2 Chat (Legacy) | gemini | 4096 | 1024 | other | 0.08 | 0.30 |
| chatgpt-4o-latest | chat | ChatGPT-4o Latest | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| claude-2.0 | chat | Claude 2.0 | anthropic | 200000 | 4096 | claude2 | 3.00 | 15.00 |
| claude-2.1 | chat | Claude 2.1 | anthropic | 200000 | 4096 | claude2 | 3.00 | 15.00 |
| claude-3-5-haiku-20241022 | chat | Claude 3.5 Haiku | anthropic | 200000 | 8192 | claude35_haiku | 0.80 | 4.00 |
| claude-3-5-sonnet-20240620 | chat | Claude 3.5 Sonnet (Old) | anthropic | 200000 | 8192 | claude35_sonnet | 3.00 | 15.00 |
| claude-3-5-sonnet-20241022 | chat | Claude 3.5 Sonnet (New) | anthropic | 200000 | 8192 | claude35_sonnet | 3.00 | 15.00 |
| claude-3-7-sonnet-20250219 | chat | Claude 3.7 Sonnet | anthropic | 200000 | 8192 | claude37_sonnet | 3.00 | 15.00 |
| claude-3-haiku-20240307 | chat | Claude 3 Haiku | anthropic | 200000 | 4096 | claude3_haiku | 0.25 | 1.25 |
| claude-3-opus-20240229 | chat | Claude 3 Opus | anthropic | 200000 | 4096 | claude3_opus | 15.00 | 75.00 |
| claude-3-sonnet-20240229 | chat | Claude 3 Sonnet | anthropic | 200000 | 4096 | claude3_sonnet | 3.00 | 15.00 |
| davinci-002 | chat | Davinci 002 | openai | 16385 | 16384 | davinci | 0.50 | 1.50 |
| deepseek-chat | chat | DeepSeek V3 | deepseek | 64000 | 8192 | chat | 0.27 | 1.10 |
| deepseek-reasoner | chat | DeepSeek R1 | deepseek | 64000 | 8192 | reasoner | 0.55 | 2.19 |
| gemini-1.0-pro-vision-latest | chat | Gemini 1.0 Pro Vision | gemini | 12288 | 4096 | gemini10_pro | 0.50 | 1.50 |
| gemini-1.5-flash | chat | Gemini 1.5 Flash | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-001 | chat | Gemini 1.5 Flash 001 | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-001-tuning | chat | Gemini 1.5 Flash 001 Tuning | gemini | 16384 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-002 | chat | Gemini 1.5 Flash 002 | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-8b | chat | Gemini 1.5 Flash-8B | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-001 | chat | Gemini 1.5 Flash-8B 001 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-exp-0827 | chat | Gemini 1.5 Flash 8B Experimental 0827 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-exp-0924 | chat | Gemini 1.5 Flash 8B Experimental 0924 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-latest | chat | Gemini 1.5 Flash-8B Latest | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-latest | chat | Gemini 1.5 Flash Latest | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-pro | chat | Gemini 1.5 Pro | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-001 | chat | Gemini 1.5 Pro 001 | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-002 | chat | Gemini 1.5 Pro 002 | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-latest | chat | Gemini 1.5 Pro Latest | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-2.0-flash | chat | Gemini 2.0 Flash | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-001 | chat | Gemini 2.0 Flash 001 | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-exp | chat | Gemini 2.0 Flash Experimental | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-exp-image-generation | chat | Gemini 2.0 Flash (Image Generation) Experimental | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-lite | chat | Gemini 2.0 Flash-Lite | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-001 | chat | Gemini 2.0 Flash-Lite 001 | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-preview | chat | Gemini 2.0 Flash-Lite Preview | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-preview-02-05 | chat | Gemini 2.0 Flash-Lite Preview 02-05 | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-thinking-exp | chat | Gemini 2.0 Flash Thinking Experimental 01-21 | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-thinking-exp-01-21 | chat | Gemini 2.0 Flash Thinking Experimental 01-21 | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-thinking-exp-1219 | chat | Gemini 2.0 Flash Thinking Experimental | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-pro-exp | chat | Gemini 2.0 Pro Experimental | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-2.0-pro-exp-02-05 | chat | Gemini 2.0 Pro Experimental 02-05 | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-exp-1206 | chat | Gemini Experimental 1206 | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-pro-vision | chat | Gemini 1.0 Pro Vision | gemini | 12288 | 4096 | other | 0.08 | 0.30 |
| gemma-3-27b-it | chat | Gemma 3 27B | gemini | 131072 | 8192 | other | 0.08 | 0.30 |
| gpt-3.5-turbo | chat | GPT-3.5-Turbo | openai | 16385 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-0125 | chat | GPT-3.5-Turbo 0125 | openai | 4096 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-1106 | chat | GPT-3.5-Turbo 1106 | openai | 4096 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-16k | chat | GPT-3.5-Turbo 16k | openai | 16385 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-instruct | chat | GPT-3.5-Turbo Instruct | openai | 4096 | 4096 | gpt35_instruct | 0.50 | 1.50 |
| gpt-3.5-turbo-instruct-0914 | chat | GPT-3.5-Turbo Instruct 0914 | openai | 4096 | 4096 | gpt35_instruct | 0.50 | 1.50 |
| gpt-4 | chat | GPT-4 | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4-0125-preview | chat | GPT-4-0125 Preview | openai | 8192 | 8192 | gpt4 | 0.50 | 1.50 |
| gpt-4-0613 | chat | GPT-4-0613 | openai | 8192 | 8192 | gpt4 | 0.50 | 1.50 |
| gpt-4-1106-preview | chat | GPT-4-1106 Preview | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4-turbo | chat | GPT-4-Turbo | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4-turbo-2024-04-09 | chat | GPT-4-Turbo 20240409 | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4-turbo-preview | chat | GPT-4-Turbo Preview | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4.5-preview | chat | GPT-4.5 Preview | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4.5-preview-2025-02-27 | chat | GPT-4.5 Preview 20250227 | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4o | chat | GPT-4o | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-05-13 | chat | GPT-4o 20240513 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-08-06 | chat | GPT-4o 20240806 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-11-20 | chat | GPT-4o 20241120 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-audio-preview | chat | GPT-4o-Audio Preview | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-audio-preview-2024-10-01 | chat | GPT-4o-Audio Preview 20241001 | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-audio-preview-2024-12-17 | chat | GPT-4o-Audio Preview 20241217 | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-mini | chat | GPT-4o-Mini | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-2024-07-18 | chat | GPT-4o-Mini 20240718 | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-audio-preview | chat | GPT-4o-Mini Audio Preview | openai | 128000 | 16384 | gpt4o_mini_audio | 0.50 | 1.50 |
| gpt-4o-mini-audio-preview-2024-12-17 | chat | GPT-4o-Mini Audio Preview 20241217 | openai | 128000 | 16384 | gpt4o_mini_audio | 0.50 | 1.50 |
| gpt-4o-mini-realtime-preview | chat | GPT-4o-Mini Realtime Preview | openai | 128000 | 16384 | gpt4o_mini_realtime | 0.50 | 1.50 |
| gpt-4o-mini-realtime-preview-2024-12-17 | chat | GPT-4o-Mini Realtime Preview 20241217 | openai | 128000 | 16384 | gpt4o_mini_realtime | 0.50 | 1.50 |
| gpt-4o-mini-search-preview | chat | GPT-4o-Mini Search Preview | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-search-preview-2025-03-11 | chat | GPT-4o-Mini Search Preview 20250311 | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-transcribe | chat | GPT-4o-Mini Transcribe | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-realtime-preview | chat | GPT-4o-Realtime Preview | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-realtime-preview-2024-10-01 | chat | GPT-4o-Realtime Preview 20241001 | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-realtime-preview-2024-12-17 | chat | GPT-4o-Realtime Preview 20241217 | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-search-preview | chat | GPT-4o Search Preview | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-search-preview-2025-03-11 | chat | GPT-4o Search Preview 20250311 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-transcribe | chat | GPT-4o Transcribe | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| learnlm-1.5-pro-experimental | chat | LearnLM 1.5 Pro Experimental | gemini | 32767 | 8192 | other | 0.08 | 0.30 |
| o1 | chat | O1 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-2024-12-17 | chat | O1-20241217 | openai | 200000 | 100000 | o1 | 0.50 | 1.50 |
| o1-mini | chat | O1-Mini | openai | 128000 | 4096 | o1_mini | 0.50 | 1.50 |
| o1-mini-2024-09-12 | chat | O1-Mini 20240912 | openai | 128000 | 65536 | o1_mini | 0.50 | 1.50 |
| o1-preview | chat | O1-Preview | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-preview-2024-09-12 | chat | O1-Preview 20240912 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-pro | chat | O1-Pro | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-pro-2025-03-19 | chat | O1-Pro 20250319 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o3-mini | chat | O3-Mini | openai | 200000 | 100000 | o3_mini | 0.50 | 1.50 |
| o3-mini-2025-01-31 | chat | O3-Mini 20250131 | openai | 200000 | 100000 | o3_mini | 0.50 | 1.50 |
| text-bison-001 | chat | PaLM 2 (Legacy) | gemini | 8196 | 1024 | other | 0.08 | 0.30 |
### Image Models
| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| dall-e-2 | image | DALL-E-2 | openai | 4096 | 4096 | dalle2 | 0.50 | 1.50 |
| dall-e-3 | image | DALL-E-3 | openai | 4096 | 4096 | dalle3 | 0.50 | 1.50 |
| imagen-3.0-generate-002 | image | Imagen 3.0 002 model | gemini | 480 | 8192 | other | 0.08 | 0.30 |
### Audio Models
| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| gpt-4o-mini-tts | audio | GPT-4o-Mini Tts | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| tts-1 | audio | TTS-1 | openai | 4096 | 4096 | tts1 | 0.50 | 1.50 |
| tts-1-1106 | audio | TTS-1 1106 | openai | 4096 | 4096 | tts1 | 0.50 | 1.50 |
| tts-1-hd | audio | TTS-1 HD | openai | 4096 | 4096 | tts1_hd | 0.50 | 1.50 |
| tts-1-hd-1106 | audio | TTS-1 HD 1106 | openai | 4096 | 4096 | tts1_hd | 0.50 | 1.50 |
| whisper-1 | audio | Whisper 1 | openai | 4096 | 4096 | whisper1 | 0.50 | 1.50 |
### Embedding Models
| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| embedding-001 | embedding | Embedding 001 | gemini | 2048 | 1 | embedding1 | 0.00 | 0.00 |
| embedding-gecko-001 | embedding | Embedding Gecko | gemini | 1024 | 1 | other | 0.00 | 0.00 |
| gemini-embedding-exp | embedding | Gemini Embedding Experimental | gemini | 8192 | 1 | other | 0.00 | 0.00 |
| gemini-embedding-exp-03-07 | embedding | Gemini Embedding Experimental 03-07 | gemini | 8192 | 1 | other | 0.00 | 0.00 |
| text-embedding-004 | embedding | Text Embedding 004 | gemini | 2048 | 1 | embedding4 | 0.00 | 0.00 |
| text-embedding-3-large | embedding | Text Embedding 3 Large | openai | 4096 | 4096 | embedding3_large | 0.50 | 1.50 |
| text-embedding-3-small | embedding | Text Embedding 3 Small | openai | 4096 | 4096 | embedding3_small | 0.50 | 1.50 |
| text-embedding-ada-002 | embedding | Text Embedding Ada 002 | openai | 4096 | 4096 | embedding2 | 0.50 | 1.50 |
### Moderation Models
| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| omni-moderation-2024-09-26 | moderation | Omni-Moderation 20240926 | openai | 4096 | 4096 | moderation | 0.50 | 1.50 |
| omni-moderation-latest | moderation | Omni-Moderation Latest | openai | 4096 | 4096 | moderation | 0.50 | 1.50 |
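
For illustration only, the grouping and table layout shown above could be reproduced with a sketch along the following lines. This is not the rake task added by this commit: the `Model` struct, its field names, and the `models_markdown` helper are hypothetical stand-ins, and the three sample records are copied from rows of the output above rather than read from the library.

```ruby
# Hypothetical stand-in for the library's model records. The field names
# mirror the table columns and are assumptions, not the real RubyLLM API.
Model = Struct.new(:id, :type, :name, :provider, :context, :max_tokens,
                   :family, :input_price, :output_price, keyword_init: true)

HEADERS = ['ID', 'Type', 'Name', 'Provider', 'Context', 'MaxTok',
           'Family', 'In$/M', 'Out$/M'].freeze
# Left-align text columns, right-align numeric ones, as in the output above.
ALIGNMENT = [':--', ':--', ':--', ':--', '--:', '--:', ':--', '--:', '--:'].freeze

# Join cells into a single markdown table row.
def table_row(cells)
  "| #{cells.join(' | ')} |"
end

# Render one model as a table row, with prices shown to two decimals.
def model_row(model)
  table_row([
    model.id, model.type, model.name, model.provider,
    model.context, model.max_tokens, model.family,
    format('%.2f', model.input_price), format('%.2f', model.output_price)
  ])
end

# Build the full markdown document: one heading and table per model type,
# in order of first appearance, with rows sorted by ID.
def models_markdown(models)
  lines = ['# Model List']
  models.group_by(&:type).each do |type, group|
    lines << "### #{type.capitalize} Models"
    lines << table_row(HEADERS)
    lines << table_row(ALIGNMENT)
    group.sort_by(&:id).each { |model| lines << model_row(model) }
  end
  lines.join("\n")
end

# Sample data copied from three rows of the output above.
SAMPLE_MODELS = [
  Model.new(id: 'gpt-4o', type: 'chat', name: 'GPT-4o', provider: 'openai',
            context: 128_000, max_tokens: 16_384, family: 'gpt4o',
            input_price: 0.5, output_price: 1.5),
  Model.new(id: 'claude-2.0', type: 'chat', name: 'Claude 2.0',
            provider: 'anthropic', context: 200_000, max_tokens: 4096,
            family: 'claude2', input_price: 3.0, output_price: 15.0),
  Model.new(id: 'tts-1', type: 'audio', name: 'TTS-1', provider: 'openai',
            context: 4096, max_tokens: 4096, family: 'tts1',
            input_price: 0.5, output_price: 1.5)
].freeze

puts models_markdown(SAMPLE_MODELS)
```

Emitting each row on a single line, as this sketch does, matters in practice: a markdown table row that wraps across lines no longer renders as a table.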
---------
Co-authored-by: Carmine Paolino <carmine@paolino.me>
1 parent 5340c93 commit 4cfb7f4
File tree
3 files changed, +488 -2 lines changed
- .github/workflows
- docs/guides
- lib/tasks