Commit 4cfb7f4

keithrbennett and crmne authored
Add a rake task to generate an 'available model information' markdown guide. (#71)
Addresses issue #80. This script illustrates the access of the various models, and can be used to produce a human-friendly tabular markdown display of the models, grouped by type. Here is the current output of the script:

# Model List

This is a list of all the models available in the RubyLLM library, by type:

### Chat Models

| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| aqa | chat | Model that performs Attributed Question Answering. | gemini | 7168 | 1024 | aqa | 0.00 | 0.00 |
| babbage-002 | chat | Babbage 002 | openai | 16385 | 16384 | babbage | 0.50 | 1.50 |
| chat-bison-001 | chat | PaLM 2 Chat (Legacy) | gemini | 4096 | 1024 | other | 0.08 | 0.30 |
| chatgpt-4o-latest | chat | ChatGPT-4o Latest | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| claude-2.0 | chat | Claude 2.0 | anthropic | 200000 | 4096 | claude2 | 3.00 | 15.00 |
| claude-2.1 | chat | Claude 2.1 | anthropic | 200000 | 4096 | claude2 | 3.00 | 15.00 |
| claude-3-5-haiku-20241022 | chat | Claude 3.5 Haiku | anthropic | 200000 | 8192 | claude35_haiku | 0.80 | 4.00 |
| claude-3-5-sonnet-20240620 | chat | Claude 3.5 Sonnet (Old) | anthropic | 200000 | 8192 | claude35_sonnet | 3.00 | 15.00 |
| claude-3-5-sonnet-20241022 | chat | Claude 3.5 Sonnet (New) | anthropic | 200000 | 8192 | claude35_sonnet | 3.00 | 15.00 |
| claude-3-7-sonnet-20250219 | chat | Claude 3.7 Sonnet | anthropic | 200000 | 8192 | claude37_sonnet | 3.00 | 15.00 |
| claude-3-haiku-20240307 | chat | Claude 3 Haiku | anthropic | 200000 | 4096 | claude3_haiku | 0.25 | 1.25 |
| claude-3-opus-20240229 | chat | Claude 3 Opus | anthropic | 200000 | 4096 | claude3_opus | 15.00 | 75.00 |
| claude-3-sonnet-20240229 | chat | Claude 3 Sonnet | anthropic | 200000 | 4096 | claude3_sonnet | 3.00 | 15.00 |
| davinci-002 | chat | Davinci 002 | openai | 16385 | 16384 | davinci | 0.50 | 1.50 |
| deepseek-chat | chat | DeepSeek V3 | deepseek | 64000 | 8192 | chat | 0.27 | 1.10 |
| deepseek-reasoner | chat | DeepSeek R1 | deepseek | 64000 | 8192 | reasoner | 0.55 | 2.19 |
| gemini-1.0-pro-vision-latest | chat | Gemini 1.0 Pro Vision | gemini | 12288 | 4096 | gemini10_pro | 0.50 | 1.50 |
| gemini-1.5-flash | chat | Gemini 1.5 Flash | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-001 | chat | Gemini 1.5 Flash 001 | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-001-tuning | chat | Gemini 1.5 Flash 001 Tuning | gemini | 16384 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-002 | chat | Gemini 1.5 Flash 002 | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-flash-8b | chat | Gemini 1.5 Flash-8B | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-001 | chat | Gemini 1.5 Flash-8B 001 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-exp-0827 | chat | Gemini 1.5 Flash 8B Experimental 0827 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-exp-0924 | chat | Gemini 1.5 Flash 8B Experimental 0924 | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-8b-latest | chat | Gemini 1.5 Flash-8B Latest | gemini | 1000000 | 8192 | gemini15_flash_8b | 0.08 | 0.30 |
| gemini-1.5-flash-latest | chat | Gemini 1.5 Flash Latest | gemini | 1000000 | 8192 | gemini15_flash | 0.15 | 0.60 |
| gemini-1.5-pro | chat | Gemini 1.5 Pro | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-001 | chat | Gemini 1.5 Pro 001 | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-002 | chat | Gemini 1.5 Pro 002 | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-1.5-pro-latest | chat | Gemini 1.5 Pro Latest | gemini | 2000000 | 8192 | gemini15_pro | 2.50 | 10.00 |
| gemini-2.0-flash | chat | Gemini 2.0 Flash | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-001 | chat | Gemini 2.0 Flash 001 | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-exp | chat | Gemini 2.0 Flash Experimental | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-exp-image-generation | chat | Gemini 2.0 Flash (Image Generation) Experimental | gemini | 1048576 | 8192 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-lite | chat | Gemini 2.0 Flash-Lite | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-001 | chat | Gemini 2.0 Flash-Lite 001 | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-preview | chat | Gemini 2.0 Flash-Lite Preview | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-lite-preview-02-05 | chat | Gemini 2.0 Flash-Lite Preview 02-05 | gemini | 1048576 | 8192 | gemini20_flash_lite | 0.08 | 0.30 |
| gemini-2.0-flash-thinking-exp | chat | Gemini 2.0 Flash Thinking Experimental 01-21 | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-thinking-exp-01-21 | chat | Gemini 2.0 Flash Thinking Experimental 01-21 | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-flash-thinking-exp-1219 | chat | Gemini 2.0 Flash Thinking Experimental | gemini | 1048576 | 65536 | gemini20_flash | 0.10 | 0.40 |
| gemini-2.0-pro-exp | chat | Gemini 2.0 Pro Experimental | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-2.0-pro-exp-02-05 | chat | Gemini 2.0 Pro Experimental 02-05 | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-exp-1206 | chat | Gemini Experimental 1206 | gemini | 2097152 | 8192 | other | 0.08 | 0.30 |
| gemini-pro-vision | chat | Gemini 1.0 Pro Vision | gemini | 12288 | 4096 | other | 0.08 | 0.30 |
| gemma-3-27b-it | chat | Gemma 3 27B | gemini | 131072 | 8192 | other | 0.08 | 0.30 |
| gpt-3.5-turbo | chat | GPT-3.5-Turbo | openai | 16385 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-0125 | chat | GPT-3.5-Turbo 0125 | openai | 4096 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-1106 | chat | GPT-3.5-Turbo 1106 | openai | 4096 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-16k | chat | GPT-3.5-Turbo 16k | openai | 16385 | 4096 | gpt35 | 0.50 | 1.50 |
| gpt-3.5-turbo-instruct | chat | GPT-3.5-Turbo Instruct | openai | 4096 | 4096 | gpt35_instruct | 0.50 | 1.50 |
| gpt-3.5-turbo-instruct-0914 | chat | GPT-3.5-Turbo Instruct 0914 | openai | 4096 | 4096 | gpt35_instruct | 0.50 | 1.50 |
| gpt-4 | chat | GPT-4 | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4-0125-preview | chat | GPT-4-0125 Preview | openai | 8192 | 8192 | gpt4 | 0.50 | 1.50 |
| gpt-4-0613 | chat | GPT-4-0613 | openai | 8192 | 8192 | gpt4 | 0.50 | 1.50 |
| gpt-4-1106-preview | chat | GPT-4-1106 Preview | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4-turbo | chat | GPT-4-Turbo | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4-turbo-2024-04-09 | chat | GPT-4-Turbo 20240409 | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4-turbo-preview | chat | GPT-4-Turbo Preview | openai | 128000 | 4096 | gpt4_turbo | 0.50 | 1.50 |
| gpt-4.5-preview | chat | GPT-4.5 Preview | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4.5-preview-2025-02-27 | chat | GPT-4.5 Preview 20250227 | openai | 4096 | 4096 | gpt4 | 0.50 | 1.50 |
| gpt-4o | chat | GPT-4o | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-05-13 | chat | GPT-4o 20240513 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-08-06 | chat | GPT-4o 20240806 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-2024-11-20 | chat | GPT-4o 20241120 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-audio-preview | chat | GPT-4o-Audio Preview | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-audio-preview-2024-10-01 | chat | GPT-4o-Audio Preview 20241001 | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-audio-preview-2024-12-17 | chat | GPT-4o-Audio Preview 20241217 | openai | 128000 | 16384 | gpt4o_audio | 0.50 | 1.50 |
| gpt-4o-mini | chat | GPT-4o-Mini | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-2024-07-18 | chat | GPT-4o-Mini 20240718 | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-audio-preview | chat | GPT-4o-Mini Audio Preview | openai | 128000 | 16384 | gpt4o_mini_audio | 0.50 | 1.50 |
| gpt-4o-mini-audio-preview-2024-12-17 | chat | GPT-4o-Mini Audio Preview 20241217 | openai | 128000 | 16384 | gpt4o_mini_audio | 0.50 | 1.50 |
| gpt-4o-mini-realtime-preview | chat | GPT-4o-Mini Realtime Preview | openai | 128000 | 16384 | gpt4o_mini_realtime | 0.50 | 1.50 |
| gpt-4o-mini-realtime-preview-2024-12-17 | chat | GPT-4o-Mini Realtime Preview 20241217 | openai | 128000 | 16384 | gpt4o_mini_realtime | 0.50 | 1.50 |
| gpt-4o-mini-search-preview | chat | GPT-4o-Mini Search Preview | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-search-preview-2025-03-11 | chat | GPT-4o-Mini Search Preview 20250311 | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-mini-transcribe | chat | GPT-4o-Mini Transcribe | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| gpt-4o-realtime-preview | chat | GPT-4o-Realtime Preview | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-realtime-preview-2024-10-01 | chat | GPT-4o-Realtime Preview 20241001 | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-realtime-preview-2024-12-17 | chat | GPT-4o-Realtime Preview 20241217 | openai | 128000 | 16384 | gpt4o_realtime | 0.50 | 1.50 |
| gpt-4o-search-preview | chat | GPT-4o Search Preview | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-search-preview-2025-03-11 | chat | GPT-4o Search Preview 20250311 | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| gpt-4o-transcribe | chat | GPT-4o Transcribe | openai | 128000 | 16384 | gpt4o | 0.50 | 1.50 |
| learnlm-1.5-pro-experimental | chat | LearnLM 1.5 Pro Experimental | gemini | 32767 | 8192 | other | 0.08 | 0.30 |
| o1 | chat | O1 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-2024-12-17 | chat | O1-20241217 | openai | 200000 | 100000 | o1 | 0.50 | 1.50 |
| o1-mini | chat | O1-Mini | openai | 128000 | 4096 | o1_mini | 0.50 | 1.50 |
| o1-mini-2024-09-12 | chat | O1-Mini 20240912 | openai | 128000 | 65536 | o1_mini | 0.50 | 1.50 |
| o1-preview | chat | O1-Preview | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-preview-2024-09-12 | chat | O1-Preview 20240912 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-pro | chat | O1-Pro | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o1-pro-2025-03-19 | chat | O1-Pro 20250319 | openai | 4096 | 4096 | o1 | 0.50 | 1.50 |
| o3-mini | chat | O3-Mini | openai | 200000 | 100000 | o3_mini | 0.50 | 1.50 |
| o3-mini-2025-01-31 | chat | O3-Mini 20250131 | openai | 200000 | 100000 | o3_mini | 0.50 | 1.50 |
| text-bison-001 | chat | PaLM 2 (Legacy) | gemini | 8196 | 1024 | other | 0.08 | 0.30 |

### Image Models

| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| dall-e-2 | image | DALL-E-2 | openai | 4096 | 4096 | dalle2 | 0.50 | 1.50 |
| dall-e-3 | image | DALL-E-3 | openai | 4096 | 4096 | dalle3 | 0.50 | 1.50 |
| imagen-3.0-generate-002 | image | Imagen 3.0 002 model | gemini | 480 | 8192 | other | 0.08 | 0.30 |

### Audio Models

| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| gpt-4o-mini-tts | audio | GPT-4o-Mini Tts | openai | 128000 | 16384 | gpt4o_mini | 0.50 | 1.50 |
| tts-1 | audio | TTS-1 | openai | 4096 | 4096 | tts1 | 0.50 | 1.50 |
| tts-1-1106 | audio | TTS-1 1106 | openai | 4096 | 4096 | tts1 | 0.50 | 1.50 |
| tts-1-hd | audio | TTS-1 HD | openai | 4096 | 4096 | tts1_hd | 0.50 | 1.50 |
| tts-1-hd-1106 | audio | TTS-1 HD 1106 | openai | 4096 | 4096 | tts1_hd | 0.50 | 1.50 |
| whisper-1 | audio | Whisper 1 | openai | 4096 | 4096 | whisper1 | 0.50 | 1.50 |

### Embedding Models

| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| embedding-001 | embedding | Embedding 001 | gemini | 2048 | 1 | embedding1 | 0.00 | 0.00 |
| embedding-gecko-001 | embedding | Embedding Gecko | gemini | 1024 | 1 | other | 0.00 | 0.00 |
| gemini-embedding-exp | embedding | Gemini Embedding Experimental | gemini | 8192 | 1 | other | 0.00 | 0.00 |
| gemini-embedding-exp-03-07 | embedding | Gemini Embedding Experimental 03-07 | gemini | 8192 | 1 | other | 0.00 | 0.00 |
| text-embedding-004 | embedding | Text Embedding 004 | gemini | 2048 | 1 | embedding4 | 0.00 | 0.00 |
| text-embedding-3-large | embedding | Text Embedding 3 Large | openai | 4096 | 4096 | embedding3_large | 0.50 | 1.50 |
| text-embedding-3-small | embedding | Text Embedding 3 Small | openai | 4096 | 4096 | embedding3_small | 0.50 | 1.50 |
| text-embedding-ada-002 | embedding | Text Embedding Ada 002 | openai | 4096 | 4096 | embedding2 | 0.50 | 1.50 |

### Moderation Models

| ID | Type | Name | Provider | Context | MaxTok | Family | In$/M | Out$/M |
| :-- | :-- | :-- | :-- | --: | --: | :-- | --: | --: |
| omni-moderation-2024-09-26 | moderation | Omni-Moderation 20240926 | openai | 4096 | 4096 | moderation | 0.50 | 1.50 |
| omni-moderation-latest | moderation | Omni-Moderation Latest | openai | 4096 | 4096 | moderation | 0.50 | 1.50 |

---------

Co-authored-by: Carmine Paolino <carmine@paolino.me>
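
For readers who want a feel for how a grouped markdown listing like the one in this commit message could be produced, here is a minimal sketch in plain Ruby. It is not the rake task added by this commit: the hash keys, the helper names (`markdown_row`, `model_row`, `models_table`, `models_guide`, `sample_models`) and the two-row sample are assumptions made for illustration, and the real task presumably reads its records from the library's model registry rather than from a hard-coded list.

```ruby
# Hypothetical sample records; the library's own model objects may look different.
def sample_models
  [
    { id: 'dall-e-3', type: 'image', name: 'DALL-E-3', provider: 'openai',
      context: 4096, max_tokens: 4096, family: 'dalle3', input_price: 0.5, output_price: 1.5 },
    { id: 'claude-2.1', type: 'chat', name: 'Claude 2.1', provider: 'anthropic',
      context: 200_000, max_tokens: 4096, family: 'claude2', input_price: 3.0, output_price: 15.0 }
  ]
end

# Joins cells into a single markdown table row.
def markdown_row(cells)
  "| #{cells.join(' | ')} |"
end

# Renders one model as a table row, with prices shown to two decimals.
def model_row(model)
  markdown_row([
    model[:id], model[:type], model[:name], model[:provider],
    model[:context], model[:max_tokens], model[:family],
    format('%.2f', model[:input_price]), format('%.2f', model[:output_price])
  ])
end

# Builds the header, the alignment row, and one row per model, sorted by ID.
def models_table(models)
  headers    = ['ID', 'Type', 'Name', 'Provider', 'Context', 'MaxTok', 'Family', 'In$/M', 'Out$/M']
  alignments = [':--', ':--', ':--', ':--', '--:', '--:', ':--', '--:', '--:']
  rows = models.sort_by { |model| model[:id] }.map { |model| model_row(model) }
  [markdown_row(headers), markdown_row(alignments), *rows].join("\n")
end

# Groups models by type and emits one "### <Type> Models" section per group.
def models_guide(models)
  sections = models.group_by { |model| model[:type] }.map do |type, group|
    "### #{type.capitalize} Models\n\n#{models_table(group)}"
  end
  "# Model List\n\n#{sections.join("\n\n")}\n"
end

puts models_guide(sample_models)
```

Sections appear in the order in which each type is first seen in the input, so a caller wanting a fixed section order (chat, image, audio, and so on) would need to sort or pre-order the groups.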
1 parent 5340c93 commit 4cfb7f4

3 files changed, +488 -2 lines changed

.github/workflows/docs.yml

Lines changed: 16 additions & 2 deletions
@@ -5,6 +5,7 @@ on:
     branches: [main]
     paths:
       - 'docs/**'
+      - 'lib/**'
       - '.github/workflows/docs.yml'
   workflow_dispatch:
 
@@ -23,19 +24,32 @@ jobs:
     steps:
       - name: Checkout
         uses: actions/checkout@v4
-      - name: Setup Ruby
+
+      - name: Setup Ruby for models guide generation (ruby_llm environment)
+        uses: ruby/setup-ruby@v1
+        with:
+          ruby-version: '3.3'
+          bundler-cache: true
+
+      - name: Run models:docs rake task
+        run: bundle exec rake models:docs
+
+      - name: Setup Ruby for Jekyll build (ruby_llm environment)
         uses: ruby/setup-ruby@v1
         with:
           ruby-version: '3.3'
           bundler-cache: true
           working-directory: docs
+
       - name: Setup Pages
         uses: actions/configure-pages@v4
+
       - name: Build with Jekyll
         working-directory: docs
         run: bundle exec jekyll build --baseurl "${{ steps.pages.outputs.base_path }}"
         env:
           JEKYLL_ENV: production
+
       - name: Upload artifact
         uses: actions/upload-pages-artifact@v3
         with:
@@ -50,4 +64,4 @@ jobs:
     steps:
       - name: Deploy to GitHub Pages
         id: deployment
-        uses: actions/deploy-pages@v4
+        uses: actions/deploy-pages@v4
