Commit 07cca4b

chore(model gallery): add ibm-granite_granite-4.0-h-small (#6373)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent dd927c3 commit 07cca4b

2 files changed: +70 -0 lines changed

gallery/granite4.yaml

Lines changed: 48 additions & 0 deletions
@@ -0,0 +1,48 @@
+---
+name: "granite-3.2"
+
+config_file: |
+  backend: "llama-cpp"
+  mmap: true
+  template:
+    chat_message: |
+      <|start_of_role|>{{ .RoleName }}<|end_of_role|>
+      {{ if .FunctionCall -}}
+      <tool_call>
+      {{ else if eq .RoleName "tool" -}}
+      <tool_response>
+      {{ end -}}
+      {{ if .Content -}}
+      {{.Content }}
+      {{ end -}}
+      {{ if eq .RoleName "tool" -}}
+      </tool_response>
+      {{ end -}}
+      {{ if .FunctionCall -}}
+      {{toJson .FunctionCall}}
+      </tool_call>
+      {{ end -}}
+      <|end_of_text|>
+    function: |
+      <|start_of_role|>system<|end_of_role|>
+      You are a helpful AI assistant with access to the following tools. When a tool is required to answer the user's query, respond with <|tool_call|> followed by a JSON list of tools used. If a tool does not exist in the provided list of tools, notify the user that you do not have the ability to fulfill the request.
+
+      Write the response to the user's input by strictly aligning with the facts in the provided documents. If the information needed to answer the question is not available in the documents, inform the user that the question cannot be answered based on the available data.
+      {{range .Functions}}
+      {'type': 'function', 'function': {'name': '{{.Name}}', 'description': '{{.Description}}', 'parameters': {{toJson .Parameters}} }}
+      {{end}}
+      For each function call return a json object with function name and arguments
+      {{.Input -}}
+      <|start_of_role|>assistant<|end_of_role|>
+    chat: |
+      {{.Input -}}
+      <|start_of_role|>assistant<|end_of_role|>
+    completion: |
+      {{.Input}}
+  context_size: 8192
+  f16: true
+  stopwords:
+  - '<|im_end|>'
+  - '<dummy32000>'
+  - '</s>'
+  - '<|end_of_text|>'
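
The `chat_message` template in `gallery/granite4.yaml` uses Go template syntax, where a `-}}` marker trims the whitespace that follows an action. As a rough aid for reviewing what a single message expands to, the branching can be transliterated into Python. This is only a sketch: `render_chat_message` is a hypothetical helper and not part of LocalAI, and `json.dumps` stands in for the template's `toJson`, so the exact JSON formatting may differ from what the real backend emits.

```python
import json


def render_chat_message(role, content="", function_call=None):
    """Approximate the expansion of the chat_message template for one message.

    Hypothetical helper for illustration; it mirrors the template's branches
    in order and is not LocalAI's implementation.
    """
    parts = [f"<|start_of_role|>{role}<|end_of_role|>\n"]

    # Opening tag: a function call takes precedence over the "tool" role,
    # matching the template's if / else-if ordering.
    if function_call:
        parts.append("<tool_call>\n")
    elif role == "tool":
        parts.append("<tool_response>\n")

    if content:
        parts.append(f"{content}\n")

    # Closing tag for tool responses is checked independently of function_call,
    # as in the template.
    if role == "tool":
        parts.append("</tool_response>\n")

    if function_call:
        # json.dumps is an assumed stand-in for the template's toJson helper.
        parts.append(json.dumps(function_call) + "\n</tool_call>\n")

    parts.append("<|end_of_text|>\n")
    return "".join(parts)


if __name__ == "__main__":
    print(render_chat_message("user", "Hello"), end="")
    print(render_chat_message("tool", "42"), end="")
```

Under these assumptions a plain user message becomes the role header, the content, and the `<|end_of_text|>` terminator on separate lines, while a `tool` message has its content wrapped in `<tool_response>` tags.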

gallery/index.yaml

Lines changed: 22 additions & 0 deletions
@@ -1,4 +1,26 @@
 ---
+- &granite4
+  url: "github:mudler/LocalAI/gallery/granite4.yaml@master"
+  name: "ibm-granite_granite-4.0-h-small"
+  license: apache-2.0
+  icon: https://cdn-avatars.huggingface.co/v1/production/uploads/639bcaa2445b133a4e942436/CEW-OjXkRkDNmTxSu8Egh.png
+  tags:
+    - gguf
+    - GPU
+    - CPU
+    - text-to-text
+  urls:
+    - https://huggingface.co/ibm-granite/granite-4.0-h-small
+    - https://huggingface.co/bartowski/ibm-granite_granite-4.0-h-small-GGUF
+  description: |
+    Granite-4.0-H-Small is a 32B parameter long-context instruct model finetuned from Granite-4.0-H-Small-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets. This model is developed using a diverse set of techniques with a structured chat format, including supervised finetuning, model alignment using reinforcement learning, and model merging. Granite 4.0 instruct models feature improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
+  overrides:
+    parameters:
+      model: ibm-granite_granite-4.0-h-small-Q4_K_M.gguf
+  files:
+    - filename: ibm-granite_granite-4.0-h-small-Q4_K_M.gguf
+      sha256: c59ce76239bd5794acdbdf88616dfc296247f4e78792a9678d4b3e24966ead69
+      uri: huggingface://bartowski/ibm-granite_granite-4.0-h-small-GGUF/ibm-granite_granite-4.0-h-small-Q4_K_M.gguf
 - &ernie
   url: "github:mudler/LocalAI/gallery/chatml.yaml@master"
   name: "baidu_ernie-4.5-21b-a3b-thinking"
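
The `files` entry above pairs the GGUF filename with a `sha256` digest, which makes it possible to check a downloaded copy by hand. The following is a minimal sketch of such a check in Python; the helper names `sha256_of_stream` and `file_matches` are hypothetical, and this is not a description of how LocalAI itself verifies downloads. The file is hashed in chunks so a multi-gigabyte model does not have to fit in memory.

```python
import hashlib
import io

# Values copied from the gallery/index.yaml entry in this commit.
EXPECTED_SHA256 = "c59ce76239bd5794acdbdf88616dfc296247f4e78792a9678d4b3e24966ead69"
MODEL_FILENAME = "ibm-granite_granite-4.0-h-small-Q4_K_M.gguf"


def sha256_of_stream(stream, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches(path, expected_hex):
    """Return True if the file at `path` hashes to `expected_hex`."""
    with open(path, "rb") as fh:
        return sha256_of_stream(fh) == expected_hex.lower()


if __name__ == "__main__":
    # Self-check on an in-memory stream; no model file is needed for this.
    print(sha256_of_stream(io.BytesIO(b"abc")))
    # With a local copy of the model, one would call:
    #   file_matches(MODEL_FILENAME, EXPECTED_SHA256)
```

A mismatch would usually point to a truncated download or to a file other than the Q4_K_M quantization listed in the entry.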
