
Commit fb56a92

Merge pull request #1907 from ramalama-labs/feat/rlcr-model-store
Reorganize transports and add new rlcr transport option
2 parents: fdc5e69 + 4bdcd4e


48 files changed: +1914 −289 lines

.packit-copr-rpm.sh

Lines changed: 2 additions & 2 deletions

@@ -6,8 +6,8 @@
 
 set -exo pipefail
 
-# Extract version from pyproject.toml instead of setup.py
-VERSION=$(awk -F'[""]' ' /^\s*version\s*/ {print $(NF-1)}' pyproject.toml )
+# Extract version from Python module since pyproject.toml uses dynamic versioning
+VERSION=$(python3 -c "import ramalama.version; print(ramalama.version.version())")
 
 SPEC_FILE=rpm/ramalama.spec
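The hunk above replaces an awk scrape of pyproject.toml with a direct Python call. A minimal sketch of the motivation, using a stub pyproject.toml and a stubbed version call (the real script imports `ramalama.version`, which only exists inside the repo):

```shell
# Hypothetical pyproject.toml using dynamic versioning: there is no
# literal `version = "..."` line for the old awk pattern to match.
cat > /tmp/pyproject.toml <<'EOF'
[project]
name = "ramalama"
dynamic = ["version"]
EOF

# Old approach: yields an empty string on dynamically-versioned projects.
OLD=$(awk -F'[""]' ' /^\s*version\s*/ {print $(NF-1)}' /tmp/pyproject.toml )
echo "awk: '${OLD}'"

# New approach (stubbed here with a literal): ask Python for the version
# directly; the real script runs `import ramalama.version; ...version()`.
VERSION=$(python3 -c 'print("0.12.1")')
echo "python: '${VERSION}'"
```

With dynamic versioning the version only exists at runtime, so asking the Python module is the reliable source.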

docs/ramalama-bench.1.md

Lines changed: 1 addition & 0 deletions

@@ -14,6 +14,7 @@ ramalama\-bench - benchmark specified AI Model
 | HuggingFace | huggingface://, hf://, hf.co/ | [`huggingface.co`](https://www.huggingface.co)|
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|
 

docs/ramalama-perplexity.1.md

Lines changed: 1 addition & 0 deletions

@@ -14,6 +14,7 @@ ramalama\-perplexity - calculate the perplexity value of an AI Model
 | HuggingFace | huggingface://, hf://, hf.co/ | [`huggingface.co`](https://www.huggingface.co)|
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|
 

docs/ramalama-run.1.md

Lines changed: 1 addition & 0 deletions

@@ -14,6 +14,7 @@ ramalama\-run - run specified AI Model as a chatbot
 | HuggingFace | huggingface://, hf://, hf.co/ | [`huggingface.co`](https://www.huggingface.co)|
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|
 

docs/ramalama-serve.1.md

Lines changed: 1 addition & 0 deletions

@@ -19,6 +19,7 @@ registry if it does not exist in local storage.
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|
 
 RamaLama defaults to the Ollama registry transport. This default can be overridden in the `ramalama.conf` file or via the RAMALAMA_TRANSPORTS
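The prefix-to-registry mapping in the table can be sketched as a tiny dispatcher. `transport_of` is a hypothetical helper for illustration only; RamaLama's actual resolution happens in its Python model store:

```shell
# transport_of mirrors the prefix table above; hypothetical shell helper,
# not part of RamaLama's CLI.
transport_of() {
  case "$1" in
    huggingface://*|hf://*|hf.co/*) echo huggingface ;;
    modelscope://*|ms://*)          echo modelscope ;;
    ollama://*)                     echo ollama ;;
    rlcr://*)                       echo rlcr ;;
    oci://*)                        echo oci ;;
    # Bare names fall back to the default transport (Ollama), which the
    # docs say can be overridden via RAMALAMA_TRANSPORTS.
    *)  echo "${RAMALAMA_TRANSPORTS:-ollama}" ;;
  esac
}

transport_of rlcr://smollm:135m   # rlcr
transport_of smollm:135m          # ollama (unless RAMALAMA_TRANSPORTS is set)
```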

docs/ramalama.1.md

Lines changed: 20 additions & 19 deletions

@@ -58,6 +58,7 @@ RamaLama supports multiple AI model registries types called transports. Supporte
 | HuggingFace | huggingface://, hf://, hf.co/ | [`huggingface.co`](https://www.huggingface.co)|
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|

@@ -135,25 +136,25 @@ The default can be overridden in the ramalama.conf file.
 
 | Command | Description |
 | ------------------------------------------------- | ---------------------------------------------------------- |
-| [ramalama-bench(1)](ramalama-bench.1.md) | benchmark specified AI Model |
-| [ramalama-chat(1)](ramalama-chat.1.md) | OpenAI chat with the specified REST API URL |
-| [ramalama-containers(1)](ramalama-containers.1.md)| list all RamaLama containers |
-| [ramalama-convert(1)](ramalama-convert.1.md) | convert AI Models from local storage to OCI Image |
-| [ramalama-daemon(1)](ramalama-daemon.1.md) | run a RamaLama REST server |
-| [ramalama-info(1)](ramalama-info.1.md) | display RamaLama configuration information |
-| [ramalama-inspect(1)](ramalama-inspect.1.md) | inspect the specified AI Model |
-| [ramalama-list(1)](ramalama-list.1.md) | list all downloaded AI Models |
-| [ramalama-login(1)](ramalama-login.1.md) | login to remote registry |
-| [ramalama-logout(1)](ramalama-logout.1.md) | logout from remote registry |
-| [ramalama-perplexity(1)](ramalama-perplexity.1.md)| calculate the perplexity value of an AI Model |
-| [ramalama-pull(1)](ramalama-pull.1.md) | pull AI Models from Model registries to local storage |
-| [ramalama-push(1)](ramalama-push.1.md) | push AI Models from local storage to remote registries |
-| [ramalama-rag(1)](ramalama-rag.1.md) | generate and convert Retrieval Augmented Generation (RAG) data from provided documents into an OCI Image |
-| [ramalama-rm(1)](ramalama-rm.1.md) | remove AI Models from local storage |
-| [ramalama-run(1)](ramalama-run.1.md) | run specified AI Model as a chatbot |
-| [ramalama-serve(1)](ramalama-serve.1.md) | serve REST API on specified AI Model |
-| [ramalama-stop(1)](ramalama-stop.1.md) | stop named container that is running AI Model |
-| [ramalama-version(1)](ramalama-version.1.md) | display version of RamaLama |
+| [ramalama-bench(1)](ramalama-bench.1.md) |benchmark specified AI Model|
+| [ramalama-chat(1)](ramalama-chat.1.md) |OpenAI chat with the specified REST API URL|
+| [ramalama-containers(1)](ramalama-containers.1.md)|list all RamaLama containers|
+| [ramalama-convert(1)](ramalama-convert.1.md) |convert AI Models from local storage to OCI Image|
+| [ramalama-daemon(1)](ramalama-daemon.1.md) |run a RamaLama REST server|
+| [ramalama-info(1)](ramalama-info.1.md) |display RamaLama configuration information|
+| [ramalama-inspect(1)](ramalama-inspect.1.md) |inspect the specified AI Model|
+| [ramalama-list(1)](ramalama-list.1.md) |list all downloaded AI Models|
+| [ramalama-login(1)](ramalama-login.1.md) |login to remote registry|
+| [ramalama-logout(1)](ramalama-logout.1.md) |logout from remote registry|
+| [ramalama-perplexity(1)](ramalama-perplexity.1.md)|calculate the perplexity value of an AI Model|
+| [ramalama-pull(1)](ramalama-pull.1.md) |pull AI Models from Model registries to local storage|
+| [ramalama-push(1)](ramalama-push.1.md) |push AI Models from local storage to remote registries|
+| [ramalama-rag(1)](ramalama-rag.1.md) |generate and convert Retrieval Augmented Generation (RAG) data from provided documents into an OCI Image|
+| [ramalama-rm(1)](ramalama-rm.1.md) |remove AI Models from local storage|
+| [ramalama-run(1)](ramalama-run.1.md) |run specified AI Model as a chatbot|
+| [ramalama-serve(1)](ramalama-serve.1.md) |serve REST API on specified AI Model|
+| [ramalama-stop(1)](ramalama-stop.1.md) |stop named container that is running AI Model|
+| [ramalama-version(1)](ramalama-version.1.md) |display version of RamaLama|
 
 ## CONFIGURATION FILES
 

docsite/docs/commands/ramalama/bench.mdx

Lines changed: 10 additions & 1 deletion

@@ -18,6 +18,7 @@ description: benchmark specified AI Model
 | HuggingFace | huggingface://, hf://, hf.co/ | [`huggingface.co`](https://www.huggingface.co)|
 | ModelScope | modelscope://, ms:// | [`modelscope.cn`](https://modelscope.cn/)|
 | Ollama | ollama:// | [`ollama.com`](https://www.ollama.com)|
+| rlcr | rlcr:// | [`ramalama.com`](https://registry.ramalama.com/projects/ramalama) |
 | OCI Container Registries | oci:// | [`opencontainers.org`](https://opencontainers.org)|
 |||Examples: [`quay.io`](https://quay.io), [`Docker Hub`](https://docker.io),[`Artifactory`](https://artifactory.com)|

@@ -40,6 +41,11 @@ write, and m for mknod(2).
 
 Example: --device=/dev/dri/renderD128:/dev/xvdc:rwm
 
+The device specification is passed directly to the underlying container engine. See the documentation of the supported container engine for more information.
+
+Pass '--device=none' to explicitly add no device to the container, e.g. for
+running a CPU-only performance comparison.
+
 #### **--env**=
 
 Set environment variables inside of the container.

@@ -57,7 +63,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
 images based on the accelerator it discovers. For example:
 `quay.io/ramalama/ramalama`. See the table below for all default images.
 The default image tag is based on the minor version of the RamaLama package.
-Version 0.11.1 of RamaLama pulls an image with a `:0.11` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
+Version 0.12.1 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
 
 The default can be overridden in the ramalama.conf file or via the
 RAMALAMA_IMAGE environment variable. `export RAMALAMA_IMAGE=quay.io/ramalama/aiimage:1.2` tells

@@ -139,6 +145,9 @@ llama.cpp explains this as:
 
 Usage: Lower numbers are good for virtual assistants where we need deterministic responses. Higher numbers are good for roleplay or creative tasks like editing stories
 
+#### **--thinking**=*true*
+Enable or disable thinking mode in reasoning models
+
 #### **--threads**, **-t**
 Maximum number of cpu threads to use.
 The default is to use half the cores available on this system for the number of threads.
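The image-tag rule in the `--image` hunk (version 0.12.1 pulls a `:0.12` tag) is major.minor truncation of the package version; a minimal sketch:

```shell
# Derive the default image tag from the RamaLama version, as described
# in the docs: the tag is the version with the patch component stripped.
version=0.12.1
tag=${version%.*}                      # 0.12.1 -> 0.12
image="quay.io/ramalama/ramalama:${tag}"
echo "$image"                          # quay.io/ramalama/ramalama:0.12
```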

docsite/docs/commands/ramalama/chat.mdx

Lines changed: 6 additions & 0 deletions

@@ -56,6 +56,12 @@ $ ramalama chat
 Communicate with an alternative OpenAI REST API URL. With Docker containers.
 $ ramalama chat --url http://localhost:1234
 🐋 >
+
+Send multiple lines at once
+$ ramalama chat
+🦭 > Hi \
+🦭 > tell me a funny story \
+🦭 > please
 ```
 
 ## See Also
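The multi-line chat example above relies on a trailing backslash to continue the prompt onto the next line. A rough sketch of that accumulation logic, as a hypothetical shell reader (RamaLama's chat client implements this in Python, not shell):

```shell
# read_multiline joins input lines while each ends in a backslash,
# mirroring the `🦭 > Hi \` continuation in the chat example above.
read_multiline() {
  local buf="" line
  while IFS= read -r line; do
    case "$line" in
      *\\) buf="${buf}${line%\\}" ;;   # strip the backslash, keep reading
      *)   buf="${buf}${line}"; break ;;
    esac
  done
  printf '%s\n' "$buf"
}

printf 'Hi \\\ntell me a funny story \\\nplease\n' | read_multiline
# -> Hi tell me a funny story please
```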
docsite/docs/commands/ramalama/daemon.mdx

Lines changed: 83 additions & 0 deletions

@@ -0,0 +1,83 @@
+---
+title: ramalama daemon.1
+description: run a RamaLama REST server
+# This file is auto-generated from manpages. Do not edit manually.
+# Source: ramalama-daemon.1.md
+---
+
+# ramalama daemon.1
+
+## Synopsis
+**ramalama daemon** [*options*] [start|run]
+
+## Description
+Run a RamaLama REST server, either inside a RamaLama
+container or on the host.
+
+## Options
+
+#### **--help**, **-h**
+Print usage message
+
+## COMMANDS
+
+#### **start**
+prepares to run a new RamaLama REST server so it will be run either inside a RamaLama container or on the host
+
+#### **run**
+start a new RamaLama REST server
+
+## Examples
+
+Inspect the smollm:135m model for basic information
+```bash
+$ ramalama inspect smollm:135m
+smollm:135m
+   Path: /var/lib/ramalama/models/ollama/smollm:135m
+   Registry: ollama
+   Format: GGUF
+   Version: 3
+   Endianness: little
+   Metadata: 39 entries
+   Tensors: 272 entries
+```
+
+Inspect the smollm:135m model for all information in json format
+```bash
+$ ramalama inspect smollm:135m --all --json
+{
+    "Name": "smollm:135m",
+    "Path": "/home/mengel/.local/share/ramalama/models/ollama/smollm:135m",
+    "Registry": "ollama",
+    "Format": "GGUF",
+    "Version": 3,
+    "LittleEndian": true,
+    "Metadata": {
+        "general.architecture": "llama",
+        "general.base_model.0.name": "SmolLM 135M",
+        "general.base_model.0.organization": "HuggingFaceTB",
+        "general.base_model.0.repo_url": "https://huggingface.co/HuggingFaceTB/SmolLM-135M",
+        ...
+    },
+    "Tensors": [
+        {
+            "dimensions": [
+                576,
+                49152
+            ],
+            "n_dimensions": 2,
+            "name": "token_embd.weight",
+            "offset": 0,
+            "type": 8
+        },
+        ...
+    ]
+}
+```
+
+## See Also
+[ramalama(1)](/docs/commands/ramalama/)
+
+---
+
+*Feb 2025, Originally compiled by Michael Engel <mengel@redhat.com>*

docsite/docs/commands/ramalama/inspect.mdx

Lines changed: 42 additions & 1 deletion

@@ -18,7 +18,14 @@ like the repository, its metadata and tensor information.
 
 #### **--all**
 Print all available information about the AI Model.
-By default, only a basic subset is printed.
+By default, only a basic subset is printed.
+
+#### **--get**=*field*
+Print the value of a specific metadata field of the AI Model.
+This option supports autocomplete with the available metadata
+fields of the given model.
+The special value `all` will print all available metadata
+fields and values.
 
 #### **--help**, **-h**
 Print usage message

@@ -74,6 +81,40 @@ $ ramalama inspect smollm:135m --all --json
 }
 ```
 
+Use the autocomplete function of `--get` to view a list of fields:
+```bash
+$ ramalama inspect smollm:135m --get general.
+general.architecture               general.languages
+general.base_model.0.name          general.license
+general.base_model.0.organization  general.name
+general.base_model.0.repo_url      general.organization
+general.base_model.count           general.quantization_version
+general.basename                   general.size_label
+general.datasets                   general.tags
+general.file_type                  general.type
+general.finetune
+```
+
+Print the value of a specific field of the smollm:135m model:
+```bash
+$ ramalama inspect smollm:135m --get tokenizer.chat_template
+{% for message in messages %}{{'<|im_start|>' + message['role'] + '
+' + message['content'] + '<|im_end|>' + '
+'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
+' }}{% endif %}
+```
+
+Print all key-value pairs of the metadata of the smollm:135m model:
+```bash
+$ ramalama inspect smollm:135m --get all
+general.architecture: llama
+general.base_model.0.name: SmolLM 135M
+general.base_model.0.organization: HuggingFaceTB
+general.base_model.0.repo_url: https://huggingface.co/HuggingFaceTB/SmolLM-135M
+general.base_model.count: 1
+...
+```
+
 ## See Also
 [ramalama(1)](/docs/commands/ramalama/)
 
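The `--get all` output above is essentially the `Metadata` object from the `--all --json` view flattened into `key: value` lines. A sketch of that flattening, using a stub JSON file abbreviated from the example (not RamaLama's actual implementation):

```shell
# Stub of the --all --json output, abbreviated to two metadata entries.
cat > /tmp/inspect.json <<'EOF'
{
  "Name": "smollm:135m",
  "Metadata": {
    "general.architecture": "llama",
    "general.base_model.0.name": "SmolLM 135M"
  }
}
EOF

# Flatten the Metadata object into `key: value` lines, the same shape
# that `ramalama inspect --get all` prints.
python3 - <<'EOF'
import json

with open("/tmp/inspect.json") as f:
    meta = json.load(f)["Metadata"]
for key, value in meta.items():
    print(f"{key}: {value}")
EOF
```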
