Commit 7d8f166

[feat] add README and doc.
1 parent ccbdc7e commit 7d8f166

4 files changed: +408 -0 lines changed

README.md

Lines changed: 111 additions & 0 deletions
@@ -0,0 +1,111 @@

# jinja.cpp

![License](https://img.shields.io/badge/license-Apache%20License%202.0-green)
![Build Status](https://github.com/wangzhaode/jinja.cpp/actions/workflows/build.yml/badge.svg)
[![中文版本](https://img.shields.io/badge/Language-%E7%AE%80%E4%BD%93%E4%B8%AD%E6%96%87-green)](README_CN.md)

A lightweight, minimal C++11 implementation of the Jinja2 template engine, designed specifically for **LLM Chat Templates** (HuggingFace style).

It focuses on supporting the subset of Jinja2 used by modern Large Language Models (LLMs) like Llama 3, Qwen 2.5/3, DeepSeek, and others, enabling seamless inference integration in C++ environments.

## Features

- **C++11 Compatible**: Ensures maximum compatibility across older compiler versions and embedded systems.
- **Lightweight**: Minimal dependencies (only `nlohmann/json`).
- **LLM Focused**: Native support for `messages`, `tools`, `add_generation_prompt`, and special tokens.
- **Strictly Typed**: Uses `nlohmann::json` for context management.
- **Custom Function Interop**: Easily inject C++ functions (e.g., `strftime_now`) into templates.
- **Robust**: Validated against official Python `transformers` outputs using fuzzy matching tests.

## Supported Models

Tested and verified with templates from:

- **Llama 3 / 3.1 / 3.2** (Instruct & Vision)
- **Qwen 2.5** (Coder, Math, VL, Omni)
- **Qwen 3** (Instruct, Thinking, QwQ)
- **DeepSeek** (V3, R1)
- **Mistral**
- **Gemma**
- And more...

## Build Instructions

### Prerequisites

- CMake 3.10+
- C++11 compatible compiler (GCC, Clang, MSVC)

```bash
mkdir build
cd build
cmake ..
make
```

### Run Tests

The project includes a comprehensive test suite based on real-world model templates.

```bash
./test_main
```

## Usage

### Basic Rendering

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    std::string template_str = "Hello {{ name }}!";
    jinja::Template tpl(template_str);

    nlohmann::json context;
    context["name"] = "World";

    std::string result = tpl.render(context);
    std::cout << result << std::endl; // Output: Hello World!
    return 0;
}
```

### LLM Chat Template

```cpp
#include "jinja.hpp"

// Load your tokenizer_config.json's "chat_template"
std::string chat_template_str = "...";
jinja::Template tpl(chat_template_str);

nlohmann::json messages = nlohmann::json::array({
    {{"role", "user"}, {"content", "Hello!"}}
});

// Apply template
std::string prompt = tpl.apply_chat_template(
    messages,
    true,                    // add_generation_prompt
    nlohmann::json::array()  // tools
);
```

### Custom Functions

You can register custom C++ functions to be called from within the template.

```cpp
tpl.add_function("strftime_now", [](const std::vector<nlohmann::json>& args) {
    // Return current time string
    return "2025-12-16";
});
```
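
In practice the registered callable would typically query the system clock. The sketch below is illustrative only: it assumes the first element of `args` (if present) is a `strftime`-style format string and that the returned `std::string` is converted to `nlohmann::json` by `add_function`.

```cpp
#include <ctime>
#include <string>

// Hypothetical clock-backed variant of the function registered above.
tpl.add_function("strftime_now", [](const std::vector<nlohmann::json>& args) {
    // Assumed convention: args[0] carries the format string, e.g. "%Y-%m-%d".
    std::string fmt = args.empty() ? "%Y-%m-%d" : args[0].get<std::string>();

    std::time_t now = std::time(nullptr);
    char buf[128] = {0};
    std::strftime(buf, sizeof(buf), fmt.c_str(), std::localtime(&now));
    return std::string(buf);
});
```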

## Documentation

For implementation details, see [doc/implementation_details.md](doc/implementation_details.md).

## License

Apache License 2.0. See the [LICENSE](LICENSE) file for details.

README_CN.md

Lines changed: 111 additions & 0 deletions
@@ -0,0 +1,111 @@

# jinja.cpp (Chinese Version)

![License](https://img.shields.io/badge/license-Apache%20License%202.0-green)
![Build Status](https://github.com/wangzhaode/jinja.cpp/actions/workflows/build.yml/badge.svg)
[![English Version](https://img.shields.io/badge/Language-English-green)](README.md)

A lightweight, fully C++11-compatible implementation of the Jinja2 template engine, designed specifically for **LLM chat templates** (HuggingFace style).

It focuses on supporting the subset of Jinja2 syntax required by modern large language models (such as Llama 3, Qwen 2.5/3, and DeepSeek), making inference integration in C++ environments seamless and efficient.

## Features

- **C++11 compatible**: Ensures maximum compatibility with older compilers and embedded systems.
- **Lightweight**: Minimal dependencies (only `nlohmann/json`, which is bundled with the project).
- **LLM focused**: Native handling of `messages`, `tools`, `add_generation_prompt`, and special tokens.
- **Type safe**: Uses `nlohmann::json` for context management.
- **Custom functions**: Easily inject C++ functions (such as `strftime_now`) into templates.
- **Robust**: Verified against the output of the official Python `transformers` library via fuzzy-matching tests.

## Supported Models

Tested and verified against the real templates of the following models:

- **Llama 3 / 3.1 / 3.2** (Instruct & Vision)
- **Qwen 2.5** (Coder, Math, VL, Omni)
- **Qwen 3** (Instruct, Thinking, QwQ)
- **DeepSeek** (V3, R1)
- **Mistral**
- **Gemma**
- And more...

## Build Instructions

### Prerequisites

- CMake 3.10+
- A C++11-capable compiler (GCC, Clang, MSVC)

```bash
mkdir build
cd build
cmake ..
make
```

### Running the Tests

The project ships with a comprehensive test suite based on real model templates.

```bash
./test_main
```

## Usage

### Basic Rendering

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    std::string template_str = "Hello {{ name }}!";
    jinja::Template tpl(template_str);

    nlohmann::json context;
    context["name"] = "World";

    std::string result = tpl.render(context);
    std::cout << result << std::endl; // Output: Hello World!
    return 0;
}
```

### LLM Chat Template

```cpp
#include "jinja.hpp"

// Load the "chat_template" string from tokenizer_config.json
std::string chat_template_str = "...";
jinja::Template tpl(chat_template_str);

nlohmann::json messages = nlohmann::json::array({
    {{"role", "user"}, {"content", "你好!"}}
});

// Apply the template
std::string prompt = tpl.apply_chat_template(
    messages,
    true,                    // add_generation_prompt
    nlohmann::json::array()  // tools
);
```

### Custom Functions

You can register custom C++ functions and call them from inside a template.

```cpp
tpl.add_function("strftime_now", [](const std::vector<nlohmann::json>& args) {
    // Return the current time as a string
    return "2025-12-16";
});
```

## Documentation

For implementation details, see [doc/implementation_details_CN.md](doc/implementation_details_CN.md).

## License

Apache License 2.0. See the [LICENSE](LICENSE) file for details.

doc/implementation_details.md

Lines changed: 93 additions & 0 deletions
@@ -0,0 +1,93 @@

# Implementation Details

This document provides an overview of the internal architecture and design decisions of `jinja.cpp`.

## Architecture

The engine follows a standard compiler/interpreter pipeline (a condensed sketch follows the list below):

1. **Lexer (`Lexer` class)**:
    * Scans the input string.
    * Tokenizes the Jinja delimiters `{{ ... }}`, `{% ... %}`, `{# ... #}`.
    * Handles whitespace control modifiers (`-`, as in `{{-`) by tracking state and stripping preceding/succeeding whitespace from text tokens.
    * Produces a flat list of `Token`s.

2. **Parser (`Parser` class)**:
    * A recursive descent parser.
    * Converts the token stream into an Abstract Syntax Tree (AST).
    * Handles operator precedence for expressions.
    * Supports:
        * Binary operators (`+`, `-`, `*`, `/`, `%`, `==`, `!=`, `<`, `>`, `<=`, `>=`, `and`, `or`, `in`, `not in`, `~`).
        * Unary operators (`not`, `-`).
        * Literals (string, number, boolean, array, object).
        * Variables and attribute access (`foo.bar`, `foo['bar']`).
        * Function calls and filters (`foo | filter`).
        * Control structures (`for`, `if`, `set`, `macro`).

3. **AST (`Node` hierarchy)**:
    * Base `Node` class with a virtual `render(Context&, string& out)` method.
    * Nodes: `TextNode`, `PrintNode`, `ForStmt`, `IfNode`, `SetNode`, `MacroNode`.
    * Expressions (`Expr` hierarchy) evaluate to `nlohmann::json` values.

4. **Interpreter / Renderer (`Template::render`)**:
    * Iterates over the root nodes and calls `render` on each.
    * Manages the `Context` (scopes, variables).
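
To make the pipeline concrete, here is a heavily condensed C++ sketch of the data structures described above. The class and member names mirror the description where possible; everything else (the `TokenType` values, the `render_all` helper) is illustrative and not the actual code in `src/jinja.cpp`.

```cpp
#include <memory>
#include <string>
#include <vector>

// Token kinds produced by the Lexer (illustrative, not the real enum).
enum class TokenType { Text, Print, Statement, Comment };

struct Token {
    TokenType type;
    std::string value;       // raw text, or the inner expression/statement source
    bool trim_prev = false;  // '-' on the opening delimiter, e.g. "{{-"
    bool trim_next = false;  // '-' on the closing delimiter, e.g. "-}}"
};

struct Context;  // scoped variable store whose values are nlohmann::json

// Base AST node, matching the render(Context&, string& out) interface above.
struct Node {
    virtual ~Node() {}
    virtual void render(Context& ctx, std::string& out) const = 0;
};

struct TextNode : Node {
    std::string text;
    void render(Context&, std::string& out) const override { out += text; }
};

// Template::render conceptually walks the root nodes and appends to one buffer.
inline std::string render_all(const std::vector<std::unique_ptr<Node>>& roots,
                              Context& ctx) {
    std::string out;
    for (const auto& node : roots) node->render(ctx, out);
    return out;
}
```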

## Supported Features

### Filters

* **`tojson(indent=None)`**: Serializes a variable to a JSON string. Supports indentation.
* **`safe`**: Marks a string as safe. In this implementation it is a pass-through no-op, since HTML escaping is not enforced by default; it is kept for compatibility.
* **`string`**: Converts a value to its string representation.
* **`length`**: Returns the size of a list, string, or object.
* **`trim`**: Removes leading and trailing whitespace from a string.
* **`items`**: Returns a list of `[key, value]` pairs from a dictionary (useful for iterating over objects).
* **`capitalize`**: Capitalizes the first character of a string and lowercases the rest.
* **`lower`**: Converts a string to lowercase.
* **`upper`**: Converts a string to uppercase.
* **`map(attribute=name)`**: Extracts a specific attribute from each element in a list (e.g., `users | map(attribute='name')`).
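
As a quick illustration, the snippet below exercises a few of these filters through the public API shown in the README. The exact output formatting (for instance, the spacing inside the serialized list) may differ slightly from the comment.

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    // trim + upper on a string, length on a list, map(attribute=...) piped into tojson.
    jinja::Template tpl(
        "{{ title | trim | upper }} has {{ users | length }} users: "
        "{{ users | map(attribute='name') | tojson }}");

    nlohmann::json ctx;
    ctx["title"] = "  jinja.cpp  ";
    ctx["users"] = nlohmann::json::array({
        {{"name", "alice"}}, {{"name", "bob"}}
    });

    std::cout << tpl.render(ctx) << std::endl;
    // Roughly: JINJA.CPP has 2 users: ["alice", "bob"]
    return 0;
}
```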

### Global Functions

* **`range([start], stop, [step])`**: Generates a sequence of integers.
* **`namespace(...)`**: Creates a mutable object, useful for updating variables inside loops (e.g., `set ns.i = ns.i + 1`).
* **`strftime_now(format)`**: Returns the current time formatted according to the given string.
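
For example, `namespace` and `range` can be combined to accumulate a value across loop iterations. The snippet below is a sketch using the API shown in the README; the comment shows the expected result.

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    // ns.total survives across iterations because namespace() is mutable.
    jinja::Template tpl(
        "{% set ns = namespace(total=0) %}"
        "{% for i in range(3) %}{% set ns.total = ns.total + i %}{% endfor %}"
        "total={{ ns.total }}");

    std::cout << tpl.render(nlohmann::json::object()) << std::endl;
    // Expected: total=3
    return 0;
}
```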

### Tests (`is ...`)

* **`defined`**: Checks if a variable exists.
* **`undefined`**: Checks if a variable is not defined.
* **`none`**: Checks if a variable is null.
* **`boolean`**: Checks if a variable is a boolean.
* **`string`**: Checks if a variable is a string.
* **`number`**: Checks if a variable is a number.
* **`sequence` / `iterable`**: Checks if a variable is a list or string.
* **`mapping`**: Checks if a variable is an object/dictionary.
* **`true` / `false`**: Checks boolean value.
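
Chat templates typically use these tests to guard against missing variables. A small sketch (assuming a template instance can be rendered repeatedly):

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    jinja::Template tpl(
        "{% if user is defined and user is string %}Hello {{ user }}!"
        "{% else %}Hello stranger!{% endif %}");

    nlohmann::json ctx = nlohmann::json::object();
    std::cout << tpl.render(ctx) << std::endl;  // Hello stranger!

    ctx["user"] = "Ada";
    std::cout << tpl.render(ctx) << std::endl;  // Hello Ada!
    return 0;
}
```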

## Key Implementation Features

### 1. JSON Data Model

We use `nlohmann::json` as the unified data type for all variables. This simplifies type checking and allows easy integration with JSON-based LLM APIs.

### 2. Custom Function / Filter Dispatch

* **Filters**: Implemented in `FilterExpr`. Standard Jinja2 filters such as `safe`, `tojson`, `trim`, and `lower` are hardcoded.
* **Functions**: `CallExpr` handles global functions (`range`, `namespace`) and user-registered functions.
* **User hooks**: `Template::add_function` allows users to bind C++ lambdas to Jinja function calls (see the sketch below).
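
Judging from the `add_function` signature shown in the README, user hooks receive their arguments as a positional `std::vector<nlohmann::json>` and return a value convertible to `nlohmann::json`. The hypothetical `repeat` hook below illustrates that dispatch path; it is a sketch under those assumptions, not project code.

```cpp
#include "jinja.hpp"
#include <iostream>
#include <string>

int main() {
    jinja::Template tpl("{{ repeat('ab', 3) }}");

    // User hook: arguments arrive in call order as json values.
    tpl.add_function("repeat", [](const std::vector<nlohmann::json>& args) {
        std::string s = args.at(0).get<std::string>();
        int n = args.at(1).get<int>();
        std::string out;
        for (int i = 0; i < n; ++i) out += s;
        return out;
    });

    std::cout << tpl.render(nlohmann::json::object()) << std::endl;  // ababab
    return 0;
}
```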

### 3. `tojson` Serialization

Strict control over JSON serialization is critical for chat templates (e.g., tool definitions).
We implement a custom recursive serializer, `to_json_string` (in `src/jinja.cpp`), that:

* Supports indentation matching Python's output.
* **Sorts keys** in a specific order (`type` -> `function` -> `name` -> ...) to match common LLM training data formats, ensuring high consistency.
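
The actual ordering table lives in `to_json_string`; the fragment below only sketches the idea with an assumed priority list, keeping unknown keys in their original relative order.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Illustrative key ordering: known keys first, by fixed priority, then the rest.
static int key_rank(const std::string& key) {
    static const std::vector<std::string> priority = {"type", "function", "name"};
    auto it = std::find(priority.begin(), priority.end(), key);
    return it == priority.end() ? static_cast<int>(priority.size())
                                : static_cast<int>(it - priority.begin());
}

static void order_keys(std::vector<std::string>& keys) {
    // stable_sort keeps the relative order of keys that share a rank.
    std::stable_sort(keys.begin(), keys.end(),
                     [](const std::string& a, const std::string& b) {
                         return key_rank(a) < key_rank(b);
                     });
}
```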

### 4. Whitespace Control

Jinja2's `lstrip_blocks` and `trim_blocks` behavior is partially emulated in the Lexer. The manual whitespace-stripping logic (`trim_prev`, `trim_next`) ensures that the generated prompt doesn't contain excess newlines, which can affect LLM performance.
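
For example, the `-` modifier removes the whitespace in the adjacent text tokens (expected behavior sketched in the comments):

```cpp
#include "jinja.hpp"
#include <iostream>

int main() {
    nlohmann::json ctx;
    ctx["name"] = "World";

    jinja::Template plain("Hello   {{ name }} !");
    jinja::Template trimmed("Hello   {{- name -}} !");

    std::cout << plain.render(ctx) << std::endl;    // Hello   World !
    std::cout << trimmed.render(ctx) << std::endl;  // HelloWorld!
    return 0;
}
```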

### 5. C++11 Compatibility

To support a wide range of deployment environments:

* Structured bindings were replaced with standard iterators.
* A `std::make_unique` polyfill is used, since `std::make_unique` only arrived in C++14.
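
A typical shape for such a polyfill is shown below; the project's actual helper may use a different name or namespace.

```cpp
#include <memory>
#include <utility>

// C++11-compatible stand-in for std::make_unique (added to the standard in C++14).
template <typename T, typename... Args>
std::unique_ptr<T> make_unique_compat(Args&&... args) {
    return std::unique_ptr<T>(new T(std::forward<Args>(args)...));
}
```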

## Testing Strategy

* **Real data**: We use `tests/test_chat_template.json`, generated with the official Python `transformers` library for the supported models.
* **Fuzzy matching**: For dynamic content (such as dates), tests apply regex normalization so that results stay consistent across time and environments.
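
As an illustration of the normalization step, a test might replace date-like substrings with a placeholder on both sides before comparing. The regex and placeholder below are assumptions, not the project's actual test code.

```cpp
#include <regex>
#include <string>

// Replace ISO-style dates (e.g. 2025-12-16) with a fixed token before comparison.
static std::string normalize_dates(std::string text) {
    static const std::regex date_re(R"(\d{4}-\d{2}-\d{2})");
    return std::regex_replace(text, date_re, "<DATE>");
}

// Usage in a test: normalize_dates(rendered) == normalize_dates(reference)
```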
