Add NewBie-image Exp0.1 tutorials (#666)

mintlify[bot] · web-flow · commit 67727d968059 · 2025-12-23T01:19:53.000+08:00
* Update tutorials/image/newbie-image/newbie-image-exp-0-1.mdx

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

* Update zh-CN/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

* Update docs.json

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

* Update docs.json

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

* Update tutorials/image/newbie-image/newbie-image-exp-0-1.mdx

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

* Update zh-CN/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx

Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>

---------

Co-authored-by: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>
diff --git a/docs.json b/docs.json
@@ -164,6 +164,12 @@
                           "tutorials/image/ovis/ovis-image"
                         ]
                       },
+                      {
+                        "group": "NewBie-image",
+                        "pages": [
+                          "tutorials/image/newbie-image/newbie-image-exp-0-1"
+                        ]
+                      },
                       "tutorials/image/cosmos/cosmos-predict2-t2i",
                       "tutorials/image/omnigen/omnigen2"
                     ]
@@ -809,6 +815,12 @@
                           "zh-CN/tutorials/image/ovis/ovis-image"
                         ]
                       },
+                      {
+                        "group": "NewBie-image",
+                        "pages": [
+                          "zh-CN/tutorials/image/newbie-image/newbie-image-exp-0-1"
+                        ]
+                      },
                       "zh-CN/tutorials/image/cosmos/cosmos-predict2-t2i",
                       "zh-CN/tutorials/image/omnigen/omnigen2"
                     ]
diff --git a/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx b/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx
@@ -0,0 +1,128 @@
+---
+title: "ComfyUI NewBie-image-Exp0.1 Workflow Example"
+description: "NewBie-image-Exp0.1 is a 3.5B-parameter anime-style text-to-image model based on the Next-DiT architecture, optimized for high-quality anime image generation with XML-structured prompts."
+sidebarTitle: "NewBie-image-Exp0.1"
+---
+
+import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
+
+**NewBie-image-Exp0.1** is a 3.5B-parameter DiT model developed by NewBieAI Lab for anime-style text-to-image generation. It is built on the Next-DiT architecture and targets detailed anime images.
+
+**Key Features**:
+- **3.5B Parameter Model**: Efficient yet powerful model size for high-quality anime generation
+- **Next-DiT Architecture**: Builds on research from the Lumina architecture, with a newly designed NewBie architecture
+- **Dual Text Encoders**: Uses Gemma3-4B-it as the primary encoder, with Jina CLIP v2 for improved prompt understanding
+- **FLUX VAE**: Uses the FLUX.1-dev 16-channel VAE for richer colors and finer texture details
+- **XML Structured Prompts**: Supports XML format for better attention binding and attribute disentanglement
+
+**Related Links**:
+- [GitHub](https://github.com/NewBieAI-Lab/NewBie-image-Exp0.1)
+- [Hugging Face](https://huggingface.co/NewBie-AI/NewBie-image-Exp0.1)
+- [Getting Started Guide](https://ai.feishu.cn/wiki/NZl9wm7V1iuNzmkRKCUcb1USnsh)
+
+## NewBie-image text-to-image workflow
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_newbieimage_exp0_1-t2i.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold', marginRight: '10px'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p>
+</a>
+
+<a className="prose"  target='_blank'  href="https://cloud.comfy.org/?template=image_newbieimage_exp0_1-t2i&utm_source=docs" style={{ display: 'inline-block', backgroundColor: '#28a745', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Run on ComfyUI Cloud</p>
+</a>
+
+<UpdateReminder />
+
+## Model links
+
+**text_encoders**
+
+- [gemma_3_4b_it_bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/text_encoders/gemma_3_4b_it_bf16.safetensors)
+- [jina_clip_v2_bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/text_encoders/jina_clip_v2_bf16.safetensors)
+
+**diffusion_models**
+
+- [NewBie-Image-Exp0.1-bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/diffusion_models/NewBie-Image-Exp0.1-bf16.safetensors)
+
+**vae**
+
+- [ae.safetensors](https://huggingface.co/Comfy-Org/z_image_turbo/resolve/main/split_files/vae/ae.safetensors)
+
+**Model Storage Location**
+
+```
+ComfyUI/
+├── models/
+│   ├── text_encoders/
+│   │      ├── gemma_3_4b_it_bf16.safetensors
+│   │      └── jina_clip_v2_bf16.safetensors
+│   ├── diffusion_models/
+│   │      └── NewBie-Image-Exp0.1-bf16.safetensors
+│   └── vae/
+│          └── ae.safetensors
+```
+
+## Prompt format
+
+NewBie-image is an anime image generation model optimized for character generation. It was trained on XML-structured prompts: each opening tag, such as `<appearance>` or `<clothing>`, names a category, and the matching closing tag (for example `</appearance>`) ends it. The values inside each element are standard Danbooru tags. This structure gives more precise control over multi-character scenes, with better attribute binding.
+
+For the complete prompt writing guide, see the [official documentation](https://ai.feishu.cn/wiki/NZl9wm7V1iuNzmkRKCUcb1USnsh).
+
+NewBie-image-Exp0.1 supports three prompt formats:
+- **Natural language**: Standard text descriptions
+- **Tags**: Danbooru-style tags
+- **XML structured format**: Recommended for multi-character scenes
+
+### XML structured prompt
+
+For multi-character scenes, XML structured prompts typically produce more accurate results, with better attention binding and attribute disentanglement.
+
+```xml
+<character_1>
+<n>$character_1$</n>
+<gender>1girl</gender>
+<appearance>chibi, red_eyes, blue_hair, long_hair, hair_between_eyes, head_tilt, tareme, closed_mouth</appearance>
+<clothing>school_uniform, serafuku, white_sailor_collar, white_shirt, short_sleeves, red_neckerchief, bow, blue_skirt, miniskirt, pleated_skirt, blue_hat, mini_hat, thighhighs, grey_thighhighs, black_shoes, mary_janes</clothing>
+<expression>happy, smile</expression>
+<action>standing, holding, holding_briefcase</action>
+<position>center_left</position>
+</character_1>
+
+<character_2>
+<n>$character_2$</n>
+<gender>1girl</gender>
+<appearance>chibi, red_eyes, pink_hair, long_hair, very_long_hair, multi-tied_hair, open_mouth</appearance>
+<clothing>school_uniform, serafuku, white_sailor_collar, white_shirt, short_sleeves, red_neckerchief, bow, red_skirt, miniskirt, pleated_skirt, hair_bow, multiple_hair_bows, white_bow, ribbon_trim, ribbon-trimmed_bow, white_thighhighs, black_shoes, mary_janes, bow_legwear, bare_arms</clothing>
+<expression>happy, smile</expression>
+<action>standing, holding, holding_briefcase, waving</action>
+<position>center_right</position>
+</character_2>
+
+<general_tags>
+<count>2girls, multiple_girls</count>
+<style>anime_style, digital_art</style>
+<background>white_background, simple_background</background>
+<atmosphere>cheerful</atmosphere>
+<quality>high_resolution, detailed</quality>
+<objects>briefcase</objects>
+<other>alternate_costume</other>
+</general_tags>
+```
+
+### XML tag reference
+
+| Tag | Description |
+|-----|-------------|
+| `<n>` | Character name/identifier |
+| `<gender>` | Character gender (1girl, 1boy, etc.) |
+| `<appearance>` | Physical features (hair, eyes, body type) |
+| `<clothing>` | Outfit and accessories |
+| `<expression>` | Facial expression |
+| `<action>` | Pose and actions |
+| `<position>` | Position in the image |
+| `<count>` | Number of characters |
+| `<style>` | Art style |
+| `<background>` | Background description |
+| `<atmosphere>` | Overall mood |
+| `<quality>` | Quality tags |
+| `<objects>` | Objects in the scene |
+| `<other>` | Additional tags |
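Note on the XML prompt format added in this file: the structure in the example prompt (one `<character_N>` block per character, followed by a single `<general_tags>` block) can also be assembled programmatically. The following is a minimal sketch, not part of the tutorial or of any NewBie or ComfyUI API; the helper names `build_element` and `build_prompt` are hypothetical, and the sketch assumes only the layout shown in the tutorial example (comma-separated Danbooru tags inside each child element).

```python
from typing import Mapping, Sequence


def build_element(tag: str, children: Mapping[str, Sequence[str]]) -> str:
    """Render one block such as <character_1> or <general_tags>.

    Hypothetical helper: each key of `children` becomes a child element,
    and its tags are joined with ", " as in the tutorial example.
    """
    lines = [f"<{tag}>"]
    for name, tags in children.items():
        lines.append(f"<{name}>{', '.join(tags)}</{name}>")
    lines.append(f"</{tag}>")
    return "\n".join(lines)


def build_prompt(
    characters: Sequence[Mapping[str, Sequence[str]]],
    general: Mapping[str, Sequence[str]],
) -> str:
    """Join numbered character blocks and the general block with blank lines."""
    blocks = [
        build_element(f"character_{index}", fields)
        for index, fields in enumerate(characters, start=1)
    ]
    blocks.append(build_element("general_tags", general))
    return "\n\n".join(blocks)


prompt = build_prompt(
    characters=[
        {
            "n": ["$character_1$"],
            "gender": ["1girl"],
            "appearance": ["blue_hair", "red_eyes"],
            "position": ["center_left"],
        }
    ],
    general={"count": ["1girl"], "background": ["white_background"]},
)
print(prompt)
```

The resulting string can be pasted into the workflow's text prompt field. Dictionaries preserve insertion order, so the child elements appear in the order they are written.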
diff --git a/zh-CN/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx b/zh-CN/tutorials/image/newbie-image/newbie-image-exp-0-1.mdx
@@ -0,0 +1,128 @@
+---
+title: "ComfyUI NewBie-image-Exp0.1 工作流示例"
+description: "NewBie-image-Exp0.1 是一个基于 Next-DiT 架构的 35 亿参数动漫风格文生图模型，针对高质量动漫图像生成进行了优化，支持 XML 结构化提示词。"
+sidebarTitle: "NewBie-image-Exp0.1"
+---
+
+import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
+
+**NewBie-image-Exp0.1** 是由 NewBieAI Lab 开发的 35 亿参数 DiT 模型，专为动漫风格文生图设计。基于 Next-DiT 架构构建，能够生成细节丰富、视觉效果出色的动漫图像。
+
+**核心特性**：
+- **35 亿参数模型**：高效且强大的模型规模，适合高质量动漫生成
+- **Next-DiT 架构**：基于 Lumina 架构研究，采用全新设计的 NewBie 架构
+- **双文本编码器**：使用 Gemma3-4B-it 作为主编码器，配合 Jina CLIP v2 提升提示词理解能力
+- **FLUX VAE**：采用 FLUX.1-dev 16 通道 VAE，呈现更丰富的色彩和更精细的纹理细节
+- **XML 结构化提示词**：支持 XML 格式，实现更好的注意力绑定和属性解耦
+
+**相关链接**：
+- [GitHub](https://github.com/NewBieAI-Lab/NewBie-image-Exp0.1)
+- [Hugging Face](https://huggingface.co/NewBie-AI/NewBie-image-Exp0.1)
+- [入门指南](https://ai.feishu.cn/wiki/P3sgwUUjWih8ZWkpr0WcwXSMnTb)
+
+## NewBie-image 文生图工作流
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_newbieimage_exp0_1-t2i.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold', marginRight: '10px'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>下载 JSON 工作流文件</p>
+</a>
+
+<a className="prose"  target='_blank'  href="https://cloud.comfy.org/?template=image_newbieimage_exp0_1-t2i&utm_source=docs" style={{ display: 'inline-block', backgroundColor: '#28a745', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>在 ComfyUI Cloud 上运行</p>
+</a>
+
+<UpdateReminder />
+
+## 模型下载链接
+
+**text_encoders**
+
+- [gemma_3_4b_it_bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/text_encoders/gemma_3_4b_it_bf16.safetensors)
+- [jina_clip_v2_bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/text_encoders/jina_clip_v2_bf16.safetensors)
+
+**diffusion_models**
+
+- [NewBie-Image-Exp0.1-bf16.safetensors](https://huggingface.co/Comfy-Org/NewBie-image-Exp0.1_repackaged/resolve/main/split_files/diffusion_models/NewBie-Image-Exp0.1-bf16.safetensors)
+
+**vae**
+
+- [ae.safetensors](https://huggingface.co/Comfy-Org/z_image_turbo/resolve/main/split_files/vae/ae.safetensors)
+
+**模型存放位置**
+
+```
+ComfyUI/
+├── models/
+│   ├── text_encoders/
+│   │      ├── gemma_3_4b_it_bf16.safetensors
+│   │      └── jina_clip_v2_bf16.safetensors
+│   ├── diffusion_models/
+│   │      └── NewBie-Image-Exp0.1-bf16.safetensors
+│   └── vae/
+│          └── ae.safetensors
+```
+
+## 提示词格式
+
+NewBie-image 是一个针对角色生成优化的动漫图像生成模型。它使用 XML 结构化提示词进行训练，每个 `<>` 标签定义一个类别（如 `<appearance>`、`<clothing>`），`</>` 作为结束标记。标签内部使用标准的 Danbooru 标签。这种结构能够精确控制多角色场景，实现更好的属性绑定。
+
+完整的提示词编写指南请参阅[官方文档](https://ai.feishu.cn/wiki/NZl9wm7V1iuNzmkRKCUcb1USnsh)。
+
+NewBie-image-Exp0.1 支持三种提示词格式：
+- **自然语言**：标准文本描述
+- **标签**：Danbooru 风格标签
+- **XML 结构化格式**：推荐用于多角色场景
+
+### XML 结构化提示词
+
+对于多角色场景，使用 XML 结构化提示词通常能获得更准确的图像生成结果，具有更好的注意力绑定和属性解耦效果。
+
+```xml
+<character_1>
+<n>$character_1$</n>
+<gender>1girl</gender>
+<appearance>chibi, red_eyes, blue_hair, long_hair, hair_between_eyes, head_tilt, tareme, closed_mouth</appearance>
+<clothing>school_uniform, serafuku, white_sailor_collar, white_shirt, short_sleeves, red_neckerchief, bow, blue_skirt, miniskirt, pleated_skirt, blue_hat, mini_hat, thighhighs, grey_thighhighs, black_shoes, mary_janes</clothing>
+<expression>happy, smile</expression>
+<action>standing, holding, holding_briefcase</action>
+<position>center_left</position>
+</character_1>
+
+<character_2>
+<n>$character_2$</n>
+<gender>1girl</gender>
+<appearance>chibi, red_eyes, pink_hair, long_hair, very_long_hair, multi-tied_hair, open_mouth</appearance>
+<clothing>school_uniform, serafuku, white_sailor_collar, white_shirt, short_sleeves, red_neckerchief, bow, red_skirt, miniskirt, pleated_skirt, hair_bow, multiple_hair_bows, white_bow, ribbon_trim, ribbon-trimmed_bow, white_thighhighs, black_shoes, mary_janes, bow_legwear, bare_arms</clothing>
+<expression>happy, smile</expression>
+<action>standing, holding, holding_briefcase, waving</action>
+<position>center_right</position>
+</character_2>
+
+<general_tags>
+<count>2girls, multiple_girls</count>
+<style>anime_style, digital_art</style>
+<background>white_background, simple_background</background>
+<atmosphere>cheerful</atmosphere>
+<quality>high_resolution, detailed</quality>
+<objects>briefcase</objects>
+<other>alternate_costume</other>
+</general_tags>
+```
+
+### XML 标签参考
+
+| 标签 | 描述 |
+|-----|-------------|
+| `<n>` | 角色名称/标识符 |
+| `<gender>` | 角色性别（1girl、1boy 等） |
+| `<appearance>` | 外貌特征（头发、眼睛、体型） |
+| `<clothing>` | 服装和配饰 |
+| `<expression>` | 面部表情 |
+| `<action>` | 姿势和动作 |
+| `<position>` | 图像中的位置 |
+| `<count>` | 角色数量 |
+| `<style>` | 艺术风格 |
+| `<background>` | 背景描述 |
+| `<atmosphere>` | 整体氛围 |
+| `<quality>` | 质量标签 |
+| `<objects>` | 场景中的物品 |
+| `<other>` | 其他标签 |
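Note on the model storage layout documented in both tutorial files: a quick way to confirm that the four downloaded files sit where the "Model Storage Location" tree expects them is a small path check. This is a sketch under assumptions, not an official ComfyUI tool; the function name `missing_model_files` is hypothetical, and the ComfyUI root path passed to it is whatever directory the reader installed ComfyUI into.

```python
from pathlib import Path

# File names and subfolders copied from the tutorial's storage tree.
EXPECTED_FILES = {
    "text_encoders": [
        "gemma_3_4b_it_bf16.safetensors",
        "jina_clip_v2_bf16.safetensors",
    ],
    "diffusion_models": ["NewBie-Image-Exp0.1-bf16.safetensors"],
    "vae": ["ae.safetensors"],
}


def missing_model_files(comfyui_root: str) -> list[Path]:
    """Return the expected model paths that do not exist under <root>/models."""
    models_dir = Path(comfyui_root) / "models"
    missing = []
    for subdir, names in EXPECTED_FILES.items():
        for name in names:
            path = models_dir / subdir / name
            if not path.is_file():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # "ComfyUI" is an assumed relative install path; adjust as needed.
    for path in missing_model_files("ComfyUI"):
        print(f"missing: {path}")
```

An empty result means every file from the tree was found; the check does not verify file contents or sizes.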