# Copyright (c) 2025 Bytedance Ltd. and/or its affiliates
# Licensed under the 【火山方舟】原型应用软件自用许可协议
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
# https://www.volcengine.com/docs/82379/1433703
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

| 12 | +""" |
| 13 | +DeepDoubao |
| 14 | +""" |
| 15 | + |
import logging
import os
from typing import AsyncIterable, Union

from arkitect.core.component.llm import BaseChatLanguageModel
from arkitect.core.component.llm.model import (
    ArkChatCompletionChunk,
    ArkChatParameters,
    ArkChatRequest,
    ArkChatResponse,
    ArkMessage,
    BotUsage,
    Response,
)
from arkitect.launcher.local.serve import launch_serve
from arkitect.telemetry.trace import task
from volcenginesdkarkruntime.types.completion_usage import (
    CompletionTokensDetails,
    CompletionUsage,
    PromptTokensDetails,
)

logger = logging.getLogger(__name__)

DEEPSEEK_R1_ENDPOINT = "<ENDPOINT_ID_FOR_DEEPSEEK_R1>"
DOUBAO_ENDPOINT = "<ENDPOINT_ID_FOR_DOUBAO>"
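# The two placeholders above must be replaced with real Ark endpoint IDs
# before deploying. One optional pattern (an assumption, not something
# arkitect requires) is to read them from the environment instead:
#   DEEPSEEK_R1_ENDPOINT = os.getenv("DEEPSEEK_R1_ENDPOINT", "<ENDPOINT_ID_FOR_DEEPSEEK_R1>")
#   DOUBAO_ENDPOINT = os.getenv("DOUBAO_ENDPOINT", "<ENDPOINT_ID_FOR_DOUBAO>")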


def merge_usage(usage1: CompletionUsage, usage2: CompletionUsage) -> CompletionUsage:
    """Sum two CompletionUsage objects, preserving optional token details."""
    usage = CompletionUsage(
        prompt_tokens=usage1.prompt_tokens + usage2.prompt_tokens,
        completion_tokens=usage1.completion_tokens + usage2.completion_tokens,
        total_tokens=usage1.total_tokens + usage2.total_tokens,
    )
    if usage1.prompt_tokens_details or usage2.prompt_tokens_details:
        # Either side may lack details entirely; treat a missing side as None
        # rather than dereferencing it (the unguarded access would raise).
        pd1, pd2 = usage1.prompt_tokens_details, usage2.prompt_tokens_details
        c1 = pd1.cached_tokens if pd1 else None
        c2 = pd2.cached_tokens if pd2 else None
        usage.prompt_tokens_details = PromptTokensDetails(
            cached_tokens=c1 if c2 is None else c2 if c1 is None else c1 + c2
        )
    if usage1.completion_tokens_details or usage2.completion_tokens_details:
        cd1, cd2 = usage1.completion_tokens_details, usage2.completion_tokens_details
        r1 = cd1.reasoning_tokens if cd1 else None
        r2 = cd2.reasoning_tokens if cd2 else None
        usage.completion_tokens_details = CompletionTokensDetails(
            reasoning_tokens=r1 if r2 is None else r2 if r1 is None else r1 + r2
        )
    return usage
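# Illustrative only (hypothetical numbers): merging sums each counter, e.g.
#   merge_usage(
#       CompletionUsage(prompt_tokens=10, completion_tokens=0, total_tokens=10),
#       CompletionUsage(prompt_tokens=20, completion_tokens=7, total_tokens=27),
#   )
#   -> CompletionUsage(prompt_tokens=30, completion_tokens=7, total_tokens=37)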


@task()
async def default_model_calling(
    request: ArkChatRequest,
) -> AsyncIterable[Union[ArkChatCompletionChunk, ArkChatResponse]]:
    # Stage 1: run DeepSeek-R1 to produce reasoning content only.
    parameters_r1 = ArkChatParameters(**request.__dict__)
    parameters_r1.max_tokens = (
        1  # Set max_tokens to 1, so the R1 model will only output reasoning content.
    )
    deepseek = BaseChatLanguageModel(
        endpoint_id=DEEPSEEK_R1_ENDPOINT,
        messages=request.messages,
        parameters=parameters_r1,
    )
    reasoning_content = ""
    reasoning_usage = CompletionUsage(
        prompt_tokens=0,
        completion_tokens=0,
        total_tokens=0,
    )
    if request.stream:
        # Relay R1's reasoning chunks to the client while accumulating them
        # for the second model call.
        async for chunk in deepseek.astream():
            if chunk.usage:
                reasoning_usage = chunk.usage
            if len(chunk.choices) > 0 and chunk.choices[0].delta.reasoning_content:
                yield chunk
                reasoning_content += chunk.choices[0].delta.reasoning_content
    else:
        response = await deepseek.arun()
        reasoning_content = response.choices[0].message.reasoning_content
        if response.usage:
            reasoning_usage = response.usage

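    # Stage 2: hand R1's reasoning to Doubao as a partial assistant message.
    # The Chinese framing below translates roughly to "The thinking process is
    # as follows: ... Based on the thinking process above, give a complete
    # answer:".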
    parameters_doubao = ArkChatParameters(**request.__dict__)
    doubao = BaseChatLanguageModel(
        endpoint_id=DOUBAO_ENDPOINT,
        messages=request.messages
        + [
            ArkMessage(
                role="assistant",
                content="思考过程如下:\n"
                + reasoning_content
                + "\n请根据以上思考过程,给出完整的回答:\n",
            )
        ],
        parameters=parameters_doubao,
    )
    if request.stream:
        async for chunk in doubao.astream():
            if chunk.usage:
                # Expose per-model usage alongside the merged total.
                chunk.bot_usage = BotUsage(model_usage=[reasoning_usage, chunk.usage])
                chunk.usage = merge_usage(chunk.usage, reasoning_usage)
            yield chunk
    else:
        response = await doubao.arun()
        # Attach R1's reasoning so the non-streaming response carries it too.
        response.choices[0].message.reasoning_content = reasoning_content
        if response.usage:
            response.bot_usage = BotUsage(model_usage=[reasoning_usage, response.usage])
            response.usage = merge_usage(response.usage, reasoning_usage)
        yield response


@task()
async def main(request: ArkChatRequest) -> AsyncIterable[Response]:
    async for resp in default_model_calling(request):
        yield resp


if __name__ == "__main__":
    port = os.getenv("_FAAS_RUNTIME_PORT")
    launch_serve(
        package_path="main",
        port=int(port) if port else 8888,
        health_check_path="/v1/ping",
        endpoint_path="/api/v3/bots/chat/completions",
    )
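# A minimal local smoke test (a sketch, not a verified request: the accepted
# body is whatever ArkChatRequest parses, and "deep-doubao" is a placeholder
# model name):
#   curl -X POST http://localhost:8888/api/v3/bots/chat/completions \
#     -H "Content-Type: application/json" \
#     -d '{"model": "deep-doubao", "stream": false,
#          "messages": [{"role": "user", "content": "Which is larger, 9.11 or 9.8?"}]}'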