Idea2Paper/README-zh_CN.md at main · drhiidden/Idea2Paper

📄 论文

Idea2Story（Idea2Paper 的核心子模块）

🌐 arXiv：https://arxiv.org/abs/2601.20833
📘 PDF：papers/Idea2Story.pdf

Idea2Story 提出了一种以“预计算”为核心驱动的自动化科研叙事生成框架，将对学术文献的理解从运行时推理（runtime reasoning）前移至离线的知识图谱构建阶段，从而实现更加高效、稳定且可审计的自主科学发现流程。

📖 项目概述

Idea2Paper 是一个把你的研究想法（Idea）自动变成“可投稿论文的 Story（论文叙事骨架）”的端到端流水线。它集成了知识图谱召回、Pattern 选择、Story 生成、可标定 Multi-Agent Review 以及 RAG 查重与智能修正等功能。

Idea2Paper 是一个面向端到端科研流程的总体研究智能体项目。
Idea2Story 是 Idea2Paper 中的核心子模块，专注于将尚不充分定义的科研想法自动转化为结构完整、可直接投稿的学术论文叙事框架。

核心路径

仓库核心路径：Paper-KG-Pipeline/

运行入口（不变）

python Paper-KG-Pipeline/scripts/idea2story_pipeline.py "your idea"

Idea2Paper 流水线架构

✨ 核心特性

🕸️ 知识图谱：从 ICLR 数据构建 Idea/Pattern/Domain/Paper 节点（当前导出规模示例：Idea 8,284 / Pattern 124 / Domain 98 / Paper 8,285）。
🎣 三路召回 + 两阶段加速：Idea 相似 / Domain 泛化 / Paper 相似；粗排（Jaccard）+ 精排（Embedding）。
📝 Idea2Story 生成链路：Pattern 选择 → Story 生成 → 评审（Anchored Multi‑Agent）→ 智能修正（含 Novelty 模式）。
🤖 可标定多智能体评审：使用论文图谱中的真实 review_stats 作为锚点（anchors），LLM 输出相对比较结果，由确定性算法拟合最终 1~10 分，过程可审计。
📊 完整运行日志与审计：每次 run 建立独立日志目录，记录 events、LLM/embedding 调用输入输出，便于回放与审计。

📦 你将得到（输出）

📄 Paper-KG-Pipeline/output/final_story.json：最终 Story（结构化字段：标题/摘要/问题/方法/贡献/实验等）
🔍 Paper-KG-Pipeline/output/pipeline_result.json：完整链路结果（包含评审、修正、查重、审计信息）
📂 Paper-KG-Pipeline/log/run_.../：每次运行的结构化运行日志

🚀 快速开始

Python 3.10+

安装依赖：

pip install -r Paper-KG-Pipeline/requirements.txt

配置：
- 复制 .env.example -> .env，填写 SILICONFLOW_API_KEY 等敏感键（不要提交）
- 可选：复制 i2p_config.example.json -> i2p_config.json 调整阈值/anchors 等

注意： 当前 embedding 模型暂时固定为 Qwen/Qwen3-Embedding-8B（SiliconFlow），暂不支持切换。后续会扩展支持更多 embedding 模型/服务商。

运行：

python Paper-KG-Pipeline/scripts/idea2story_pipeline.py "你的研究Idea描述"

🌐 前端（本地 Web UI）

运行一个极简的本地 UI，用于启动 pipeline，并且只展示高层阶段信息与最终结果（不在页面上展示原始日志内容）。

启动

python frontend/server/app.py --host 127.0.0.1 --port 8080

在浏览器中打开：

http://127.0.0.1:8080/

你可以在 UI 中做什么

从网页运行同一个 pipeline 入口 (idea2story_pipeline.py) 。
为本次运行配置 SILICONFLOW_API_KEY, LLM_API_URL, LLM_MODEL （服务端不会持久化保存）。
开关 Novelty / Verification.
一键下载本次运行的日志（zip）。

更多说明见 frontend/README.md.

🤖 Multi‑Agent Review（可标定、可追溯）

核心思想：用真实论文评分分布作为锚点，LLM 做相对比较（better/tie/worse + confidence + rationale），最终分数由确定性算法拟合，使评审结果可复现、可审计。详见仓库 MULTIAGENT_REVIEW.md（或 Paper-KG-Pipeline/docs 中相应文档）。

📚 Files & Docs (Important Paths)

Core code: Paper-KG-Pipeline/src/idea2paper/
Entry scripts:
- Paper-KG-Pipeline/scripts/idea2story_pipeline.py
- Paper-KG-Pipeline/scripts/simple_recall_demo.py
Scripts layout:
- Paper-KG-Pipeline/scripts/tools/ (build/ETL tools)
- Paper-KG-Pipeline/scripts/demos/ (examples)
- Legacy paths still work (root scripts are thin wrappers)
Docs:

No.	Document	Content	Target Audience
0	Project Overview	Overall architecture, core modules, parameter configuration, execution workflow	Everyone
1	Knowledge Graph Construction	Data sources, node/edge definitions, LLM enhancement, how to run	Developers
2	Retrieval System	Three-way retrieval strategies, similarity computation, performance optimization	Developers
3	Idea2Story Pipeline	Pattern selection, Idea fusion, story reflection, critic review	Developers

Multi-Agent details: MULTIAGENT_REVIEW.md

🤝 Contributing / License

欢迎 PR / Issue。遵循 repo 中的贡献指南与 Code of Conduct。默认 MIT 许可（见 LICENSE）。

🙏 致谢 / Credits

数据来源：ICLR（见 docs 中 KG 构建说明）
设计灵感：以可审计的真实锚点为中心的评审流程
社区支持：agentAlpha 社区

👥 贡献者 / Contributors

📑 引用（Idea2Story）

如果你在研究或项目中使用了 Idea2Story，请按如下方式引用：

@misc{xu2026idea2storyautomatedpipelinetransforming,
  title={Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives},
  author={Tengyue Xu and Zhuoyang Qian and Gaoge Liu and Li Ling and Zhentao Zhang and Biao Wu and Shuo Zhang and Ke Lu and Wei Shi and Ziqi Wang and Zheng Feng and Yan Luo and Shu Xu and Yongjin Chen and Zhibo Feng and Zhuo Chen and Bruce Yuan and Harry Wang and Kris Chen},
  year={2026},
  eprint={2601.20833},
  archivePrefix={arXiv},
  primaryClass={cs.CE},
  url={https://arxiv.org/abs/2601.20833}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

📄 论文

Idea2Story（Idea2Paper 的核心子模块）

📖 项目概述

核心路径

运行入口（不变）

✨ 核心特性

📦 你将得到（输出）

🚀 快速开始

🌐 前端（本地 Web UI）

启动

你可以在 UI 中做什么

🤖 Multi‑Agent Review（可标定、可追溯）

📚 Files & Docs (Important Paths)

🤝 Contributing / License

🙏 致谢 / Credits

👥 贡献者 / Contributors

📑 引用（Idea2Story）

📈 Star History

FilesExpand file tree

README-zh_CN.md

Latest commit

History

README-zh_CN.md

File metadata and controls

📄 论文

Idea2Story（Idea2Paper 的核心子模块）

📖 项目概述

核心路径

运行入口（不变）

✨ 核心特性

📦 你将得到（输出）

🚀 快速开始

🌐 前端（本地 Web UI）

启动

你可以在 UI 中做什么

🤖 Multi‑Agent Review（可标定、可追溯）

📚 Files & Docs (Important Paths)

🤝 Contributing / License

🙏 致谢 / Credits

👥 贡献者 / Contributors

📑 引用（Idea2Story）

📈 Star History