Skip to content

Commit 1b4bcc2

Browse files
authored
feat: update graph rag prompt (#1429)
* feat: update graph rag prompt * feat: update
1 parent a0d025a commit 1b4bcc2

File tree

5 files changed

+48
-11
lines changed

5 files changed

+48
-11
lines changed

aperag/api/components/schemas/collection.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -116,8 +116,8 @@ knowledgeGraphConfig:
116116
language:
117117
type: string
118118
description: Language for entity extraction and query responses
119-
default: "English"
120-
example: "English"
119+
default: "simplified chinese"
120+
example: "simplified chinese"
121121
entity_types:
122122
type: array
123123
items:
@@ -157,7 +157,7 @@ collectionConfig:
157157
allOf:
158158
- $ref: '#/knowledgeGraphConfig'
159159
default:
160-
language: "English"
160+
language: "simplified chinese"
161161
entity_types: ["organization", "person", "geo", "event", "product", "technology", "date", "category"]
162162
embedding:
163163
$ref: './model.yaml#/modelSpec'

aperag/graph/lightrag/prompt.py

Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -171,6 +171,43 @@
171171
("relationship"{tuple_delimiter}"Noah Carter"{tuple_delimiter}"Carbon-Fiber Spikes"{tuple_delimiter}"Noah Carter used carbon-fiber spikes to enhance performance during the race."{tuple_delimiter}"athletic equipment, performance boost"{tuple_delimiter}7){record_delimiter}
172172
("relationship"{tuple_delimiter}"World Athletics Federation"{tuple_delimiter}"100m Sprint Record"{tuple_delimiter}"The World Athletics Federation is responsible for validating and recognizing new sprint records."{tuple_delimiter}"sports regulation, record certification"{tuple_delimiter}9){record_delimiter}
173173
("content_keywords"{tuple_delimiter}"athletics, sprinting, record-breaking, sports technology, competition"){completion_delimiter}
174+
#############################""",
175+
"""Example 4:
176+
177+
Entity_types: [organization, person, product, technology, location, event]
178+
Text:
179+
```
180+
在杭州召开的2024年度人工智能发展论坛上,云智科技公司CEO李明发表了题为"智能计算新纪元"的主题演讲。他宣布云智科技将推出全新的星云X1 AI芯片,该芯片采用先进的5纳米工艺,算力相比上一代提升了60%。
181+
182+
李明在演讲中指出,边缘计算与大模型的结合将是未来技术发展的重要方向。云智科技研发团队已经在苏州研发中心完成了基于星云芯片的多模态模型训练测试,结果显示新芯片在图像识别和自然语言理解任务中表现出色。
183+
184+
论坛期间,云智科技与江南理工大学签署了产学研合作协议,双方将在AI芯片架构设计和算法优化领域开展联合研究。江南理工大学计算机学院院长王芳教授表示,这次合作将为培养AI领域高端人才提供重要平台。
185+
```
186+
187+
Output:
188+
("entity"{tuple_delimiter}"2024年度人工智能发展论坛"{tuple_delimiter}"event"{tuple_delimiter}"2024年度人工智能发展论坛是在杭州召开的AI行业重要会议,聚焦人工智能技术发展和应用。"){record_delimiter}
189+
("entity"{tuple_delimiter}"杭州"{tuple_delimiter}"location"{tuple_delimiter}"杭州是2024年度人工智能发展论坛的举办城市。"){record_delimiter}
190+
("entity"{tuple_delimiter}"云智科技"{tuple_delimiter}"organization"{tuple_delimiter}"云智科技是一家专注于AI芯片和智能计算技术研发的科技公司。"){record_delimiter}
191+
("entity"{tuple_delimiter}"李明"{tuple_delimiter}"person"{tuple_delimiter}"李明是云智科技公司的CEO,在人工智能发展论坛上发表了关于智能计算的主题演讲。"){record_delimiter}
192+
("entity"{tuple_delimiter}"星云X1"{tuple_delimiter}"product"{tuple_delimiter}"星云X1是云智科技推出的新一代AI芯片,采用5纳米工艺,算力提升60%。"){record_delimiter}
193+
("entity"{tuple_delimiter}"5纳米工艺"{tuple_delimiter}"technology"{tuple_delimiter}"5纳米工艺是星云X1芯片采用的先进半导体制造技术。"){record_delimiter}
194+
("entity"{tuple_delimiter}"边缘计算"{tuple_delimiter}"technology"{tuple_delimiter}"边缘计算是一种分布式计算架构,与大模型结合被认为是未来技术发展方向。"){record_delimiter}
195+
("entity"{tuple_delimiter}"苏州研发中心"{tuple_delimiter}"location"{tuple_delimiter}"苏州研发中心是云智科技的研发基地,多模态模型训练测试在此完成。"){record_delimiter}
196+
("entity"{tuple_delimiter}"多模态模型"{tuple_delimiter}"technology"{tuple_delimiter}"多模态模型是基于星云芯片训练的AI技术,用于图像识别和自然语言理解。"){record_delimiter}
197+
("entity"{tuple_delimiter}"江南理工大学"{tuple_delimiter}"organization"{tuple_delimiter}"江南理工大学是与云智科技签署产学研合作协议的高校,在AI领域开展联合研究。"){record_delimiter}
198+
("entity"{tuple_delimiter}"江南理工大学计算机学院"{tuple_delimiter}"organization"{tuple_delimiter}"江南理工大学计算机学院是江南理工大学的二级学院,参与AI芯片研究合作。"){record_delimiter}
199+
("entity"{tuple_delimiter}"王芳"{tuple_delimiter}"person"{tuple_delimiter}"王芳是江南理工大学计算机学院院长,负责与云智科技的产学研合作项目。"){record_delimiter}
200+
("relationship"{tuple_delimiter}"李明"{tuple_delimiter}"云智科技"{tuple_delimiter}"李明是云智科技的CEO,负责公司战略决策和对外发言。"{tuple_delimiter}"企业领导, 战略管理"{tuple_delimiter}10){record_delimiter}
201+
("relationship"{tuple_delimiter}"云智科技"{tuple_delimiter}"星云X1"{tuple_delimiter}"云智科技研发并推出了星云X1 AI芯片产品。"{tuple_delimiter}"产品研发, 技术创新"{tuple_delimiter}10){record_delimiter}
202+
("relationship"{tuple_delimiter}"星云X1"{tuple_delimiter}"5纳米工艺"{tuple_delimiter}"星云X1芯片采用5纳米工艺制造技术。"{tuple_delimiter}"技术应用, 制造工艺"{tuple_delimiter}9){record_delimiter}
203+
("relationship"{tuple_delimiter}"李明"{tuple_delimiter}"2024年度人工智能发展论坛"{tuple_delimiter}"李明在2024年度人工智能发展论坛上发表主题演讲。"{tuple_delimiter}"会议演讲, 行业交流"{tuple_delimiter}8){record_delimiter}
204+
("relationship"{tuple_delimiter}"2024年度人工智能发展论坛"{tuple_delimiter}"杭州"{tuple_delimiter}"2024年度人工智能发展论坛在杭州举办。"{tuple_delimiter}"会议地点, 地理位置"{tuple_delimiter}8){record_delimiter}
205+
("relationship"{tuple_delimiter}"云智科技"{tuple_delimiter}"苏州研发中心"{tuple_delimiter}"云智科技在苏州研发中心进行AI芯片研发和模型训练测试。"{tuple_delimiter}"研发基地, 技术测试"{tuple_delimiter}9){record_delimiter}
206+
("relationship"{tuple_delimiter}"星云X1"{tuple_delimiter}"多模态模型"{tuple_delimiter}"星云X1芯片被用于多模态模型的训练,在图像和语言任务中表现出色。"{tuple_delimiter}"技术应用, 性能验证"{tuple_delimiter}9){record_delimiter}
207+
("relationship"{tuple_delimiter}"云智科技"{tuple_delimiter}"江南理工大学"{tuple_delimiter}"云智科技与江南理工大学签署产学研合作协议,在AI领域开展联合研究。"{tuple_delimiter}"产学研合作, 战略协议"{tuple_delimiter}10){record_delimiter}
208+
("relationship"{tuple_delimiter}"王芳"{tuple_delimiter}"江南理工大学计算机学院"{tuple_delimiter}"王芳担任江南理工大学计算机学院院长。"{tuple_delimiter}"学术领导, 学院管理"{tuple_delimiter}10){record_delimiter}
209+
("relationship"{tuple_delimiter}"江南理工大学计算机学院"{tuple_delimiter}"江南理工大学"{tuple_delimiter}"江南理工大学计算机学院是江南理工大学的下属学院。"{tuple_delimiter}"组织从属, 学术机构"{tuple_delimiter}10){record_delimiter}
210+
("content_keywords"{tuple_delimiter}"人工智能, 芯片研发, 产学研合作, 边缘计算, 多模态技术"){completion_delimiter}
174211
#############################""",
175212
]
176213

aperag/graph/lightrag_manager.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ class LightRAGConfig:
4747
SUMMARY_TO_MAX_TOKENS = 2000
4848
FORCE_LLM_SUMMARY_ON_MERGE = 10
4949
EMBEDDING_MAX_TOKEN_SIZE = 8192
50-
DEFAULT_LANGUAGE = "The same language like input text"
50+
DEFAULT_LANGUAGE = "simplified chinese"
5151

5252

5353
class LightRAGError(Exception):

aperag/schema/view_models.py

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414

1515
# generated by datamodel-codegen:
1616
# filename: openapi.merged.yaml
17-
# timestamp: 2026-01-13T12:50:23+00:00
17+
# timestamp: 2026-01-29T03:34:01+00:00
1818

1919
from __future__ import annotations
2020

@@ -72,9 +72,9 @@ class KnowledgeGraphConfig(BaseModel):
7272
"""
7373

7474
language: Optional[str] = Field(
75-
'English',
75+
'simplified chinese',
7676
description='Language for entity extraction and query responses',
77-
examples=['English'],
77+
examples=['simplified chinese'],
7878
)
7979
entity_types: Optional[list[str]] = Field(
8080
[
@@ -114,7 +114,7 @@ class CollectionConfig(BaseModel):
114114
knowledge_graph_config: Optional[KnowledgeGraphConfig] = Field(
115115
default_factory=lambda: KnowledgeGraphConfig.model_validate(
116116
{
117-
'language': 'English',
117+
'language': 'simplified chinese',
118118
'entity_types': [
119119
'organization',
120120
'person',

web/src/api/openapi.merged.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4583,8 +4583,8 @@ components:
45834583
language:
45844584
type: string
45854585
description: Language for entity extraction and query responses
4586-
default: English
4587-
example: English
4586+
default: simplified chinese
4587+
example: simplified chinese
45884588
entity_types:
45894589
type: array
45904590
items:
@@ -4635,7 +4635,7 @@ components:
46354635
allOf:
46364636
- $ref: '#/components/schemas/knowledgeGraphConfig'
46374637
default:
4638-
language: English
4638+
language: simplified chinese
46394639
entity_types:
46404640
- organization
46414641
- person

0 commit comments

Comments
 (0)