group by v2 by Honglei-Qiu · Pull Request #680 · PaddlePaddle/GraphNet

Honglei-Qiu · 2026-03-23T08:50:42Z

PR Category

Feature Enhancement

Description

Add new grouping rules for sample groups.

paddle-bot · 2026-03-23T08:50:49Z

Thanks for your contribution!

sqlite/graph_net_sample_groups_insert2.py

Xreki

Please count how many groups each grouping method produces.
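One way to produce the requested statistic is sketched below. It is only an illustration: the row layout `(method, group_id, sample_uid)` and the function name `count_groups_per_method` are assumptions, not taken from the PR.

```python
from collections import defaultdict


def count_groups_per_method(rows):
    """Count distinct groups per grouping method.

    `rows` is an iterable of (method, group_id, sample_uid) tuples.
    This layout is hypothetical; adapt it to the real group table.
    """
    group_ids_by_method = defaultdict(set)
    for method, group_id, _sample_uid in rows:
        # A group is counted once, however many samples it contains.
        group_ids_by_method[method].add(group_id)
    return {method: len(ids) for method, ids in group_ids_by_method.items()}


demo_rows = [
    ("v1", "g1", "s1"),
    ("v1", "g1", "s2"),
    ("v1", "g2", "s3"),
    ("v2", "g3", "s1"),
]
group_counts = count_groups_per_method(demo_rows)
```

With the demo rows, method `v1` has two distinct groups and `v2` has one.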

sqlite/graph_net_sample_groups_insert_v2.py

Xreki · 2026-03-24T01:59:39Z

sqlite/graph_net_sample_groups_insert_v2.py

+    WHERE s.deleted = 0
+      AND s.sample_type != 'full_graph'
+) sub
+WHERE sub.rn = 1


What is the purpose of this condition?

Probably deduplication, to prevent the same sample from being selected more than once.

Xreki · 2026-03-24T02:01:19Z

sqlite/graph_net_sample_groups_insert_v2.py

+
+def get_v2_group_members(candidates: list[CandidateGraph], num_dtypes: int):
+    # Index candidates by op_seq
+    by_op_seq = defaultdict(list)


Please improve all of the variable names.

sqlite/graph_net_sample_groups_insert_v2.py

Xreki · 2026-03-24T06:13:32Z

sqlite/graph_net_sample_groups_insert.py

+        b.input_shapes_bucket_id,
+        b.input_dtypes_bucket_id,
+        s.graph_hash,
+        ROW_NUMBER() OVER (


graph_hash is no longer needed, right? What does ROW_NUMBER do here?

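Within each (op_seq, shapes, dtypes) partition, rows are numbered in order of creation time, and only rn = 1 (the earliest row) is kept. The purpose is deduplication within a bucket: one bucket may contain several samples, and only one representative is kept.
That said, the code has changed a lot since then.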
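The bucket-level deduplication described above can be tried out against an in-memory SQLite database. This is a minimal sketch, not the PR's query: the table `demo_sample`, the columns `sample_uid` and `created_at`, and the helper names are assumptions. Only `deleted`, `sample_type`, and the three bucket id columns appear in the diff. It requires SQLite 3.25 or newer for window functions.

```python
import sqlite3

DEDUP_QUERY = """
SELECT sub.sample_uid
FROM (
    SELECT
        s.sample_uid,
        ROW_NUMBER() OVER (
            PARTITION BY s.op_seq_bucket_id,
                         s.input_shapes_bucket_id,
                         s.input_dtypes_bucket_id
            ORDER BY s.created_at ASC, s.sample_uid ASC
        ) AS rn
    FROM demo_sample s
    WHERE s.deleted = 0
      AND s.sample_type != 'full_graph'
) sub
WHERE sub.rn = 1
ORDER BY sub.sample_uid
"""


def build_demo_db():
    """Create a hypothetical single-table schema with a few demo rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE demo_sample ("
        "sample_uid TEXT, op_seq_bucket_id TEXT, "
        "input_shapes_bucket_id TEXT, input_dtypes_bucket_id TEXT, "
        "sample_type TEXT, deleted INTEGER, created_at TEXT)"
    )
    rows = [
        ("u1", "op_a", "sh_1", "dt_1", "subgraph", 0, "2026-03-01"),
        ("u2", "op_a", "sh_1", "dt_1", "subgraph", 0, "2026-03-02"),  # same bucket as u1, later
        ("u3", "op_a", "sh_2", "dt_1", "subgraph", 0, "2026-03-03"),  # different shapes bucket
        ("u4", "op_b", "sh_1", "dt_1", "full_graph", 0, "2026-03-01"),  # excluded by sample_type
        ("u5", "op_b", "sh_1", "dt_1", "subgraph", 1, "2026-03-01"),  # excluded because deleted
    ]
    conn.executemany("INSERT INTO demo_sample VALUES (?, ?, ?, ?, ?, ?, ?)", rows)
    return conn


def pick_bucket_representatives(conn):
    """Return the earliest sample_uid of each (op_seq, shapes, dtypes) bucket."""
    return [row[0] for row in conn.execute(DEDUP_QUERY)]


representatives = pick_bucket_representatives(build_demo_db())
```

In the demo data, `u1` and `u2` fall into the same bucket, so only the earlier `u1` survives. The secondary sort key `sample_uid` is an added tie-breaker so that the choice stays deterministic when two rows share a creation time.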

Xreki · 2026-03-24T06:16:01Z

sqlite/graph_net_sample_groups_insert.py

+    """
+
+    # Index candidates by op_seq
+    by_op_seq = defaultdict(list)


A variable name like by_op_seq is too abstract.

candidates_by_op_seq, with a bit of polishing.
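For illustration, the indexing step with the more descriptive name might look like the sketch below. `CandidateGraph` here is a stand-in with only two fields: `op_seq_bucket_id` comes from the diff, while `sample_uid` and the function name `index_candidates_by_op_seq` are assumptions.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidateGraph:
    sample_uid: str        # hypothetical field, for the demo only
    op_seq_bucket_id: str  # field name taken from the diff


def index_candidates_by_op_seq(candidates):
    """Group candidates by their op_seq bucket, preserving input order."""
    candidates_by_op_seq = defaultdict(list)
    for candidate in candidates:
        candidates_by_op_seq[candidate.op_seq_bucket_id].append(candidate)
    return candidates_by_op_seq


demo_candidates = [
    CandidateGraph("u1", "op_a"),
    CandidateGraph("u2", "op_b"),
    CandidateGraph("u3", "op_a"),
]
candidates_by_op_seq = index_candidates_by_op_seq(demo_candidates)
```

Spelling out the loop variable (`candidate` rather than `c`) and the index name keeps the grouping key visible at every use site.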

Xreki · 2026-03-24T06:17:03Z

sqlite/graph_net_sample_groups_insert.py

+    for c in candidates:
+        by_op_seq[c.op_seq_bucket_id].append(c)
+
+    rule3_selected_uids = set()


Don't name variables with a ruleX_-style prefix (as in rule3_selected_uids).

group by v2

95b4b92

group by v2

6185419

Xreki reviewed Mar 23, 2026


Honglei-Qiu added 4 commits March 23, 2026 13:26

group by v2

91a7389

group by v2

e444c2e

group by all

a13a356

group by all

b4a098a

Xreki reviewed Mar 24, 2026

Honglei-Qiu added 4 commits March 24, 2026 07:05

group by all

238d966

group by all

489af43

group by all

05e5d0f

group by all

f784701
