
Commit 294e1dd

Merge branch 'develop' into add_cv_samples_5_need_fix

2 parents: 8f46c3e + fd78a16

File tree: 1,185 files changed (+336,665 / −5,111 lines)


.github/workflows/Validate-GPU.yml

Lines changed: 4 additions & 1 deletion
```diff
@@ -4,6 +4,9 @@ on:
   pull_request:
     branches:
       - develop
+    paths:
+      - "samples/**"
+      - "paddle_samples/**"
 
 env:
   PR_ID: ${{ github.event.pull_request.number }}
@@ -31,7 +34,7 @@ jobs:
       with:
         ref: ${{ github.event.pull_request.base.ref }}
         submodules: 'recursive'
-        fetch-depth: 1000
+        fetch-depth: 10
 
     - name: Check bypass
       id: check-bypass
```

GraphNet_technical_report.pdf

2.01 MB
Binary file not shown.

README.md

Lines changed: 72 additions & 86 deletions
````diff
@@ -1,90 +1,41 @@
-# GraphNet ![](https://img.shields.io/badge/version-v0.1-brightgreen) ![](https://img.shields.io/github/issues/PaddlePaddle/GraphNet?label=open%20issues) [![](https://img.shields.io/badge/Contribute%20to%20GraphNet-blue)](https://github.com/PaddlePaddle/GraphNet/issues/98)
 
-**GraphNet** is a large-scale dataset of deep learning **computation graphs**, built as a standard benchmark for **tensor compiler** optimization. It provides 2.7K computation graphs extracted from state-of-the-art deep learning models spanning diverse tasks and ML frameworks. With standardized formats and rich metadata, GraphNet enables fair comparison and reproducible evaluation of the general optimization capabilities of tensor compilers, thereby supporting advanced research such as AI for System on compilers (AI for Compiler).
+<h1 align="center">GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research</h1>
 
-<br>
 <div align="center">
-<img src="/pics/Eval_result.png" alt="Violin plots of speedup distributions" width="65%">
-</div>
-
-Compiler developers can use GraphNet samples to evaluate tensor compilers (e.g., CINN, TorchInductor, TVM) on target tasks. The figure above shows the speedup of two compilers (CINN and TorchInductor) across two tasks (CV and NLP).
-
-## 🧱 Dataset Construction
-
-To guarantee the dataset’s overall quality, reproducibility, and cross-compiler compatibility, we define the following construction **constraints**:
-
-1. Computation graphs must be executable in imperative (eager) mode.
-2. Computation graphs and their corresponding Python code must support serialization and deserialization.
-3. The full graph can be decomposed into two disjoint subgraphs.
-4. Operator names within each computation graph must be statically parseable.
-5. If custom operators are used, their implementation code must be fully accessible.
 
-### Graph Extraction & Validation
-
-We provide automated extraction and validation tools for constructing this dataset.
-
-<div align="center">
-<img src="/pics/graphnet_overview.jpg" alt="GraphNet Architecture Overview" width="65%">
+![](https://img.shields.io/github/issues/PaddlePaddle/GraphNet?label=open%20issues)
+[![arXiv](https://img.shields.io/badge/arXiv-2510.24035-b31b1b.svg)](https://arxiv.org/abs/2510.24035)
+<a href="https://github.com/user-attachments/assets/125e3494-25c9-4494-9acd-8ad65ca85d03"><img src="https://img.shields.io/badge/微信-green?logo=wechat&amp"></a>
 </div>
 
-**Demo: Extract & Validate ResNet‑18**
-```bash
-git clone https://github.com/PaddlePaddle/GraphNet.git
-cd GraphNet
-
-# Set your workspace directory
-export GRAPH_NET_EXTRACT_WORKSPACE=/home/yourname/graphnet_workspace/
-
-# Extract the ResNet‑18 computation graph
-python graph_net/test/vision_model_test.py
-
-# Validate the extracted graph (e.g. /home/yourname/graphnet_workspace/resnet18/)
-python -m graph_net.torch.validate \
-    --model-path $GRAPH_NET_EXTRACT_WORKSPACE/resnet18/
-```
-
-**Illustration: How does GraphNet extract and construct a computation graph sample on PyTorch?**
+**GraphNet** is a large-scale dataset of deep learning **computation graphs**, built as a standard benchmark for **tensor compiler** optimization. It provides over 2.7K computation graphs extracted from state-of-the-art deep learning models spanning diverse tasks and ML frameworks. With standardized formats and rich metadata, GraphNet enables fair comparison and reproducible evaluation of the general optimization capabilities of tensor compilers, thereby supporting advanced research such as AI for System on compilers.
 
+## 📣 News
+- [2025-10-14] ✨ Our technical report is out: a detailed study of dataset construction and compiler benchmarking, introducing the novel performance metrics Speedup Score S(t) and Error-aware Speedup Score ES(t). [📘 GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research](https://arxiv.org/abs/2510.24035)
+- [2025-8-20] 🚀 The second round of [open contribution tasks](https://github.com/PaddlePaddle/Paddle/issues/74773) was released. (completed ✅)
+- [2025-7-30] 🚀 The first round of [open contribution tasks](https://github.com/PaddlePaddle/GraphNet/issues/44) was released. (completed ✅)
+## 📊 Benchmark Results
+We evaluate two representative tensor compiler backends, CINN (PaddlePaddle) and TorchInductor (PyTorch), on GraphNet's NLP and CV subsets. The evaluation adopts two quantitative metrics proposed in the [Technical Report](https://arxiv.org/abs/2510.24035):
+- **Speedup Score** S(t) — evaluates compiler performance under varying numerical tolerance levels.
 <div align="center">
-<img src="/pics/graphnet_sample.png" alt="GraphNet Extract Sample" width="65%">
+<img src="/pics/St-result.jpg" alt="Speedup Score S_t Results" width="80%">
 </div>
 
-* Source code of custom_op is required **only when** corresponding operator is used in the module, and **no specific format** is required.
-
-**Step 1: graph_net.torch.extract**
-
-Import and wrap the model with `graph_net.torch.extract(name=model_name, dynamic=dynamic_mode)()` is all you need:
-
-```bash
-import graph_net
-
-# Instantiate the model (e.g. a torchvision model)
-model = ...
-
-# Extract your own model
-model = graph_net.torch.extract(name="model_name", dynamic="True")(model)
-```
-
-After running, the extracted graph will be saved to: `$GRAPH_NET_EXTRACT_WORKSPACE/model_name/`.
-
-For more details, see docstring of `graph_net.torch.extract` defined in `graph_net/torch/extractor.py`.
-
-**Step 2: graph_net.torch.validate**
-
-To verify that the extracted model meets requirements, we use `graph_net.torch.validate` in CI tool and also ask contributors to self-check in advance:
+- **Error-aware Speedup Score** ES(t) — further accounts for runtime and compilation errors.
+<div align="center">
+<img src="/pics/ESt-result.jpg" alt="Error-aware Speedup Score ES_t Results" width="80%">
 
-```bash
-python -m graph_net.torch.validate \
-    --model-path $GRAPH_NET_EXTRACT_WORKSPACE/model_name
-```
+</div>
 
-All the **construction constraints** will be examined automatically. After passing validation, a unique `graph_hash.txt` will be generated and later checked in CI procedure to avoid redundant.
+## ⚡ Quick Start
+This section shows how to evaluate tensor compilers and reproduce benchmark results (for compiler users and developers),
+as well as how to contribute new computation graphs (for GraphNet contributors).
 
-## ⚖️ Compiler Evaluation
+### ⚖️ Compiler Evaluation
 
 **Step 1: Benchmark**
 
-We use `graph_net.torch.test_compiler` to benchmark GraphNet samples with specific batch and log configurations:
+Use `graph_net.torch.test_compiler` to benchmark GraphNet samples with specific batch and logging configurations:
 
 ```bash
 # Set your benchmark directory
@@ -99,7 +50,7 @@ python -m graph_net.torch.test_compiler \
     --trials /times/to/test/ \
     > $GRAPH_NET_BENCHMARK_PATH/log.log 2>&1
 
-# Note: if --compiler is omitted, PyTorch’s built-in compiler is used by default
+# Note: If --compiler is omitted, PyTorch’s built-in compiler is used by default.
 ```
 
 After executing, `graph_net.torch.test_compiler` will:
@@ -110,30 +61,50 @@ After executing, `graph_net.torch.test_compiler` will:
 
 **Step 2: Generate JSON Record**
 
-This step is to extract information (including failure) from logs in benchmark.
-All the information will be saved to multiple `model_compiler.json` files via:
+Extract runtime, correctness, and failure information from benchmark logs:
 
 ```bash
-python -m graph_net.torch.log2json \
+python -m graph_net.log2json \
     --log-file $GRAPH_NET_BENCHMARK_PATH/log.log \
-    --output-dir $GRAPH_NET_BENCHMARK_PATH
+    --output-dir $GRAPH_NET_BENCHMARK_PATH/JSON_results/
 ```
 
 **Step 3: Analysis**
 
-After processing, we provide `graph_net/analysis.py` to generate [violin plot](https://en.m.wikipedia.org/wiki/Violin_plot) based on the JSON results.
+Use the three scripts `graph_net.plot_St`, `graph_net.plot_ESt`, and `graph_net.plot_violin` to generate the S(t) plot, the ES(t) plot, and a [violin plot](https://en.m.wikipedia.org/wiki/Violin_plot) from the JSON results.
 
 ```bash
-python -m graph_net.analysis \
-    --benchmark-path /path/to/read/JSON/result/file/ \
-    --output-dir /path/to/save/output/figures/
+python -m graph_net.plot_St \
+    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
+    --output-dir $GRAPH_NET_BENCHMARK_PATH \
+    --negative-speedup-penalty penalty/power/for/negative/speedup \
+    --fpdb base/penalty/for/severe/errors
+
+python -m graph_net.plot_ESt \
+    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
+    --output-dir $GRAPH_NET_BENCHMARK_PATH \
+    --negative-speedup-penalty penalty/power/for/negative/speedup \
+    --fpdb base/penalty/for/severe/errors
+
+# Note: If --negative-speedup-penalty is omitted, p=0 is used by default.
+# If --fpdb is omitted, b=0.1 is used by default.
+
+python -m graph_net.plot_violin \
+    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
+    --output-dir $GRAPH_NET_BENCHMARK_PATH
 ```
 
-After executing, one summary plot of results on all compilers, as well as multiple sub-plots of results in categories (model tasks, Library...) on a single compiler will be exported.
+The scripts expect a directory structure of the form `/benchmark_path/category_name/`; items on the x-axis are identified by the names of the sub-directories. After execution, summary plots of results per category (model tasks, libraries, ...) are exported to `$GRAPH_NET_BENCHMARK_PATH`.
+
+### Hardware Regression Testing
+We also provide a two-step workflow that validates compiler correctness and performance against a "golden" reference, which is crucial for hardware-specific testing and regression tracking. Details can be found in this [guide](./docs/hardware_test.md).
+
+### 🧱 Construction & Contribution Guide
+Want to understand how GraphNet is built or contribute new samples?
+Check out the [Construction Guide](./docs/README_contribute.md) for details on the extraction and validation workflow.
 
-The script is designed to process a file structure as `/benchmark_path/compiler_name/category_name/` (for example `/benchmark_logs/paddle/nlp/`), and items on x-axis are identified by name of the folders. So you can modify `read_all_speedups` function to fit the benchmark settings on your demand.
 
-## 📌 Roadmap
+## 🚀 Future Roadmap
 
 1. Scale GraphNet to 10K+ graphs.
 2. Further annotate GraphNet samples into more granular sub-categories
@@ -142,7 +113,7 @@ The script is designed to process a file structure as `/benchmark_path/compiler_
 
 **Vision**: GraphNet aims to lay the foundation for AI for Compiler by enabling **large-scale, systematic evaluation** of tensor compiler optimizations, and providing a **dataset for models to learn** and transfer optimization strategies.
 
-## 💬 GraphNet Community
+## GraphNet Community
 
 You can join our community via following group chats. Welcome to ask any questions about using and building GraphNet.
 
@@ -160,5 +131,20 @@ You can join our community via following group chats. Welcome to ask any questio
 </table>
 </div>
 
-## 🪪 License
-This project is released under the [MIT License](LICENSE).
+## License and Acknowledgement
+
+GraphNet is released under the [MIT License](./LICENSE).
+
+If you find this project helpful, please cite:
+
+```bibtex
+@misc{li2025graphnetlargescalecomputationalgraph,
+      title={GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research},
+      author={Xinqi Li and Yiqun Liu and Shan Jiang and Enrong Zheng and Huaijin Zheng and Wenhao Dai and Haodong Deng and Dianhai Yu and Yanjun Ma},
+      year={2025},
+      eprint={2510.24035},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG},
+      url={https://arxiv.org/abs/2510.24035},
+}
+```
````
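Taken together, the new README describes a log → JSON → plots pipeline. Below is a minimal sketch of Steps 2 and 3, assuming Step 1 has already written `log.log`; the benchmark path is illustrative, and the penalty values simply pass the documented defaults (`p=0`, `b=0.1`) explicitly:

```bash
# Illustrative path; any writable directory works
export GRAPH_NET_BENCHMARK_PATH=/home/yourname/graphnet_benchmark

# Step 2: convert the benchmark log into per-model JSON records
python -m graph_net.log2json \
    --log-file $GRAPH_NET_BENCHMARK_PATH/log.log \
    --output-dir $GRAPH_NET_BENCHMARK_PATH/JSON_results/

# Step 3: plot S(t), ES(t), and the violin distributions
python -m graph_net.plot_St \
    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
    --output-dir $GRAPH_NET_BENCHMARK_PATH \
    --negative-speedup-penalty 0 \
    --fpdb 0.1

python -m graph_net.plot_ESt \
    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
    --output-dir $GRAPH_NET_BENCHMARK_PATH \
    --negative-speedup-penalty 0 \
    --fpdb 0.1

python -m graph_net.plot_violin \
    --benchmark-path $GRAPH_NET_BENCHMARK_PATH/JSON_results/ \
    --output-dir $GRAPH_NET_BENCHMARK_PATH
```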
2 files renamed without changes.

docs/README_contribute.md

Lines changed: 87 additions & 0 deletions
# Contributing to GraphNet

To guarantee the dataset’s overall quality, reproducibility, and cross-compiler compatibility, we define the following construction **constraints**:

1. Computation graphs must be executable in imperative (eager) mode.
2. Computation graphs and their corresponding Python code must support serialization and deserialization.
3. The full graph can be decomposed into two disjoint subgraphs.
4. Operator names within each computation graph must be statically parseable.
5. If custom operators are used, their implementation code must be fully accessible.

## Graph Extraction & Validation

GraphNet provides automated tools for graph extraction and validation.

<div align="center">
<img src="/pics/graphnet_overview.jpg" alt="GraphNet Architecture Overview" width="65%">
</div>

**Demo: Extract & Validate ResNet‑18**

```bash
git clone https://github.com/PaddlePaddle/GraphNet.git
cd GraphNet

# Set your workspace directory
export GRAPH_NET_EXTRACT_WORKSPACE=/home/yourname/graphnet_workspace/

# Extract the ResNet‑18 computation graph
python graph_net/test/vision_model_test.py

# Validate the extracted graph (e.g. /home/yourname/graphnet_workspace/resnet18/)
python -m graph_net.torch.validate \
    --model-path $GRAPH_NET_EXTRACT_WORKSPACE/resnet18/
```

**Illustration – Extraction Workflow**

<div align="center">
<img src="/pics/dataset_composition.png" alt="GraphNet Extract Sample" width="65%">
</div>

* Source code of custom_op is required **only when** the corresponding operator is used in the module, and **no specific format** is required.

**Step 1: graph_net.torch.extract**

Wrap the model with the extractor — that’s all you need:

```python
import graph_net

# Instantiate the model (e.g. a torchvision model)
model = ...

# Extract your own model
model = graph_net.torch.extract(name="model_name", dynamic="True")(model)
```

After running, the extracted graph will be saved to `$GRAPH_NET_EXTRACT_WORKSPACE/model_name/`.

For more details, see the docstring of `graph_net.torch.extract` defined in `graph_net/torch/extractor.py`.
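For a concrete picture of Step 1, here is a hypothetical end-to-end session. It is a sketch only: it assumes `torchvision` is installed and that a single forward pass through the wrapped model is what triggers the dump into the workspace.

```bash
export GRAPH_NET_EXTRACT_WORKSPACE=/tmp/graphnet_workspace/

python - <<'EOF'
import torch
import torchvision
import graph_net

# Instantiate a real model (resnet18, as in the demo above)
model = torchvision.models.resnet18(weights=None).eval()
model = graph_net.torch.extract(name="resnet18", dynamic="True")(model)

# Run one forward pass so the wrapped model is actually traced
with torch.no_grad():
    model(torch.randn(1, 3, 224, 224))
EOF

# The extracted sample should now be in the workspace
ls "$GRAPH_NET_EXTRACT_WORKSPACE/resnet18/"
```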
**Step 2: graph_net.torch.validate**

To verify that the extracted model meets the requirements, we run `graph_net.torch.validate` in CI and also ask contributors to self-check in advance:

```bash
python -m graph_net.torch.validate \
    --model-path $GRAPH_NET_EXTRACT_WORKSPACE/model_name
```

All the **construction constraints** are examined automatically. After validation passes, a unique `graph_hash.txt` is generated and later checked in the CI procedure to avoid redundancy.
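To see the artifact the CI redundancy check keys on, you can inspect the hash file after a successful validation run; the path below assumes `graph_hash.txt` is written inside the sample directory:

```bash
# Inspect the generated hash after validation passes
cat $GRAPH_NET_EXTRACT_WORKSPACE/model_name/graph_hash.txt
```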
## 📁 Repository Structure

This repository is organized as follows:

| Directory | Description |
|------------|--------------|
| **graph_net/** | Core module for graph extraction, validation, and benchmarking |
| **paddle_samples/** | Computation graph samples extracted from PaddlePaddle |
| **samples/** | Computation graph samples extracted from PyTorch |
| **docs/** | Technical documents and contributor guides |

Below is the structure of **graph_net/**:

```text
graph_net/
├─ config/   # Config files, params
├─ paddle/   # PaddlePaddle graph extraction & validation
├─ torch/    # PyTorch graph extraction & validation
├─ test/     # Unit tests and example scripts
└─ *.py      # Benchmark & analysis scripts
```

docs/hardware_test.md

Lines changed: 20 additions & 0 deletions
## Hardware Regression Testing

### Step 1: Generate Reference Data

First, use `graph_net.paddle.test_reference_device` on a trusted setup (e.g., a specific hardware/compiler version) to generate baseline logs and output files.

```bash
python -m graph_net.paddle.test_reference_device \
    --model-path /path/to/all_models/ \
    --reference-dir ./golden_reference \
    --compiler cinn \
    --device cuda
# --reference-dir: (Required) Directory where the output .log (performance/config) and .pdout (output tensors) files will be saved.
# --compiler: Specifies the compiler backend.
```

### Step 2: Run Regression Test

After changing hardware, run the correctness test script. It reads the reference data, re-runs the models with exactly the same configuration, and compares the new results against the "golden" reference.

```bash
python -m graph_net.paddle.test_device_correctness \
    --reference-dir ./golden_reference \
    --device cuda
```

This script reports any failures (e.g., compilation errors, output mismatches) and prints a performance comparison (speedup/slowdown) against the reference log, allowing you to quickly identify regressions.
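The two steps can be wired into a single wrapper for recurring regression runs. A minimal sketch, assuming both commands exit nonzero on failure and that the model and reference paths below are placeholders for your own:

```bash
#!/usr/bin/env bash
# Hypothetical regression wrapper around the two-step workflow above.
set -euo pipefail

REF_DIR=./golden_reference
MODELS=/path/to/all_models/

# On the trusted baseline machine: generate the golden reference once
python -m graph_net.paddle.test_reference_device \
    --model-path "$MODELS" \
    --reference-dir "$REF_DIR" \
    --compiler cinn \
    --device cuda

# Later, on the machine under test: compare against the golden reference
if python -m graph_net.paddle.test_device_correctness \
    --reference-dir "$REF_DIR" \
    --device cuda; then
    echo "No regression detected against $REF_DIR"
else
    echo "Regression detected; see report above" >&2
    exit 1
fi
```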
