You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Now before making changes to the files, go to your terminal under the repo you just cloned, and type the following:
15
+
16
+
```bash
17
+
git checkout -b add-my-name
18
+
```
19
+
20
+
By running the above command, you just created a new branch called add-my-name and checked it out, what this does is that it creates a new branch with the commit history of the master branch or the branch that you were on previously.
21
+
22
+
4. Make your changes and push your code.
23
+
24
+
```
25
+
git add .
26
+
git commit -m "xxx"
27
+
git push
28
+
```
29
+
30
+
This will create a new commit with the changes you made.
31
+
32
+
5. Now create a pull request and add the title.
33
+
34
+
Sit back and relax while your pull request is being reviewed and merged.
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
@@ -63,13 +62,14 @@ After data generation, you can use [LLaMA-Factory](https://github.com/hiyouga/LL
63
62
64
63
## 📌 Latest Updates
65
64
65
+
-**2025.12.1**: Added search support for [NCBI](https://www.ncbi.nlm.nih.gov/) and [RNAcentral](https://rnacentral.org/) databases, enabling extraction of DNA and RNA data from these bioinformatics databases.
66
66
-**2025.10.30**: We support several new LLM clients and inference backends including [Ollama_client](https://github.com/open-sciencelab/GraphGen/blob/main/graphgen/models/llm/api/ollama_client.py), [http_client](https://github.com/open-sciencelab/GraphGen/blob/main/graphgen/models/llm/api/http_client.py), [HuggingFace Transformers](https://github.com/open-sciencelab/GraphGen/blob/main/graphgen/models/llm/local/hf_wrapper.py) and [SGLang](https://github.com/open-sciencelab/GraphGen/blob/main/graphgen/models/llm/local/sglang_wrapper.py).
67
67
-**2025.10.23**: We support VQA(Visual Question Answering) data generation now. Run script: `bash scripts/generate/generate_vqa.sh`.
68
-
-**2025.10.21**: We support PDF as input format for data generation now via [MinerU](https://github.com/opendatalab/MinerU).
69
68
70
69
<details>
71
70
<summary>History</summary>
72
71
72
+
-**2025.10.21**: We support PDF as input format for data generation now via [MinerU](https://github.com/opendatalab/MinerU).
73
73
-**2025.09.29**: We auto-update gradio demo on [Hugging Face](https://huggingface.co/spaces/chenzihong/GraphGen) and [ModelScope](https://modelscope.cn/studios/chenzihong/GraphGen).
74
74
-**2025.08.14**: We have added support for community detection in knowledge graphs using the Leiden algorithm, enabling the synthesis of Chain-of-Thought (CoT) data.
75
75
-**2025.07.31**: We have added Google, Bing, Wikipedia, and UniProt as search back-ends.
@@ -83,9 +83,10 @@ After data generation, you can use [LLaMA-Factory](https://github.com/hiyouga/LL
83
83
We support various LLM inference servers, API servers, inference clients, input file formats, data modalities, output data formats, and output data types.
84
84
Users can flexibly configure according to the needs of synthetic data.
85
85
86
-
| Inference Server | Api Server | Inference Client | Input File Format | Data Modal | Data Format | Data Type |
Experience GraphGen through [Web](https://g-app-center-120612-6433-jpdvmvp.openxlab.space) or [Backup Web Entrance](https://openxlab.org.cn/apps/detail/chenzihonga/GraphGen)
124
+
Experience GraphGen Demo through [Huggingface](https://huggingface.co/spaces/chenzihong/GraphGen) or [Modelscope](https://modelscope.cn/studios/chenzihong/GraphGen).
111
125
112
126
For any questions, please check [FAQ](https://github.com/open-sciencelab/GraphGen/issues/10), open new [issue](https://github.com/open-sciencelab/GraphGen/issues) or join our [wechat group](https://cdn.vansin.top/internlm/dou.jpg) and ask.
113
127
@@ -263,4 +277,4 @@ This project is licensed under the [Apache License 2.0](LICENSE).
263
277
264
278
## 📅 Star History
265
279
266
-
[](https://www.star-history.com/#open-sciencelab/GraphGen&Date)
280
+
[](https://www.star-history.com/#open-sciencelab/GraphGen&Date)
[](https://www.star-history.com/#open-sciencelab/GraphGen&Date)
276
+
[](https://www.star-history.com/#open-sciencelab/GraphGen&Date)
0 commit comments