Commit 1e14c2a

[DOC] Update GPT Readme (#1718)
* [DOC] Update GPT Readme
  Update GPT Readme
* Update README.md
1 parent 0782e91 commit 1e14c2a

1 file changed, +4 -2 lines changed

examples/language_model/gpt/README.md

Lines changed: 4 additions & 2 deletions
@@ -31,8 +31,11 @@ GPT-[2](https://cdn.openai.com/better-language-models/language_models_are_unsupe
 - tqdm
 - visualdl
 - paddlepaddle-gpu >= 2.2rc
+- pybind11
+- lac (optional)
+- zstandard (optional)

-Install command: `pip install regex sentencepiece tqdm visualdl`
+Install command: `pip install regex sentencepiece tqdm visualdl pybind11 lac zstandard`
 Note: PaddlePaddle 2.2rc or later is required, or the latest develop build; for installation instructions, see the Paddle [official website](https://www.paddlepaddle.org.cn)

 ### Data preparation
### 数据准备
@@ -50,7 +53,6 @@ tar -xvf openwebtext2.json.zst.tar -C /path/to/openwebtext
 ```

 Then use the `create_pretraining_data.py` script in [data_tools](../data_tools) to build the dataset:
-
 ```
 python -u create_pretraining_data.py \
 --model_name gpt2-en \
