Commit 7ef34c4

[Doc] add opendatalab download link (#1753)
* add opendatalab link
* fix
* fix
* ip

---------

Co-authored-by: gaotongxiao <[email protected]>
1 parent 47f5430 commit 7ef34c4

6 files changed: 40 additions & 0 deletions
docs/en/user_guides/data_prepare/dataset_preparer.md

Lines changed: 2 additions & 0 deletions
@@ -39,6 +39,8 @@ python tools/dataset_converters/prepare_dataset.py icdar2015 totaltext --task te
 
 To check the supported datasets of Dataset Preparer, please refer to [Dataset Zoo](./datasetzoo.md). Some other datasets that need to be prepared manually are listed in [Text Detection](./det.md) and [Text Recognition](./recog.md).
 
+For users in China, more datasets can be downloaded from the open-source dataset platform [OpenDataLab](https://opendatalab.com/). After downloading the data, place the files listed in `data_obtainer.save_name` in `data/cache` and rerun the script.
+
 ## Advanced Usage
 
 ### LMDB Format
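
The manual step described in the added paragraph (put the downloaded file into `data/cache` under the name the config expects, then rerun the script) could be sketched as follows. This is only an illustration under stated assumptions: `stage_in_cache` is a hypothetical helper, not part of MMOCR, and the value passed as `save_name` has to be read from the `data_obtainer` section of the dataset's own config.

```python
import shutil
from pathlib import Path


def stage_in_cache(downloaded, save_name, cache_dir="data/cache"):
    """Copy a manually downloaded file into the cache directory.

    Hypothetical helper for illustration only (not an MMOCR API).
    `save_name` must match the name listed under `data_obtainer`
    in the dataset config; it is not derived automatically here.
    """
    cache = Path(cache_dir)
    # Create data/cache (and parents) if it does not exist yet.
    cache.mkdir(parents=True, exist_ok=True)
    target = cache / save_name
    # Leave an already staged file untouched.
    if not target.exists():
        shutil.copy2(downloaded, target)
    return target
```

After staging every file listed in the config, rerun `tools/dataset_converters/prepare_dataset.py` with the same arguments as before.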

docs/en/user_guides/data_prepare/det.md

Lines changed: 8 additions & 0 deletions
@@ -46,6 +46,14 @@ This page is a manual preparation guide for datasets not yet supported by [Datas
 # Default output format [None]
 ```
 
+For users in China, these datasets can also be downloaded at high speed from [OpenDataLab](https://opendatalab.com/):
+
+- [CTW1500](https://opendatalab.com/SCUT-CTW1500?source=OpenMMLab%20GitHub)
+- [ICDAR2013](https://opendatalab.com/ICDAR_2013?source=OpenMMLab%20GitHub)
+- [ICDAR2015](https://opendatalab.com/ICDAR2015?source=OpenMMLab%20GitHub)
+- [Totaltext](https://opendatalab.com/TotalText?source=OpenMMLab%20GitHub)
+- [MSRA-TD500](https://opendatalab.com/MSRA-TD500?source=OpenMMLab%20GitHub)
+
 ## Important Note
 
 ```{note}

docs/en/user_guides/data_prepare/recog.md

Lines changed: 10 additions & 0 deletions
@@ -49,6 +49,16 @@ This page is a manual preparation guide for datasets not yet supported by [Datas
 # Default output format [None]
 ```
 
+For users in China, these datasets can also be downloaded at high speed from [OpenDataLab](https://opendatalab.com/):
+
+- [icdar_2013](https://opendatalab.com/ICDAR_2013?source=OpenMMLab%20GitHub)
+- [icdar_2015](https://opendatalab.com/ICDAR2015?source=OpenMMLab%20GitHub)
+- [IIIT5K](https://opendatalab.com/IIIT_5K?source=OpenMMLab%20GitHub)
+- [ct80](https://opendatalab.com/CUTE_80?source=OpenMMLab%20GitHub)
+- [svt](https://opendatalab.com/SVT?source=OpenMMLab%20GitHub)
+- [Totaltext](https://opendatalab.com/TotalText?source=OpenMMLab%20GitHub)
+- [IAM](https://opendatalab.com/IAM_Handwriting?source=OpenMMLab%20GitHub)
+
 ## ICDAR 2011 (Born-Digital Images)
 
 - Step1: Download `Challenge1_Training_Task3_Images_GT.zip`, `Challenge1_Test_Task3_Images.zip`, and `Challenge1_Test_Task3_GT.txt` from [homepage](https://rrc.cvc.uab.es/?ch=1&com=downloads) `Task 1.3: Word Recognition (2013 edition)`.
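
The OpenDataLab links added in this hunk share one pattern: a per-dataset path on `https://opendatalab.com` plus a `source` query parameter whose value is `OpenMMLab GitHub` with the space percent-encoded. A minimal sketch of that pattern, assuming nothing beyond the links themselves (`SLUGS` and `opendatalab_link` are hypothetical names introduced here, and the slugs are copied from the list above):

```python
from urllib.parse import quote, urlencode

BASE_URL = "https://opendatalab.com"

# Link text -> OpenDataLab path, copied from the links in this hunk.
SLUGS = {
    "icdar_2013": "ICDAR_2013",
    "icdar_2015": "ICDAR2015",
    "IIIT5K": "IIIT_5K",
    "ct80": "CUTE_80",
    "svt": "SVT",
    "Totaltext": "TotalText",
    "IAM": "IAM_Handwriting",
}


def opendatalab_link(name, source="OpenMMLab GitHub"):
    """Build the dataset URL with the referral tag used in the docs."""
    # quote_via=quote encodes the space as %20 rather than '+'.
    query = urlencode({"source": source}, quote_via=quote)
    return f"{BASE_URL}/{SLUGS[name]}?{query}"
```

The slugs do not follow from the dataset names by any rule (`ct80` maps to `CUTE_80`, for instance), so an explicit table is kept rather than a string transformation.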

docs/zh_cn/user_guides/data_prepare/dataset_preparer.md

Lines changed: 2 additions & 0 deletions
@@ -38,6 +38,8 @@ python tools/dataset_converters/prepare_dataset.py icdar2015 totaltext --task te
 
 To learn more about the datasets supported by Dataset Preparer, see the [supported datasets documentation](./datasetzoo.md). Some datasets that must be prepared manually are also listed in [Text Detection](./det.md) and [Text Recognition](./recog.md).
 
+For users in China, we also recommend downloading data from the open-source data platform [OpenDataLab](https://opendatalab.com/) for a better download experience. After downloading, refer to the `save_name` field of `data_obtainer` in the script, place the files under `data/cache/`, and rerun the script.
+
 ## Advanced Usage
 
 ### LMDB Format

docs/zh_cn/user_guides/data_prepare/det.md

Lines changed: 8 additions & 0 deletions
@@ -20,6 +20,14 @@
 | TextOCR | [Download](https://textvqa.org/textocr/dataset) | - | - | - |
 | Totaltext | [Download](https://github.com/cs-chan/Total-Text-Dataset) | - | - | - |
 
+For users in China, we also recommend obtaining these datasets from the open-source data platform [OpenDataLab](https://opendatalab.com/) for a better download experience:
+
+- [CTW1500](https://opendatalab.com/SCUT-CTW1500?source=OpenMMLab%20GitHub)
+- [ICDAR2013](https://opendatalab.com/ICDAR_2013?source=OpenMMLab%20GitHub)
+- [ICDAR2015](https://opendatalab.com/ICDAR2015?source=OpenMMLab%20GitHub)
+- [Totaltext](https://opendatalab.com/TotalText?source=OpenMMLab%20GitHub)
+- [MSRA-TD500](https://opendatalab.com/MSRA-TD500?source=OpenMMLab%20GitHub)
+
 ## Important Note
 
 ```{note}

docs/zh_cn/user_guides/data_prepare/recog.md

Lines changed: 10 additions & 0 deletions
@@ -103,6 +103,16 @@
 
 (\*) Note: Because the official download link is no longer accessible, we provide an unofficial link for reference, but we cannot guarantee the accuracy of the data.
 
+For users in China, we also recommend obtaining these datasets from the open-source data platform [OpenDataLab](https://opendatalab.com/) for a better download experience:
+
+- [icdar_2013](https://opendatalab.com/ICDAR_2013?source=OpenMMLab%20GitHub)
+- [icdar_2015](https://opendatalab.com/ICDAR2015?source=OpenMMLab%20GitHub)
+- [IIIT5K](https://opendatalab.com/IIIT_5K?source=OpenMMLab%20GitHub)
+- [ct80](https://opendatalab.com/CUTE_80?source=OpenMMLab%20GitHub)
+- [svt](https://opendatalab.com/SVT?source=OpenMMLab%20GitHub)
+- [Totaltext](https://opendatalab.com/TotalText?source=OpenMMLab%20GitHub)
+- [IAM](https://opendatalab.com/IAM_Handwriting?source=OpenMMLab%20GitHub)
+
 ## Preparation Steps
 
 ### ICDAR 2013
