Skip to content

Commit cc270ea

Browse files
ljshouwoailaosangadolphk-yk
committed
add dataset and modification to Tutorial.md
Co-Authored-By: woailaosang <[email protected]> Co-Authored-By: adolphk-yk <[email protected]>
1 parent 176573c commit cc270ea

File tree

19 files changed

+6667
-197
lines changed

19 files changed

+6667
-197
lines changed

.gitignore

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,4 +3,3 @@
33
*.pyc
44
*.cache/
55
dataset/GloVe/
6-
dataset/*

Tutorial.md

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -118,7 +118,7 @@ The architecture of the configuration file is:
118118
]
119119
}
120120
```
121-
- ***target***. [necessary for training and test, some part necessary for prediction] This node defines the target column in the train/valid/test data. The type of target is array because our tookit will support multi-target tasks.
121+
- ***target***. [necessary for training and test] This node defines the target column in the train/valid/test data. The type of target is array because our tookit will support multi-target tasks.
122122
- **outputs**. This node defines the settings of path to save models and logs, as well as cache.
123123
- ***save_base_dir***. The directory to save models and logs.
124124
- ***model_name***. The model would be saved as save_base_dir/model_name.
@@ -200,9 +200,7 @@ Text classification is a core problem to many applications like spam filtering,
200200
201201
- ***Dataset***
202202
203-
The [20 Newsgroups data set](http://qwone.com/~jason/20Newsgroups/) is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. Here is a list of the 20 newsgroups, partitioned (more or less) according to subject matter:
204-
205-
![](https://i.imgur.com/rHLITSi.png)
203+
The [20 Newsgroups data set](http://qwone.com/~jason/20Newsgroups/) is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups.
206204
207205
- ***Usage***
208206

dataset/advanced_demo/predict.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

dataset/advanced_demo/test.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

dataset/advanced_demo/train.tsv

Lines changed: 1000 additions & 0 deletions
Large diffs are not rendered by default.

dataset/advanced_demo/valid.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

dataset/demo/predict.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

dataset/demo/test.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

dataset/demo/train.tsv

Lines changed: 1000 additions & 0 deletions
Large diffs are not rendered by default.

dataset/demo/valid.tsv

Lines changed: 500 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)