@@ -36,20 +36,6 @@ and 25,000 for testing. There is additional unlabeled data for use as well.
 * **`1.0.0`** (default): New split API
 (https://tensorflow.org/datasets/splits)
 
-* **Download size**: `80.23 MiB`
-
-* **Auto-cached**
-([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
-Yes
-
-* **Splits**:
-
-Split | Examples
-:--------------- | -------:
-`'test'` | 25,000
-`'train'` | 25,000
-`'unsupervised'` | 50,000
-
 * **Supervised keys** (See
 [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
 `('text', 'label')`
@@ -79,8 +65,22 @@ Split | Examples
 
 * **Config description**: Plain text
 
+* **Download size**: `80.23 MiB`
+
 * **Dataset size**: `129.83 MiB`
 
+* **Auto-cached**
+([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
+Yes
+
+* **Splits**:
+
+Split | Examples
+:--------------- | -------:
+`'test'` | 25,000
+`'train'` | 25,000
+`'unsupervised'` | 50,000
+
 * **Feature structure**:
 
 ```python
@@ -140,7 +140,18 @@ dataButton.addEventListener('click', async () => {
 * **Config description**: Uses byte-level text encoding with
 `tfds.deprecated.text.ByteTextEncoder`
 
-* **Dataset size**: `129.88 MiB`
+* **Download size**: `Unknown size`
+
+* **Dataset size**: `Unknown size`
+
+* **Auto-cached**
+([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
+Unknown
+
+* **Splits**:
+
+Split | Examples
+:---- | -------:
 
 * **Feature structure**:
 
@@ -201,14 +212,25 @@ dataButton.addEventListener('click', async () => {
 * **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
 8k vocab size
 
-* **Dataset size**: `54.72 MiB`
+* **Download size**: `Unknown size`
+
+* **Dataset size**: `Unknown size`
+
+* **Auto-cached**
+([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
+Unknown
+
+* **Splits**:
+
+Split | Examples
+:---- | -------:
 
 * **Feature structure**:
 
 ```python
 FeaturesDict({
     'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
-    'text': Text(shape=(None,), dtype=int64, encoder=<SubwordTextEncoder vocab_size=8185>),
+    'text': Text(shape=(None,), dtype=int64),
 })
 ```
 
@@ -262,14 +284,25 @@ dataButton.addEventListener('click', async () => {
 * **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
 32k vocab size
 
-* **Dataset size**: `50.33 MiB`
+* **Download size**: `Unknown size`
+
+* **Dataset size**: `Unknown size`
+
+* **Auto-cached**
+([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
+Unknown
+
+* **Splits**:
+
+Split | Examples
+:---- | -------:
 
 * **Feature structure**:
 
 ```python
 FeaturesDict({
     'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
-    'text': Text(shape=(None,), dtype=int64, encoder=<SubwordTextEncoder vocab_size=32650>),
+    'text': Text(shape=(None,), dtype=int64),
 })
 ```
 
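The `Split | Examples` tables in these hunks follow one pattern: a header, an alignment rule, and one row per split. The deprecated encoder configs end up with only the header and rule because no split statistics are listed for them. The following is a minimal sketch of that layout, assuming a hypothetical helper named `render_splits_table`; it is not the TFDS documentation generator, only an illustration of how an empty split mapping would produce the empty tables seen above.

```python
from typing import Mapping


def render_splits_table(splits: Mapping[str, int]) -> str:
    """Render a 'Split | Examples' Markdown table (hypothetical helper)."""
    keys = sorted(splits)
    # Split names are shown quoted and wrapped in backticks, e.g. `'train'`.
    names = [f"`'{key}'`" for key in keys]
    # The first column is as wide as its longest cell (at least len("Split")).
    col = max([len("Split")] + [len(name) for name in names])
    lines = [
        f"{'Split':<{col}} | Examples",
        # Left-aligned first column, right-aligned example counts.
        f":{'-' * (col - 1)} | -------:",
    ]
    for key, name in zip(keys, names):
        # Counts use thousands separators, e.g. 25,000.
        lines.append(f"{name:<{col}} | {splits[key]:,}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Split sizes as listed for the plain-text config in the diff.
    print(render_splits_table({"test": 25000, "train": 25000, "unsupervised": 50000}))
    # A config with no recorded split statistics yields header and rule only.
    print(render_splits_table({}))
```

With an empty mapping the sketch emits only the header and the `:---- | -------:` rule, which matches the shape of the tables added for the byte-level and subword configs.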