@@ -25,6 +25,8 @@ and 25,000 for testing. There is additional unlabeled data for use as well.
 <span class="material-icons icon-after" aria-hidden="true">north_east
 </span></a>
 
+* **Config description**: Plain text
+
 * **Homepage**:
   [http://ai.stanford.edu/~amaas/data/sentiment/](http://ai.stanford.edu/~amaas/data/sentiment/)
 
@@ -36,35 +38,6 @@ and 25,000 for testing. There is additional unlabeled data for use as well.
 * **`1.0.0`** (default): New split API
   (https://tensorflow.org/datasets/splits)
 
-* **Supervised keys** (See
-  [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
-  `('text', 'label')`
-
-* **Figure**
-  ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):
-  Not supported.
-
-* **Citation**:
-
-```
-@InProceedings{maas-EtAl:2011:ACL-HLT2011,
-  author = {Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher},
-  title = {Learning Word Vectors for Sentiment Analysis},
-  booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
-  month = {June},
-  year = {2011},
-  address = {Portland, Oregon, USA},
-  publisher = {Association for Computational Linguistics},
-  pages = {142--150},
-  url = {http://www.aclweb.org/anthology/P11-1015}
-}
-```
-
-
-## imdb_reviews/plain_text (default config)
-
-* **Config description**: Plain text
-
 * **Download size**: `80.23 MiB`
 
 * **Dataset size**: `129.83 MiB`
@@ -98,77 +71,13 @@ Feature | Class | Shape | Dtype | Description
 label | ClassLabel | | int64 |
 text | Text | | string |
 
-* **Examples**
-  ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
-
-<!-- mdformat off(HTML should not be auto-formatted) -->
-
-{% framebox %}
-
-<button id="displaydataframe">Display examples...</button>
-<div id="dataframecontent" style="overflow-x:auto"></div>
-<script>
-const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-plain_text-1.0.0.html";
-const dataButton = document.getElementById('displaydataframe');
-dataButton.addEventListener('click', async () => {
-  // Disable the button after clicking (dataframe loaded only once).
-  dataButton.disabled = true;
-
-  const contentPane = document.getElementById('dataframecontent');
-  try {
-    const response = await fetch(url);
-    // Error response codes don't throw an error, so force an error to show
-    // the error message.
-    if (!response.ok) throw Error(response.statusText);
-
-    const data = await response.text();
-    contentPane.innerHTML = data;
-  } catch (e) {
-    contentPane.innerHTML =
-        'Error loading examples. If the error persist, please open '
-        + 'a new issue.';
-  }
-});
-</script>
-
-{% endframebox %}
-
-<!-- mdformat on -->
-
-## imdb_reviews/bytes
-
-* **Config description**: Uses byte-level text encoding with
-  `tfds.deprecated.text.ByteTextEncoder`
-
-* **Download size**: `Unknown size`
-
-* **Dataset size**: `Unknown size`
-
-* **Auto-cached**
-  ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
-  Unknown
-
-* **Splits**:
-
-Split | Examples
-:---- | -------:
-
-* **Feature structure**:
-
-```python
-FeaturesDict({
-    'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
-    'text': Text(shape=(None,), dtype=int64, encoder=<ByteTextEncoder vocab_size=257>),
-})
-```
-
-* **Feature documentation**:
+* **Supervised keys** (See
+  [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
+  `('text', 'label')`
 
-Feature | Class | Shape | Dtype | Description
-:------ | :----------- | :------ | :---- | :----------
- | FeaturesDict | | |
-label | ClassLabel | | int64 |
-text | Text | (None,) | int64 |
+* **Figure**
+  ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):
+  Not supported.
 
 * **Examples**
   ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
@@ -180,7 +89,7 @@ text | Text | (None,) | int64 |
 <button id="displaydataframe">Display examples...</button>
 <div id="dataframecontent" style="overflow-x:auto"></div>
 <script>
-const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-bytes-1.0.0.html";
+const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-plain_text-1.0.0.html";
 const dataButton = document.getElementById('displaydataframe');
 dataButton.addEventListener('click', async () => {
   // Disable the button after clicking (dataframe loaded only once).
@@ -207,146 +116,21 @@ dataButton.addEventListener('click', async () => {
 
 <!-- mdformat on -->
 
-## imdb_reviews/subwords8k
-
-* **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
-  8k vocab size
-
-* **Download size**: `Unknown size`
-
-* **Dataset size**: `Unknown size`
-
-* **Auto-cached**
-  ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
-  Unknown
-
-* **Splits**:
-
-Split | Examples
-:---- | -------:
-
-* **Feature structure**:
+* **Citation**:
 
-```python
-FeaturesDict({
-    'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
-    'text': Text(shape=(None,), dtype=int64),
-})
 ```
-
-* **Feature documentation**:
-
-Feature | Class | Shape | Dtype | Description
-:------ | :----------- | :------ | :---- | :----------
- | FeaturesDict | | |
-label | ClassLabel | | int64 |
-text | Text | (None,) | int64 |
-
-* **Examples**
-  ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
-
-<!-- mdformat off(HTML should not be auto-formatted) -->
-
-{% framebox %}
-
-<button id="displaydataframe">Display examples...</button>
-<div id="dataframecontent" style="overflow-x:auto"></div>
-<script>
-const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-subwords8k-1.0.0.html";
-const dataButton = document.getElementById('displaydataframe');
-dataButton.addEventListener('click', async () => {
-  // Disable the button after clicking (dataframe loaded only once).
-  dataButton.disabled = true;
-
-  const contentPane = document.getElementById('dataframecontent');
-  try {
-    const response = await fetch(url);
-    // Error response codes don't throw an error, so force an error to show
-    // the error message.
-    if (!response.ok) throw Error(response.statusText);
-
-    const data = await response.text();
-    contentPane.innerHTML = data;
-  } catch (e) {
-    contentPane.innerHTML =
-        'Error loading examples. If the error persist, please open '
-        + 'a new issue.';
-  }
-});
-</script>
-
-{% endframebox %}
-
-<!-- mdformat on -->
-
-## imdb_reviews/subwords32k
-
-* **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
-  32k vocab size
-
-* **Download size**: `Unknown size`
-
-* **Dataset size**: `Unknown size`
-
-* **Auto-cached**
-  ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
-  Unknown
-
-* **Splits**:
-
-Split | Examples
-:---- | -------:
-
-* **Feature structure**:
-
-```python
-FeaturesDict({
-    'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
-    'text': Text(shape=(None,), dtype=int64),
-})
+@InProceedings{maas-EtAl:2011:ACL-HLT2011,
+  author = {Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher},
+  title = {Learning Word Vectors for Sentiment Analysis},
+  booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
+  month = {June},
+  year = {2011},
+  address = {Portland, Oregon, USA},
+  publisher = {Association for Computational Linguistics},
+  pages = {142--150},
+  url = {http://www.aclweb.org/anthology/P11-1015}
+}
 ```
 
-* **Feature documentation**:
-
-Feature | Class | Shape | Dtype | Description
-:------ | :----------- | :------ | :---- | :----------
- | FeaturesDict | | |
-label | ClassLabel | | int64 |
-text | Text | (None,) | int64 |
-
-* **Examples**
-  ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
 
-<!-- mdformat off(HTML should not be auto-formatted) -->
-
-{% framebox %}
-
-<button id="displaydataframe">Display examples...</button>
-<div id="dataframecontent" style="overflow-x:auto"></div>
-<script>
-const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-subwords32k-1.0.0.html";
-const dataButton = document.getElementById('displaydataframe');
-dataButton.addEventListener('click', async () => {
-  // Disable the button after clicking (dataframe loaded only once).
-  dataButton.disabled = true;
-
-  const contentPane = document.getElementById('dataframecontent');
-  try {
-    const response = await fetch(url);
-    // Error response codes don't throw an error, so force an error to show
-    // the error message.
-    if (!response.ok) throw Error(response.statusText);
-
-    const data = await response.text();
-    contentPane.innerHTML = data;
-  } catch (e) {
-    contentPane.innerHTML =
-        'Error loading examples. If the error persist, please open '
-        + 'a new issue.';
-  }
-});
-</script>
-
-{% endframebox %}
-
-<!-- mdformat on -->
+## imdb_reviews/plain_text (default config)