Skip to content

Commit 2c16950

Browse files
author
The TensorFlow Datasets Authors
committed
Automated documentation update.
PiperOrigin-RevId: 676535434
1 parent a3428a2 commit 2c16950

File tree

1 file changed

+22
-238
lines changed

1 file changed

+22
-238
lines changed

docs/catalog/imdb_reviews.md

Lines changed: 22 additions & 238 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,8 @@ and 25,000 for testing. There is additional unlabeled data for use as well.
2525
<span class="material-icons icon-after" aria-hidden="true"> north_east
2626
</span> </a>
2727

28+
* **Config description**: Plain text
29+
2830
* **Homepage**:
2931
[http://ai.stanford.edu/~amaas/data/sentiment/](http://ai.stanford.edu/~amaas/data/sentiment/)
3032

@@ -36,35 +38,6 @@ and 25,000 for testing. There is additional unlabeled data for use as well.
3638
* **`1.0.0`** (default): New split API
3739
(https://tensorflow.org/datasets/splits)
3840

39-
* **Supervised keys** (See
40-
[`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
41-
`('text', 'label')`
42-
43-
* **Figure**
44-
([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):
45-
Not supported.
46-
47-
* **Citation**:
48-
49-
```
50-
@InProceedings{maas-EtAl:2011:ACL-HLT2011,
51-
author = {Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher},
52-
title = {Learning Word Vectors for Sentiment Analysis},
53-
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
54-
month = {June},
55-
year = {2011},
56-
address = {Portland, Oregon, USA},
57-
publisher = {Association for Computational Linguistics},
58-
pages = {142--150},
59-
url = {http://www.aclweb.org/anthology/P11-1015}
60-
}
61-
```
62-
63-
64-
## imdb_reviews/plain_text (default config)
65-
66-
* **Config description**: Plain text
67-
6841
* **Download size**: `80.23 MiB`
6942

7043
* **Dataset size**: `129.83 MiB`
@@ -98,77 +71,13 @@ Feature | Class | Shape | Dtype | Description
9871
label | ClassLabel | | int64 |
9972
text | Text | | string |
10073

101-
* **Examples**
102-
([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
103-
104-
<!-- mdformat off(HTML should not be auto-formatted) -->
105-
106-
{% framebox %}
107-
108-
<button id="displaydataframe">Display examples...</button>
109-
<div id="dataframecontent" style="overflow-x:auto"></div>
110-
<script>
111-
const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-plain_text-1.0.0.html";
112-
const dataButton = document.getElementById('displaydataframe');
113-
dataButton.addEventListener('click', async () => {
114-
// Disable the button after clicking (dataframe loaded only once).
115-
dataButton.disabled = true;
116-
117-
const contentPane = document.getElementById('dataframecontent');
118-
try {
119-
const response = await fetch(url);
120-
// Error response codes don't throw an error, so force an error to show
121-
// the error message.
122-
if (!response.ok) throw Error(response.statusText);
123-
124-
const data = await response.text();
125-
contentPane.innerHTML = data;
126-
} catch (e) {
127-
contentPane.innerHTML =
128-
'Error loading examples. If the error persist, please open '
129-
+ 'a new issue.';
130-
}
131-
});
132-
</script>
133-
134-
{% endframebox %}
135-
136-
<!-- mdformat on -->
137-
138-
## imdb_reviews/bytes
139-
140-
* **Config description**: Uses byte-level text encoding with
141-
`tfds.deprecated.text.ByteTextEncoder`
142-
143-
* **Download size**: `Unknown size`
144-
145-
* **Dataset size**: `Unknown size`
146-
147-
* **Auto-cached**
148-
([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
149-
Unknown
150-
151-
* **Splits**:
152-
153-
Split | Examples
154-
:---- | -------:
155-
156-
* **Feature structure**:
157-
158-
```python
159-
FeaturesDict({
160-
'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
161-
'text': Text(shape=(None,), dtype=int64, encoder=<ByteTextEncoder vocab_size=257>),
162-
})
163-
```
164-
165-
* **Feature documentation**:
74+
* **Supervised keys** (See
75+
[`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
76+
`('text', 'label')`
16677

167-
Feature | Class | Shape | Dtype | Description
168-
:------ | :----------- | :------ | :---- | :----------
169-
| FeaturesDict | | |
170-
label | ClassLabel | | int64 |
171-
text | Text | (None,) | int64 |
78+
* **Figure**
79+
([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):
80+
Not supported.
17281

17382
* **Examples**
17483
([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
@@ -180,7 +89,7 @@ text | Text | (None,) | int64 |
18089
<button id="displaydataframe">Display examples...</button>
18190
<div id="dataframecontent" style="overflow-x:auto"></div>
18291
<script>
183-
const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-bytes-1.0.0.html";
92+
const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-plain_text-1.0.0.html";
18493
const dataButton = document.getElementById('displaydataframe');
18594
dataButton.addEventListener('click', async () => {
18695
// Disable the button after clicking (dataframe loaded only once).
@@ -207,146 +116,21 @@ dataButton.addEventListener('click', async () => {
207116

208117
<!-- mdformat on -->
209118

210-
## imdb_reviews/subwords8k
211-
212-
* **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
213-
8k vocab size
214-
215-
* **Download size**: `Unknown size`
216-
217-
* **Dataset size**: `Unknown size`
218-
219-
* **Auto-cached**
220-
([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
221-
Unknown
222-
223-
* **Splits**:
224-
225-
Split | Examples
226-
:---- | -------:
227-
228-
* **Feature structure**:
119+
* **Citation**:
229120

230-
```python
231-
FeaturesDict({
232-
'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
233-
'text': Text(shape=(None,), dtype=int64),
234-
})
235121
```
236-
237-
* **Feature documentation**:
238-
239-
Feature | Class | Shape | Dtype | Description
240-
:------ | :----------- | :------ | :---- | :----------
241-
| FeaturesDict | | |
242-
label | ClassLabel | | int64 |
243-
text | Text | (None,) | int64 |
244-
245-
* **Examples**
246-
([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
247-
248-
<!-- mdformat off(HTML should not be auto-formatted) -->
249-
250-
{% framebox %}
251-
252-
<button id="displaydataframe">Display examples...</button>
253-
<div id="dataframecontent" style="overflow-x:auto"></div>
254-
<script>
255-
const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-subwords8k-1.0.0.html";
256-
const dataButton = document.getElementById('displaydataframe');
257-
dataButton.addEventListener('click', async () => {
258-
// Disable the button after clicking (dataframe loaded only once).
259-
dataButton.disabled = true;
260-
261-
const contentPane = document.getElementById('dataframecontent');
262-
try {
263-
const response = await fetch(url);
264-
// Error response codes don't throw an error, so force an error to show
265-
// the error message.
266-
if (!response.ok) throw Error(response.statusText);
267-
268-
const data = await response.text();
269-
contentPane.innerHTML = data;
270-
} catch (e) {
271-
contentPane.innerHTML =
272-
'Error loading examples. If the error persist, please open '
273-
+ 'a new issue.';
274-
}
275-
});
276-
</script>
277-
278-
{% endframebox %}
279-
280-
<!-- mdformat on -->
281-
282-
## imdb_reviews/subwords32k
283-
284-
* **Config description**: Uses `tfds.deprecated.text.SubwordTextEncoder` with
285-
32k vocab size
286-
287-
* **Download size**: `Unknown size`
288-
289-
* **Dataset size**: `Unknown size`
290-
291-
* **Auto-cached**
292-
([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
293-
Unknown
294-
295-
* **Splits**:
296-
297-
Split | Examples
298-
:---- | -------:
299-
300-
* **Feature structure**:
301-
302-
```python
303-
FeaturesDict({
304-
'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
305-
'text': Text(shape=(None,), dtype=int64),
306-
})
122+
@InProceedings{maas-EtAl:2011:ACL-HLT2011,
123+
author = {Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher},
124+
title = {Learning Word Vectors for Sentiment Analysis},
125+
booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies},
126+
month = {June},
127+
year = {2011},
128+
address = {Portland, Oregon, USA},
129+
publisher = {Association for Computational Linguistics},
130+
pages = {142--150},
131+
url = {http://www.aclweb.org/anthology/P11-1015}
132+
}
307133
```
308134

309-
* **Feature documentation**:
310-
311-
Feature | Class | Shape | Dtype | Description
312-
:------ | :----------- | :------ | :---- | :----------
313-
| FeaturesDict | | |
314-
label | ClassLabel | | int64 |
315-
text | Text | (None,) | int64 |
316-
317-
* **Examples**
318-
([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
319135

320-
<!-- mdformat off(HTML should not be auto-formatted) -->
321-
322-
{% framebox %}
323-
324-
<button id="displaydataframe">Display examples...</button>
325-
<div id="dataframecontent" style="overflow-x:auto"></div>
326-
<script>
327-
const url = "https://storage.googleapis.com/tfds-data/visualization/dataframe/imdb_reviews-subwords32k-1.0.0.html";
328-
const dataButton = document.getElementById('displaydataframe');
329-
dataButton.addEventListener('click', async () => {
330-
// Disable the button after clicking (dataframe loaded only once).
331-
dataButton.disabled = true;
332-
333-
const contentPane = document.getElementById('dataframecontent');
334-
try {
335-
const response = await fetch(url);
336-
// Error response codes don't throw an error, so force an error to show
337-
// the error message.
338-
if (!response.ok) throw Error(response.statusText);
339-
340-
const data = await response.text();
341-
contentPane.innerHTML = data;
342-
} catch (e) {
343-
contentPane.innerHTML =
344-
'Error loading examples. If the error persist, please open '
345-
+ 'a new issue.';
346-
}
347-
});
348-
</script>
349-
350-
{% endframebox %}
351-
352-
<!-- mdformat on -->
136+
## imdb_reviews/plain_text (default config)

0 commit comments

Comments
 (0)