Automated documentation update.

The TensorFlow Datasets Authors · The TensorFlow Datasets Authors · commit ff7171eade09 · 2024-08-14T13:25:42.000-07:00
PiperOrigin-RevId: 663031637
diff --git a/docs/catalog/_toc.yaml b/docs/catalog/_toc.yaml
@@ -36,6 +36,9 @@ toc:
 - section:
   - path: /datasets/catalog/dices
     title: dices
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Age
 - section:
   - path: /datasets/catalog/ag_news_subset
@@ -104,6 +107,9 @@ toc:
     title: dices
   - path: /datasets/catalog/sift1m
     title: sift1m
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Categorical
 - section:
   - path: /datasets/catalog/ai2_arc_with_ir
@@ -189,6 +195,11 @@ toc:
   - path: /datasets/catalog/scientific_papers
     title: scientific_papers
   title: Document summarization
+- section:
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
+  title: Facial attributes
 - section:
   - path: /datasets/catalog/caltech101
     title: caltech101
@@ -202,10 +213,16 @@ toc:
     title: stl10
   - path: /datasets/catalog/sun397
     title: sun397
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Fine grained image classification
 - section:
   - path: /datasets/catalog/dices
     title: dices
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Gender
 - section:
   - path: /datasets/catalog/ogbg_molpcba
@@ -366,6 +383,9 @@ toc:
     title: tf_flowers
   - path: /datasets/catalog/the300w_lp
     title: the300w_lp
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Image
 - section:
   - path: /datasets/catalog/abstract_reasoning
@@ -500,6 +520,9 @@ toc:
     title: uc_merced
   - path: /datasets/catalog/visual_domain_decathlon
     title: visual_domain_decathlon
+  - path: /datasets/catalog/wake_vision
+    status: nightly
+    title: wake_vision
   title: Image classification
 - section:
   - path: /datasets/catalog/imagenet2012
diff --git a/docs/catalog/overview.md b/docs/catalog/overview.md
@@ -49,6 +49,8 @@ for ex in tfds.load('cifar10', split='train'):
 ### `Age`
 
 *   [`dices`](dices.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Anomaly detection`
 
@@ -91,6 +93,8 @@ for ex in tfds.load('cifar10', split='train'):
 
 *   [`dices`](dices.md)
 *   [`sift1m`](sift1m.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Common sense reasoning`
 
@@ -154,6 +158,11 @@ for ex in tfds.load('cifar10', split='train'):
 *   [`newsroom`](newsroom.md)
 *   [`scientific_papers`](scientific_papers.md)
 
+### `Facial attributes`
+
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
+
 ### `Fine grained image classification`
 
 *   [`caltech101`](caltech101.md)
@@ -162,10 +171,14 @@ for ex in tfds.load('cifar10', split='train'):
 *   [`stanford_dogs`](stanford_dogs.md)
 *   [`stl10`](stl10.md)
 *   [`sun397`](sun397.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Gender`
 
 *   [`dices`](dices.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Graph`
 
@@ -252,6 +265,8 @@ for ex in tfds.load('cifar10', split='train'):
 *   [`symmetric_solids`](symmetric_solids.md)
 *   [`tf_flowers`](tf_flowers.md)
 *   [`the300w_lp`](the300w_lp.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Image classification`
 
@@ -321,6 +336,8 @@ for ex in tfds.load('cifar10', split='train'):
 *   [`svhn_cropped`](svhn_cropped.md)
 *   [`uc_merced`](uc_merced.md)
 *   [`visual_domain_decathlon`](visual_domain_decathlon.md)
+*   [`wake_vision`](wake_vision.md)
+    <span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>
 
 ### `Image clustering`
 
diff --git a/docs/catalog/wake_vision.md b/docs/catalog/wake_vision.md
@@ -0,0 +1,135 @@
+<div itemscope itemtype="http://schema.org/Dataset">
+  <div itemscope itemprop="includedInDataCatalog" itemtype="http://schema.org/DataCatalog">
+    <meta itemprop="name" content="TensorFlow Datasets" />
+  </div>
+  <meta itemprop="name" content="wake_vision" />
+  <meta itemprop="description" content="Wake Vision is a large, high-quality dataset featuring over 6 million images,&#10;significantly exceeding the scale and diversity of current tinyML datasets&#10;(100x). This dataset includes images with annotations of whether each image&#10;contains a person. Additionally, it incorporates a comprehensive fine-grained&#10;benchmark to assess fairness and robustness, covering perceived gender,&#10;perceived age, subject distance, lighting conditions, and depictions. The Wake&#10;Vision labels are derived from Open Image&#x27;s annotations which are licensed by&#10;Google LLC under CC BY 4.0 license. The images are listed as having a CC BY 2.0&#10;license. Note from Open Images: &quot;while we tried to identify images that are&#10;licensed under a Creative Commons Attribution license, we make no&#10;representations or warranties regarding the license status of each image and you&#10;should verify the license for each image yourself.&quot;&#10;&#10;To use this dataset:&#10;&#10;```python&#10;import tensorflow_datasets as tfds&#10;&#10;ds = tfds.load(&#x27;wake_vision&#x27;, split=&#x27;train&#x27;)&#10;for ex in ds.take(4):&#10;  print(ex)&#10;```&#10;&#10;See [the guide](https://www.tensorflow.org/datasets/overview) for more&#10;informations on [tensorflow_datasets](https://www.tensorflow.org/datasets).&#10;&#10;" />
+  <meta itemprop="url" content="https://www.tensorflow.org/datasets/catalog/wake_vision" />
+  <meta itemprop="sameAs" content="https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2F1HOPXC" />
+  <meta itemprop="citation" content="@article{banbury2024wake,&#10;  title={Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection},&#10;  author={Banbury, Colby and Njor, Emil and Stewart, Matthew and Warden, Pete and Kudlur, Manjunath and Jeffries, Nat and Fafoutis, Xenofon and Reddi, Vijay Janapa},&#10;  journal={arXiv preprint arXiv:2405.00892},&#10;  year={2024}&#10;}" />
+</div>
+
+# `wake_vision`
+
+
+Note: This dataset was added recently and is only available in our
+`tfds-nightly` package
+<span class="material-icons" title="Available only in the tfds-nightly package">nights_stay</span>.
+
+*   **Description**:
+
+Wake Vision is a large, high-quality dataset featuring over 6 million images,
+significantly exceeding the scale and diversity of current tinyML datasets
+(100x). This dataset includes images with annotations of whether each image
+contains a person. Additionally, it incorporates a comprehensive fine-grained
+benchmark to assess fairness and robustness, covering perceived gender,
+perceived age, subject distance, lighting conditions, and depictions. The Wake
+Vision labels are derived from Open Image's annotations which are licensed by
+Google LLC under CC BY 4.0 license. The images are listed as having a CC BY 2.0
+license. Note from Open Images: "while we tried to identify images that are
+licensed under a Creative Commons Attribution license, we make no
+representations or warranties regarding the license status of each image and you
+should verify the license for each image yourself."
+
+*   **Homepage**:
+    [https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2F1HOPXC](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2F1HOPXC)
+
+*   **Source code**:
+    [`tfds.datasets.wake_vision.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/wake_vision/wake_vision_dataset_builder.py)
+
+*   **Versions**:
+
+    *   **`1.0.0`** (default): Initial TensorFlow Datasets release. Note that
+        this is based on the 2.0 version of Wake Vision on Harvard Dataverse.
+
+*   **Download size**: `Unknown size`
+
+*   **Dataset size**: `Unknown size`
+
+*   **Auto-cached**
+    ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
+    Unknown
+
+*   **Splits**:
+
+Split | Examples
+:---- | -------:
+
+*   **Feature structure**:
+
+```python
+FeaturesDict({
+    'age_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'body_part': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'bright': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'dark': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'far': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'filename': Text(shape=(), dtype=string),
+    'gender_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'image': Image(shape=(None, None, 3), dtype=uint8),
+    'medium_distance': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'middle_age': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'near': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'non-person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'non-person_non-depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'normal_lighting': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'older': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'person': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'predominantly_female': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'predominantly_male': ClassLabel(shape=(), dtype=int64, num_classes=2),
+    'young': ClassLabel(shape=(), dtype=int64, num_classes=2),
+})
+```
+
+*   **Feature documentation**:
+
+Feature                  | Class        | Shape           | Dtype  | Description
+:----------------------- | :----------- | :-------------- | :----- | :----------
+                         | FeaturesDict |                 |        |
+age_unknown              | ClassLabel   |                 | int64  |
+body_part                | ClassLabel   |                 | int64  |
+bright                   | ClassLabel   |                 | int64  |
+dark                     | ClassLabel   |                 | int64  |
+depiction                | ClassLabel   |                 | int64  |
+far                      | ClassLabel   |                 | int64  |
+filename                 | Text         |                 | string |
+gender_unknown           | ClassLabel   |                 | int64  |
+image                    | Image        | (None, None, 3) | uint8  |
+medium_distance          | ClassLabel   |                 | int64  |
+middle_age               | ClassLabel   |                 | int64  |
+near                     | ClassLabel   |                 | int64  |
+non-person_depiction     | ClassLabel   |                 | int64  |
+non-person_non-depiction | ClassLabel   |                 | int64  |
+normal_lighting          | ClassLabel   |                 | int64  |
+older                    | ClassLabel   |                 | int64  |
+person                   | ClassLabel   |                 | int64  |
+person_depiction         | ClassLabel   |                 | int64  |
+predominantly_female     | ClassLabel   |                 | int64  |
+predominantly_male       | ClassLabel   |                 | int64  |
+young                    | ClassLabel   |                 | int64  |
+
+*   **Supervised keys** (See
+    [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
+    `('image', 'person')`
+
+*   **Figure**
+    ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):
+    Not supported.
+
+*   **Examples**
+    ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):
+    Missing.
+
+*   **Citation**:
+
+```
+@article{banbury2024wake,
+  title={Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection},
+  author={Banbury, Colby and Njor, Emil and Stewart, Matthew and Warden, Pete and Kudlur, Manjunath and Jeffries, Nat and Fafoutis, Xenofon and Reddi, Vijay Janapa},
+  journal={arXiv preprint arXiv:2405.00892},
+  year={2024}
+}
+```
+