- Multiple [strategies](https://github.com/NUS-HPC-AI-Lab/DD-Ranking/tree/main/dd_ranking/loss) for using soft labels in existing works;
- Commonly used [data augmentation](https://github.com/NUS-HPC-AI-Lab/DD-Ranking/tree/main/dd_ranking/aug) methods in existing works;
- Commonly used [model architectures](https://github.com/NUS-HPC-AI-Lab/DD-Ranking/blob/main/dd_ranking/utils/networks.py) in existing works.

DD-Ranking has the following features:

- **Fair Evaluation**: DD-Ranking provides a fair evaluation scheme for DD methods that decouples the impacts of knowledge distillation and data augmentation, reflecting the real informativeness of the distilled data.
- **Easy-to-use**: DD-Ranking provides a unified interface for dataset distillation evaluation.
- **Extensible**: DD-Ranking supports various datasets and models.
- **Customizable**: DD-Ranking supports various data augmentations and soft label strategies.
</details>
## Overview
Included datasets and methods (categorized by hard/soft label):

|Supported Dataset|Evaluated Hard Label Methods|Evaluated Soft Label Methods|
|:-|:-|:-|
|CIFAR-10|DC|DATM|
|CIFAR-100|DSA|SRe2L|
|TinyImageNet|DM|RDED|
||MTT|D4M|
Evaluation results can be found in the [leaderboard](https://huggingface.co/spaces/Soptq/DD-Ranking).
## Tutorial
Install DD-Ranking with `pip` or from [source](https://github.com/NUS-HPC-AI-Lab/DD-Ranking/tree/main).
<!-- - [Quickstart]()
- [Supported Models]() -->
## Coming Soon

- [ ] DD-Ranking scores that decouple the impacts of data augmentation.
- [ ] Evaluation results on ImageNet subsets.
## Contributing
<!-- Only PR for the 1st version of DD-Ranking -->
---

**doc/introduction.md**
Dataset Distillation (DD) aims to condense a large dataset into a much smaller one.

Notably, more and more methods are transitioning from "hard label" to "soft label" in dataset distillation, especially during evaluation. **Hard labels** are categorical, having the same format as the real dataset. **Soft labels** are the outputs of a pre-trained teacher model.
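
To make the distinction concrete, the following is a minimal PyTorch sketch of how soft labels are typically produced. The teacher architecture, batch, and temperature are illustrative assumptions, not settings prescribed by any particular DD method:

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

# Illustrative teacher: any pre-trained classifier can play this role.
teacher = resnet18(weights="IMAGENET1K_V1").eval()

images = torch.randn(4, 3, 224, 224)      # dummy batch of (distilled) images
hard_labels = torch.tensor([0, 1, 2, 3])  # categorical, same format as the real dataset

with torch.no_grad():
    logits = teacher(images)
    T = 4.0                                # temperature, an illustrative choice
    soft_labels = F.softmax(logits / T, dim=1)

print(hard_labels.shape)  # torch.Size([4])       -> one class index per image
print(soft_labels.shape)  # torch.Size([4, 1000]) -> one distribution per image
```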
Recently, Deng et al. pointed out that "a label is worth a thousand images". They showed analytically that soft labels are extremely useful for accuracy improvement.
However, since the essence of soft labels is **knowledge distillation**, we find that when applying the same evaluation method to randomly selected data, the test accuracy also improves significantly (see the figure above).
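
To see why random data benefits as well, consider a minimal sketch of the standard knowledge-distillation objective (after Hinton et al.) that soft-label evaluation amounts to; the temperature `T` and weight `alpha` are illustrative assumptions:

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, hard_labels, T=4.0, alpha=0.5):
    """Weighted sum of cross-entropy on hard labels and KL divergence
    to the teacher's softened distribution (T and alpha are illustrative)."""
    ce = F.cross_entropy(student_logits, hard_labels)
    kl = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # conventional rescaling so gradients match the CE term
    return alpha * ce + (1 - alpha) * kl
```

Nothing in this objective depends on how the images were obtained, so the teacher's knowledge boosts randomly selected real images just as it boosts distilled ones; this is the confounding effect that DD-Ranking is designed to decouple.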