About the REFERENCE dataset

Hello and great job!

To reproduce the baselines:
**Evaluation:**
```bash
python scripts/eval.py \
       --ref_path <reference dataset> \
       --gen_path <generated dataset>
```

What does the **_reference dataset_** refer to?

Here's my understanding:
1. Models are trained on the MOSES_trainingset and generate a large number of molecules.
2. The distribution of these generated molecules is compared with the MOSES_testset, which shares the same distribution as the training set.
3. Therefore, the reference dataset serves as a metric to evaluate how well the models have learned the distribution of the MOSES_trainingset.

Is this correct?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the REFERENCE dataset #115

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

About the REFERENCE dataset #115

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions