Discrepancy Between Hyperparameters in Paper and Code — Which Should Be Used for Reproducing Results?

Hi, I noticed some inconsistencies between the hyperparameter values (e.g. learning rate) reported in the paper and those used in the provided code. To faithfully reproduce the results described in the paper, should I follow the values in the paper or the ones in the code? It would be greatly appreciated if you could clarify which configuration was actually used in the experiments reported in the paper. Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discrepancy Between Hyperparameters in Paper and Code — Which Should Be Used for Reproducing Results? #8

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Discrepancy Between Hyperparameters in Paper and Code — Which Should Be Used for Reproducing Results? #8

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions