Skip to content

Training or Learning curve? #3

@ynandwan

Description

@ynandwan

Hi
Is it possible for you to share the learning curves: plot of train/val accuracy or val/train loss vs epoch or train time?
I ran run_sr10to40.sh script and it has already run for 47 of 200 epochs (total 50 hrs of running on K40) but the loss hasn't reduced at all. Accuracy is at 0.5 since the beginning. Unless some inflection point comes, I don't see it converging.
One difference that I noticed is that the paper mentions using layer norm lstm whereas you have used normal lstm. Not sure how much impact it has on the performance.

Thanks
Yatin

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions