Hi
Would it be possible to share the learning curves, i.e. plots of train/val accuracy or train/val loss against epoch or training time?
I ran the run_sr10to40.sh script, and it has completed 47 of 200 epochs (about 50 hours on a K40), but the loss has not decreased at all. Accuracy has been at 0.5 since the beginning. Unless an inflection point arrives at some stage, I don't see it converging.
One difference I noticed is that the paper mentions using a layer-norm LSTM, whereas you have used a standard LSTM. I'm not sure how much impact that has on performance.
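To make the difference concrete, here is a minimal NumPy sketch of the two cell variants as I understand them. It is only an illustration under my own assumptions: the function names (`layer_norm`, `lstm_step`), the gate ordering, and the placement of the normalizations (on the input and recurrent projections and on the cell state) are mine, and are not taken from this repository or checked against the paper's implementation.

```python
import numpy as np


def layer_norm(x, gain, bias, eps=1e-5):
    """Normalize over the last axis, then apply a learned gain and bias."""
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gain * (x - mean) / np.sqrt(var + eps) + bias


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def lstm_step(x, h, c, W, U, b, ln=None):
    """One LSTM step; pass `ln` (a dict of gains/biases) for the layer-norm variant.

    Shapes (hypothetical): x (batch, in), h and c (batch, hidden),
    W (in, 4*hidden), U (hidden, 4*hidden), b (4*hidden,).
    """
    if ln is None:
        # Standard LSTM: plain sum of the two projections.
        pre = x @ W + h @ U + b
    else:
        # Layer-norm variant: normalize each projection before summing.
        pre = (layer_norm(x @ W, ln["gx"], ln["bx"])
               + layer_norm(h @ U, ln["gh"], ln["bh"])
               + b)
    # Assumed gate order: input, forget, output, candidate.
    i, f, o, g = np.split(pre, 4, axis=-1)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    # The layer-norm variant also normalizes the cell state before the output.
    c_out = c_new if ln is None else layer_norm(c_new, ln["gc"], ln["bc"])
    h_new = sigmoid(o) * np.tanh(c_out)
    return h_new, c_new
```

Both variants take the same weights, so swapping one for the other only adds the gain and bias parameters in `ln`. That is why I'm wondering whether the substitution alone could explain the lack of convergence.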
Thanks
Yatin