ValueError when attempting to run pretraining with custom reader #12032
-
I'm trying to pretrain a
The config I'm using is:
The custom pretraining reader simply streams in data from a very, very large
I've been having a lot of trouble figuring out what the problem is. I've compared my |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
I solved my own problem! My custom reader had incorrect output. Once I copied the implementation of
|
Beta Was this translation helpful? Give feedback.
I solved my own problem! My custom reader had incorrect output. Once I copied the implementation of
JsonlCorpus
, it started to work. Still not sure what exactly was going wrong in my code (so I would appreciate some input), but I did get it working. You can see the changes: