Thanks for your great work!
I'd like to know how the reported scores, along with those of DreamerV3, were computed. Are they "the average episode returns within the last 10k steps, that is, all episodes that finished between 390k and 400k environment frames"? Or do they come from evaluating the model at the final step? Or was some other calculation method used?
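
To make the first option concrete, here is how I understand that calculation. This is only a minimal sketch: the function name `final_window_score`, the `(end_frame, episode_return)` log format, and the sample numbers are all made up for illustration, and whether the window boundaries are inclusive is my assumption.

```python
from statistics import mean


def final_window_score(episodes, budget=400_000, window=10_000):
    """Average the returns of episodes that finished in the last `window` frames.

    `episodes` is a hypothetical log: an iterable of
    (end_frame, episode_return) pairs.
    """
    lower = budget - window
    # Assumption: an episode counts if it ended after `lower` and
    # no later than `budget` (boundary handling is a guess).
    returns = [ret for end_frame, ret in episodes if lower < end_frame <= budget]
    if not returns:
        raise ValueError("no episode finished inside the final window")
    return mean(returns)


# Made-up data: only the last two episodes end between 390k and 400k frames.
episodes = [(385_000, 10.0), (392_000, 20.0), (399_500, 30.0)]
score = final_window_score(episodes)
```

The alternative I am asking about would instead run separate evaluation episodes with the model saved at the last step and average those returns.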