I finnish training,it looks perfectable,reward reached -110,but when I render it,it performs terrible,is It should be like this? 