❓ Question
I wanted to train a DQN agent for 10M, given the time limit of the cluster I'm using I have to continue training the agent by loading the last saved model.I tried to follow the documentation (see the command bellow) but I seems like the model is not properly loaded: the last model achieves a score of 20 but when I continue training it has a score of -21 and other parameters (e.g exploration rate) don't continue annealing.
I just wanted to make sure that I'm not missing something. Is this the right way to do it?
python train.py --algo dqn --env PongNoFrameskip-v4 -i agents/dqn/PongNoFrameskipv4_1/rl_model_60000_steps.zip -n 1000000
Checklist
❓ Question
I wanted to train a DQN agent for 10M, given the time limit of the cluster I'm using I have to continue training the agent by loading the last saved model.I tried to follow the documentation (see the command bellow) but I seems like the model is not properly loaded: the last model achieves a score of 20 but when I continue training it has a score of -21 and other parameters (e.g exploration rate) don't continue annealing.
I just wanted to make sure that I'm not missing something. Is this the right way to do it?
python train.py --algo dqn --env PongNoFrameskip-v4 -i agents/dqn/PongNoFrameskipv4_1/rl_model_60000_steps.zip -n 1000000Checklist