https://github.com/lilianweng/deep-reinforcement-learning-gym/blob/4fec4876ad28fe83309efd2cdf2a6f4281a5b23c/playground/policies/ddpg.py#L47 a is rescaled, but mu is not.