Hello! I trained the model on the WMT16 dataset and modified the parameters to the following values:

The main modifications were to dim and seq_len. I also changed learning_step to 120000 to try to improve the results.
But I still got very poor results.

When I change these parameters, do I also have to change other parameters along with them?
When I trained the model with your original parameters, the results were not good enough (I believe because of dim and seq_len), but they were still better than my current results.