Appreciate your work! And may i ask about the details in your cosine learning rate strategy, like Tmax setting or some others needed attention?