Convert gradient accumulation to Accelerate

The train function in run.py currently scales the loss by hand when gradient accumulation is enabled:
````
            if args.gradient_accumulation_steps > 1:
                loss = loss / args.gradient_accumulation_steps
````

For details, see the Accelerate gradient accumulation guide:
https://huggingface.co/docs/accelerate/v0.11.0/en/gradient_accumulation