**Description**

The paper compares against a few baselines, but the benchmarking scripts are messy. Refer to the following PRs:

- #115
- #117
- #118

**Action items**

1. Provide cleaner documentation for running these baselines.
2. Clean up these PRs for benchmarking use.