Skip to content

This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).

Notifications You must be signed in to change notification settings

ZJU-REAL/TimeHC-RL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 

Repository files navigation

Logo TimeHC-RL: Temporal-aware Hierarchical
Cognitive Reinforcement Learning for Enhancing
LLMs’ Social Intelligence

🔗 arXiv | 📄 PDF

Guiyang Hou1*, Xing Gao2, Yuchuan Wu2, Xiang Huang2,3, Wenqi Zhang1, Zhe Zheng1, Yongliang Shen1, Jialu Du1, Fei Huang2, Yongbin Li2†, Weiming Lu1†,
1Zhejiang University, 2Tongyi Lab, Alibaba Group, 3Nanjing University
Preprint. Under review.
*This work was done when the first author was an intern at Tongyi Lab, Corresponding Author

Results

Citation

If you find our work helpful, feel free to give us a cite.

@misc{hou2025timehcrltemporalawarehierarchicalcognitive,
      title={TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence}, 
      author={Guiyang Hou and Xing Gao and Yuchuan Wu and Xiang Huang and Wenqi Zhang and Zhe Zheng and Yongliang Shen and Jialu Du and Fei Huang and Yongbin Li and Weiming Lu},
      year={2025},
      eprint={2505.24500},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.24500}, 
}

Contact Us

If you have any questions, please contact us by email: [email protected], [email protected]

About

This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published