Guanhua Huang, Tingqiang Xu, Jinbo Wang, Guangming Sheng, Siheng Li, Evander Yang, Kejiao Li, Yunxiang Li, Zenan Xu, Qi Yi, Kyrierl Deng, Ziyuan Nan, Yuhao Jiang, Chenchen Zhang, Taiqiang Wu, Feiyuan Zhang, Junhao Wang, Bo Zhou, Alex Chen, Di Wang, Shunyu Yao
(2026).
Stabilizing RLVR via Token-level Gradient Diagnosis and Layerwise Clipping.
Tencent Hunyuan.
Peiyuan Feng, Yichen He, Guanhua Huang, Yuan Lin, Hanchong Zhang, Yuchen Zhang, Hang Li
(2024).
AGILE: A Novel Framework of LLM Agents.
In
Neural Information Processing Systems 37 (NeurIPS 2024).