CV
Education
- Ph.D. in Department of Computer Science and Technology, Tsinghua University, 2024-2029 (expected)
- B.S. in Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, 2020-2024
- GPA: 3.95/4.0, Rank: 9/79
Publications
- A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Kairong Luo*, Haodong Wen, Shengding Hu, Zhenbo Sun, Zhiyuan Liu, Maosong Sun†, Kaifeng Lyu†, Wenguang Chen†- Accepted by ICLR 2025 (March 2025)
- arXiv preprint
- Summary: This paper introduces an empirical law to predict the pretraining loss of large language models under various learning rate schedules (e.g., constant, cosine, step decay). The proposed multi-power law combines a power law based on the sum of learning rates with additional terms to account for loss reduction due to learning rate decay. Validated across multiple model sizes and architectures, this law accurately predicts loss curves for unseen schedules and helps identify optimal schedules that outperform widely used ones like cosine. The findings provide insights into pretraining dynamics and learning rate schedule design. The automatically discovered schedule resembles the Warmup-Stable-Decay (WSD) schedule but achieves slightly better performance.
- DreamFuser: Value-guided Diffusion Policy for Offline Reinforcement Learning
Kairong Luo*, Caiwei Xiao*, Zhiao Huang, Zhan Ling, Yunhao Fang, Hao Su†- Status: Preprint / Under Review (November 2023)
- OpenReview
- Summary: DreamFuser is a trajectory-based value optimization approach that integrates diffusion-based trajectory learning with efficient Q-function learning. It addresses computational challenges in action sampling during training by leveraging the Generalized Noisy Action Markov Decision Process (GNMDP), which treats the diffusion denoising process as part of the MDP transition. Empirical results show DreamFuser outperforms existing diffusion policy algorithms, particularly in low-level control tasks, and matches or exceeds state-of-the-art methods on the D4RL benchmark. The work also highlights the computational and memory advantages of DreamFuser over traditional MDP-based diffusion policies.
Teaching
- Teaching Assistant, Probability and Statistics (English), Tsinghua University, Autumn 2023
- Teaching Assistant, Introduction to Computing Systems (Chinese, 计算机系统概论), Tsinghua University, Autumn 2024
- Teaching Assistant, Data Structure and Algorithm (Chinese, 数据结构与算法), Tsinghua University, Spring 2024
Service and Leadership
- Chair of IIIS Student Union (学生联席会), November 2023 - June 2024
- Mentor for Pre-college Program, Department of Computer Science / IIIS, Tsinghua University, September 2024 - June 2025 (expected)