关于 | 洪喆沛

洪喆沛

华南师范大学人工智能学院

中国广东佛山

email: hongzhepei@gmail.com

语言： English / 中文

我目前就读于华南师范大学人工智能学院，是一名本科生。我的研究兴趣包括大模型后训练与 Agentic AI 系统，近期工作重点关注同策略蒸馏、强化学习训练范式，以及可靠的大语言模型智能体。

我的研究主要围绕两个方向展开。第一个方向是大模型后训练技术，包括同策略蒸馏、强化学习训练范式以及黑盒模型蒸馏等问题。我的最新论文 ROPD 探索了基于 rubric 的同策略蒸馏方法，旨在以黑盒兼容且更具样本效率的方式完成大模型后训练与能力迁移。

第二个方向是 Agentic AI 系统，包括大语言模型智能体、多智能体协作、工具调用与长程任务求解。我对如何构建更可靠、可评估、可持续执行复杂任务的智能体系统感兴趣。我的最新论文 TRACE 将长程智能体安全检测重构为轨迹级证据压缩，通过 Compressor-Reader 设计聚合长轨迹中稀疏、延迟与组合性的风险信号。

背景：

2023-2027 年就读于华南师范大学，软件工程本科在读。
目前作为学生研究者，主要关注大模型后训练、强化学习与 Agentic AI 系统。

selected publications

arXiv

Rubric-based On-policy Distillation

Junfeng Fang, Zhepei Hong, Mao Zheng, and 7 more authors

arXiv preprint, May 2026

Preprint, Co-first author

Abs arXiv PDF Code

On-policy distillation (OPD) is a powerful paradigm for model alignment, yet its reliance on teacher logits restricts its application to white-box scenarios. We introduce ROPD, a rubric-based OPD framework that induces prompt-specific rubrics from teacher-student contrasts and uses them to score student rollouts for on-policy optimization. Empirically, ROPD outperforms advanced logit-based OPD methods across most scenarios and achieves up to a 10x gain in sample efficiency, positioning rubric-based OPD as a flexible, black-box-compatible alternative for scalable distillation across proprietary and open-source LLMs.
arXiv

TRACE: Trajectory Risk-Aware Compression for Long-Horizon Agent Safety

Zhepei Hong, Lin Wang, Liting Li, and 5 more authors

arXiv preprint, May 2026

Preprint, First author

Abs arXiv PDF Code

Long-horizon LLM agents produce safety evidence across long trajectories, where sparse, delayed, and compositional risk signals often escape local moderation. We reframe long-horizon agent safety detection as trajectory-level evidence compression and propose Trajectory Risk-Aware Compression for Long-Horizon Agent Safety (TRACE). TRACE uses a Compressor-Reader design: the Compressor encodes the full trajectory into a compact latent evidence state under trajectory-level supervision, and the Reader judges the raw trajectory with this latent evidence state as a safety reference. Across ASSE-Bench, Pre-Ex-Bench, and R-Judge, TRACE achieves the best accuracy on all evaluated backbones, improving over strong baselines by up to 12.6 percentage points. On LongSafety, TRACE shows smaller performance degradation as context length grows.
BSPC

HEAT: Hierarchical Emotion Adaptation with Progressive Thresholding for EEG Emotion and Consciousness Detection

Zhepei Hong, Rongtao Chen, Liting Li, and 3 more authors

Biomedical Signal Processing and Control, Apr 2026

Accepted, First author

PDF Code
BIBM

PR-DA: Prototype Regularization Domain Adaptation for Cross-Subject EEG-Based Emotion Recognition

Rongtao Chen, Zhepei Hong, Qi You, and 3 more authors

In IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Oct 2025

Accepted, Oral presentation, Co-first author

PDF Code
TAFFC

Multi-scale Dynamic Temporal Network with Graph Matching Domain Adaptation for Cross-Subject EEG Emotion Recognition

Rongtao Chen, Zhepei Hong, Liting Li, and 3 more authors

IEEE Transactions on Affective Computing, Mar 2026

Accepted, Co-first author

PDF Code