cs.LG(2025-10-03)

📊 6 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 2: RL Algorithms & Architecture (5 🔗1) · Pillar 9: Embodied Foundation Models (1)

🔬 Pillar 2: RL Algorithms & Architecture (5 papers)

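| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | Studying the Korean Word-Chain Game with RLVR: Mitigating Reward Conflicts via Curriculum Learning | Uses curriculum learning to mitigate reward conflicts when solving the Korean word-chain game with RLVR. | reinforcement learning, curriculum learning, large language model | |
| 2 | Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward | Proposes low-probability regularization (Lp-Reg) to counter the disappearance of exploratory tokens in RLVR, improving performance on complex reasoning tasks. | reinforcement learning, large language model | |
| 3 | Certifiable Safe RLHF: Fixed-Penalty Constraint Optimization for Safer Language Models | Proposes Certifiable Safe-RLHF, which improves language model safety through fixed-penalty optimization. | RLHF, large language model | |
| 4 | Long-Term Mapping of the Douro River Plume with Multi-Agent Reinforcement Learning | Proposes multi-agent reinforcement learning for long-term mapping of a river plume. | reinforcement learning, spatiotemporal | |
| 5 | Longitudinal Flow Matching for Trajectory Modeling | Proposes Interpolative Multi-Marginal Flow Matching (IMMFM) to address sparse sampling and high dimensionality in trajectory modeling. | flow matching | |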

🔬 Pillar 9: Embodied Foundation Models (1 paper)

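| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 6 | Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs | Proposes a unified approach to machine unlearning and adversarial robustness in large language models, based on constrained interventions. | large language model | |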
