cs.LG(2025-07-27)

📊 共 9 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5) 支柱九:具身大模型 (Embodied Foundation Models) (4)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Cultivating Helpful, Personalized, and Creative AI Tutors: A Framework for Pedagogical Alignment using Reinforcement Learning EduAlign框架:利用强化学习提升LLM在教育领域的个性化和创造性 reinforcement learning large language model
2 MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge 提出MaPPO框架以优化大语言模型的偏好对齐问题 preference learning DPO direct preference optimization
3 Learning from Expert Factors: Trajectory-level Reward Shaping for Formulaic Alpha Mining 提出轨迹级奖励塑造方法TLRS,提升公式化Alpha挖掘的效率与预测能力 reinforcement learning reward shaping
4 FAST: Similarity-based Knowledge Transfer for Efficient Policy Learning FAST:基于相似度的知识迁移,用于高效策略学习 policy learning
5 Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic 提出空间-时间强化学习(STRL)框架,解决非马尔可夫网络流量下的路由问题。 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
6 Benchmarking Large Language Models for Geolocating Colonial Virginia Land Grants 利用大语言模型对殖民时期弗吉尼亚土地授权进行地理定位 large language model chain-of-thought
7 MIPS: a Multimodal Infinite Polymer Sequence Pre-training Framework for Polymer Property Prediction 提出MIPS框架以解决聚合物性质预测问题 multimodal
8 Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning Meta Fusion:一种基于互学习的统一多模态融合框架 multimodal
9 Interpretable Anomaly-Based DDoS Detection in AI-RAN with XAI and LLMs 提出基于XAI和LLM的可解释AI-RAN异常DDoS检测系统 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页