cs.LG(2025-08-07)
📊 共 5 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | SPaRFT: Self-Paced Reinforcement Fine-Tuning for Large Language Models | 提出SPaRFT以解决大语言模型训练效率低下问题 | reinforcement learning curriculum learning large language model | ||
| 2 | R-Zero: Self-Evolving Reasoning LLM from Zero Data | 提出R-Zero以解决自我进化推理模型的数据依赖问题 | reinforcement learning large language model | ||
| 3 | Anti-Jamming Sensing with Distributed Reconfigurable Intelligent Metasurface Antennas | 提出分布式可重构智能超表面天线以解决抗干扰感知问题 | reinforcement learning deep reinforcement learning DRL |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Disentangling Bias by Modeling Intra- and Inter-modal Causal Attention for Multimodal Sentiment Analysis | 提出MMCI模型以解决多模态情感分析中的偏差问题 | multimodal | ||
| 5 | A Metric for MLLM Alignment in Large-scale Recommendation | 提出泄漏影响评分以解决多模态推荐系统对齐问题 | large language model multimodal |