cs.AI(2025-06-28)
📊 共 14 篇论文
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Offline Reinforcement Learning for Mobility Robustness Optimization | 利用离线强化学习优化移动鲁棒性,提升蜂窝网络性能 | reinforcement learning offline RL offline reinforcement learning | ||
| 12 | ReasonBridge: Efficient Reasoning Transfer from Closed to Open-Source Language Models | ReasonBridge:通过高效推理迁移,提升开源语言模型的推理能力 | distillation large language model instruction following | ||
| 13 | SPEAR: Structured Pruning for Spiking Neural Networks via Synaptic Operation Estimation and Reinforcement Learning | 提出SPEAR框架,通过强化学习和突触操作估计实现脉冲神经网络的结构化剪枝。 | reinforcement learning | ||
| 14 | WavShape: Information-Theoretic Speech Representation Learning for Fair and Privacy-Aware Audio Processing | WavShape:面向公平与隐私保护的语音表征信息论学习框架 | representation learning |