cs.AI (2026-02-11)
📊 18 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (11)
Pillar 2: RL & Architecture (5 🔗1)
Pillar 1: Robot Control (2)
🔬 Pillar 9: Embodied Foundation Models (11 papers)
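🔬 Pillar 2: RL & Architecture (5 papers)
| # | Title | Key point in one sentence | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Neuro-symbolic Action Masking for Deep Reinforcement Learning | Proposes neuro-symbolic action masking (NSAM) to improve DRL sample efficiency and reduce constraint violations. | reinforcement learning, deep reinforcement learning, DRL | | |
| 13 | Found-RL: foundation model-enhanced reinforcement learning for autonomous driving | Found-RL: uses foundation models to enhance reinforcement learning for autonomous driving, addressing sample efficiency and interpretability. | reinforcement learning, reward shaping, foundation model | ✅ | |
| 14 | Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search | Proposes an interactive LLM-assisted curriculum learning framework for multi-task evolutionary policy search. | curriculum learning, embodied AI, multimodal | | |
| 15 | Cross-Sectional Asset Retrieval via Future-Aligned Soft Contrastive Learning | Proposes the FASCL framework, which retrieves assets cross-sectionally based on the correlation of their future returns. | representation learning, contrastive learning | | |
| 16 | Fine-Tuning GPT-5 for GPU Kernel Generation | Fine-tunes GPT-5 with reinforcement learning, significantly improving the quality and efficiency of GPU kernel code generation. | reinforcement learning, large language model | | |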
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | Key point in one sentence | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch | Proposes the ScratchWorld benchmark to evaluate the programming ability of multimodal GUI agents in Scratch. | manipulation, multimodal | | |
| 18 | Protecting Context and Prompts: Deterministic Security for Non-Deterministic AI | Proposes authenticated prompts and authenticated context to provide deterministic security for large language model applications. | manipulation, large language model | | |