cs.AI(2025-01-12)
📊 共 12 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (8)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Risk-Averse Finetuning of Large Language Models | 提出基于条件风险价值的风险规避微调方法,降低大语言模型生成有害内容风险 | reinforcement learning RLHF large language model | ||
| 10 | An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | 针对持续性任务,本文深入研究了深度强化学习算法的性能,并验证了奖励中心化方法的有效性。 | reinforcement learning deep reinforcement learning | ||
| 11 | DVM: Towards Controllable LLM Agents in Social Deduction Games | DVM:面向社交推理游戏的可控LLM智能体框架 | reinforcement learning large language model |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Generative Artificial Intelligence-Supported Pentesting: A Comparison between Claude Opus, GPT-4, and Copilot | 评估通用GenAI在渗透测试中的应用:Claude Opus、GPT-4与Copilot对比 | penetration |