cs.AI (2024-06-30)
📊 6 papers in total
🎯 Navigation by interest area
🔬 Pillar 9: Embodied Foundation Models (4 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
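| 1 | BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization | Proposes BAPO, which uses base-anchored preference optimization to address knowledge forgetting in LLM personalization | large language model | | |
| 2 | Actionable Cyber Threat Intelligence using Knowledge Graphs and Large Language Models | Uses knowledge graphs and large language models to extract actionable cyber threat intelligence | large language model | | |
| 3 | Large Language Models for Behavioral Economics: Internal Validity and Elicitation of Mental Models | Uses large language models to improve the internal validity of behavioral economics experiments | large language model | | |
| 4 | Evaluation of Bias Towards Medical Professionals in Large Language Models | Finds that large language models show bias toward medical personnel in medical-specialty evaluations | large language model | | |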
🔬 Pillar 2: RL & Architecture (2 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
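| 5 | Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach | Proposes a multi-agent training method for Pommerman based on curriculum learning and population-based self-play | reinforcement learning, curriculum learning | | |
| 6 | Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints | Proposes a diffusion-model-based framework for offline multi-agent reinforcement learning with safety constraints | reinforcement learning | | |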