cs.AI(2024-06-30)

📊 共 6 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (4) 支柱二:RL算法与架构 (RL & Architecture) (2)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
1 BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization 提出BAPO,通过基准锚定偏好优化解决LLM个性化中的知识遗忘问题 large language model
2 Actionable Cyber Threat Intelligence using Knowledge Graphs and Large Language Models 利用知识图谱和大型语言模型提取可执行的网络威胁情报 large language model
3 Large Language Models for Behavioral Economics: Internal Validity and Elicitation of Mental Models 利用大型语言模型提升行为经济学实验的内部效度 large language model
4 Evaluation of Bias Towards Medical Professionals in Large Language Models 大型语言模型在医学专业评估中存在对医疗人员的偏见 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
5 Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach 提出基于课程学习和群体自博弈的Pommerman多智能体训练方法 reinforcement learning curriculum learning
6 Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints 提出基于扩散模型的离线多智能体强化学习安全约束框架 reinforcement learning

⬅️ 返回 cs.AI 首页 · 🏠 返回主页