cs.AI (2024-11-20)
📊 21 papers in total
🎯 Interest area navigation
Pillar 9: Embodied Foundation Models (15)
Pillar 2: RL Algorithms & Architecture (5)
Pillar 4: Generative Motion (1)
🔬 Pillar 9: Embodied Foundation Models (15 papers)
🔬 Pillar 2: RL Algorithms & Architecture (5 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback | DrugGen: accelerates drug discovery using large language models and reinforcement learning feedback | reinforcement learning, large language model | | |
| 17 | Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning | Proposes an explainable LLM-driven multi-dimensional distillation framework that improves relevance learning for e-commerce search | distillation, large language model, chain-of-thought | | |
| 18 | DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs | DSTC: direct preference learning using only self-generated tests and code to improve the performance of code LLMs | preference learning, DPO, direct preference optimization | | |
| 19 | BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | BALROG: a new benchmark for evaluating the reasoning ability of agentic LLMs/VLMs in game environments | reinforcement learning, large language model | | |
| 20 | NumCoKE: Ordinal-Aware Numerical Reasoning over Knowledge Graphs with Mixture-of-Experts and Contrastive Learning | Proposes the NumCoKE framework, which strengthens numerical reasoning over knowledge graphs through mixture-of-experts and contrastive learning | contrastive learning | | |
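🔬 Pillar 4: Generative Motion (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 21 | Heuristically Adaptive Diffusion-Model Evolutionary Strategy | Proposes a heuristically adaptive diffusion-model evolutionary strategy that improves the exploration ability and convergence efficiency of evolutionary algorithms | classifier-free guidance | | |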