cs.AI (2024-10-06)

📊 10 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (8 🔗1) · Pillar 2: RL & Architecture (2)

🔬 Pillar 9: Embodied Foundation Models (8 papers)

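| # | Title | One-line summary | Tags | 🔗 |
|---|-------|------------------|------|----|
| 1 | Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI | Proposes a multimodal 3D fusion and in-situ learning framework that enables spatially aware AI in AR applications | multimodal | |
| 2 | Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion | TaylorMLP: secures the release of LLM weights and defends against misuse via Taylor expansion | large language model | |
| 3 | GenSim: A General Social Simulation Platform with Large Language Model based Agents | GenSim: a large-scale, general-purpose social simulation platform built on LLM agents, with an error-correction mechanism | large language model | |
| 4 | Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark | PolyMATH: a comprehensive benchmark that challenges the mathematical reasoning of multimodal LLMs | large language model, chain-of-thought | |
| 5 | Evaluating the Correctness of Inference Patterns Used by LLMs for Judgment | Proposes a method for evaluating the correctness of LLM inference patterns, applied to judgment analysis in the legal domain | large language model | |
| 6 | OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions | Proposes OD-Stega, a near-imperceptible steganography method based on LLMs and optimized distributions | large language model | |
| 7 | Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement | Proposes Gödel Agent, a self-referential agent framework for recursive self-improvement | large language model | |
| 8 | An evaluation of LLM code generation capabilities through graded exercises | Evaluates LLM code generation through graded exercises, showing that evaluation methods may overestimate a model's actual skill | large language model | |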

🔬 Pillar 2: RL & Architecture (2 papers)

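| # | Title | One-line summary | Tags | 🔗 |
|---|-------|------------------|------|----|
| 9 | Toward Debugging Deep Reinforcement Learning Programs with RLExplorer | RLExplorer: the first fault diagnosis method for debugging deep reinforcement learning programs | reinforcement learning, deep reinforcement learning, DRL | |
| 10 | Ranking Policy Learning via Marketplace Expected Value Estimation From Observational Data | Proposes a ranking policy learning framework based on marketplace expected value estimation from observational data | policy learning | |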
