cs.AI（2025-02-20）

📊 共 20 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (12 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (6) 支柱一：机器人控制 (Robot Control) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis	FetalCLIP：用于胎儿超声图像分析的视觉-语言基础模型	foundation model multimodal
2	Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework	提出CHARGE框架与Chart-MRAG Bench，用于评估图表场景下的多模态RAG	multimodal	✅
3	EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration	EAGER-LLM：通过外生行为-语义集成增强LLM作为推荐器的能力	large language model
4	Multimodal Quantitative Language for Generative Recommendation	提出MQL4GRec，通过多模态量化语言实现生成式推荐的知识迁移。	multimodal
5	WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models	WavRAG：面向语音对话模型的音频集成检索增强生成框架	large language model chain-of-thought
6	EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations	EquivaMap：利用LLM自动进行优化公式的等价性检查	large language model
7	An LLM-Based Approach for Insight Generation in Data Analysis	提出基于LLM的洞察生成方法，用于从多表数据库中自动提取文本洞察	large language model
8	Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents	Vending-Bench：用于评估自主Agent长期连贯性的自动售货机基准测试	large language model
9	Plan-over-Graph: Towards Parallelable LLM Agent Schedule	提出Plan-over-Graph方法，实现LLM Agent任务规划的并行化调度	large language model	✅
10	Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning	提出RetrievalPRM，解决数学推理中过程奖励模型泛化性不足问题	large language model
11	FlowAgent: Achieving Compliance and Flexibility for Workflow Agents	FlowAgent：兼顾工作流代理的合规性与灵活性	large language model	✅
12	Investigating the Impact of LLM Personality on Cognitive Bias Manifestation in Automated Decision-Making Tasks	研究LLM人格特质对自动化决策中认知偏差的影响，并探索缓解策略。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
13	External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation	提出ExFM框架，高效服务在线广告推荐中参数规模达万亿级别的外部大型基础模型。	distillation foundation model
14	SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics	SPRIG：基于内部博弈动态的Stackelberg感知-强化学习框架	reinforcement learning deep reinforcement learning PPO
15	Making Universal Policies Universal	提出跨智能体通用策略学习方法，解决异构动作空间下的通用决策问题	policy learning imitation learning generalist agent
16	Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning	提出基于强化学习的量子纠错码优化方法，显著降低物理量子比特开销。	reinforcement learning
17	HPS: Hard Preference Sampling for Human Preference Alignment	提出Hard Preference Sampling (HPS)框架，用于提升LLM人类偏好对齐的鲁棒性和效率。	RLHF large language model
18	Causal Mean Field Multi-Agent Reinforcement Learning	提出因果平均场Q学习（CMFQ）算法，提升多智能体强化学习在非平稳环境下的可扩展性。	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
19	Multi-Agent Coordination across Diverse Applications: A Survey	多智能体协同综述：跨领域应用中的协同机制与未来方向	humanoid large language model
20	Towards Secure Program Partitioning for Smart Contracts with LLM's In-Context Learning	PartitionGPT：利用LLM上下文学习实现智能合约安全程序划分	manipulation large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页