cs.AI (2026-02-24)

📊 27 papers in total | 🔗 3 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (18 🔗3) | Pillar 2: RL & Architecture (6) | Pillar 1: Robot Control (3)

🔬 Pillar 9: Embodied Foundation Models (18 papers)

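#	Title	One-Sentence Summary	Tags	🔗	⭐
1	NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning	Proposes NoRD to address the high cost of data collection and reasoning annotation	vision-language-action VLA
2	Predicting Sentence Acceptability Judgments in Multimodal Contexts	Studies how visual context affects sentence acceptability judgments by humans and LLMs	large language model multimodal
3	Physics-based phenomenological characterization of cross-modal bias in multimodal models	Proposes a physics-based characterization method for analyzing cross-modal bias in multimodal large language models	large language model multimodal
4	Multimodal MRI Report Findings Supervised Brain Lesion Segmentation with Substructures	Proposes MS-RSuper, which uses multimodal MRI report findings to supervise segmentation of brain lesions and their substructures	multimodal
5	E-MMKGR: A Unified Multimodal Knowledge Graph Framework for E-commerce Applications	Proposes E-MMKGR, a unified multimodal knowledge graph framework for e-commerce applications	multimodal
6	Qwen-BIM: developing large language model for BIM-based design with domain-specific benchmark and dataset	Qwen-BIM: builds a domain-specific large language model for BIM-based design, with a corresponding benchmark and dataset	large language model
7	Modality-Guided Mixture of Graph Experts with Entropy-Triggered Routing for Multimodal Recommendation	Proposes MAGNET, which improves multimodal recommendation through a modality-guided mixture of graph experts and entropy-triggered routing	multimodal
8	Counterfactual Simulation Training for Chain-of-Thought Faithfulness	Proposes Counterfactual Simulation Training (CST) to improve the faithfulness of chain-of-thought (CoT) reasoning	chain-of-thought	✅
9	PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding	PromptCD: polarity-prompt contrastive decoding that improves test-time behavior controllability of LLMs and VLMs	large language model visual grounding
10	A Benchmark for Deep Information Synthesis	Proposes the DEEPSYNTH benchmark to evaluate LLM agents on complex information synthesis and reasoning tasks	large language model
11	SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery	SparkMe: adaptive semi-structured interviewing that uses multi-agent LLMs for qualitative insight discovery	large language model	✅
12	"Are You Sure?": An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems	First large-scale human study revealing vulnerability to agent-mediated deception in LLM-driven agentic systems	large language model
13	LogicGraph : Benchmarking Multi-Path Logical Reasoning via Neuro-Symbolic Generation and Verification	LogicGraph: proposes a neuro-symbolic generation and verification framework for evaluating multi-path logical reasoning	large language model	✅
14	Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence	Proposes the AgentOS framework, which recasts the LLM as a reasoning kernel to improve system-level intelligence	large language model
15	HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG	Proposes the HELP framework, which improves GraphRAG accuracy and efficiency through hypernode expansion and logical path guidance	large language model
16	AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs	AdapTools: adaptive tool-based indirect prompt injection attacks on agentic LLMs	large language model
17	Grounding LLMs in Scientific Discovery via Embodied Actions	EmbodiedAct: grounds LLMs in scientific discovery through embodied actions, addressing reliability and stability issues in long-horizon simulation	large language model
18	Hybrid LLM-Embedded Dialogue Agents for Learner Reflection: Designing Responsive and Theory-Driven Interactions	Proposes hybrid LLM-embedded dialogue agents that support learner reflection through responsive, theory-driven interactions	large language model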

🔬 Pillar 2: RL & Architecture (6 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
19	CG-DMER: Hybrid Contrastive-Generative Framework for Disentangled Multimodal ECG Representation Learning	Proposes the CG-DMER framework for disentangled multimodal ECG representation learning, improving cardiovascular disease diagnosis accuracy	representation learning multimodal
20	Buffer Matters: Unleashing the Power of Off-Policy Reinforcement Learning in Large Language Model Reasoning	Proposes BAPO, an off-policy RLVR method for improving large language model reasoning	reinforcement learning large language model
21	Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback	Proposes a multi-objective reinforcement learning framework for optimizing urban traffic control	reinforcement learning policy learning reward design
22	OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services	OptiLeak: uses reinforcement learning to efficiently reconstruct leaked prompts in multi-tenant LLM services	reinforcement learning direct preference optimization
23	PyVision-RL: Forging Open Agentic Vision Models via RL	PyVision-RL: builds open agentic vision models through reinforcement learning	reinforcement learning multimodal
24	PRECTR-V2:Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization	PRECTR-V2: a unified relevance-CTR framework combining user preference mining, bias correction, and LLM distillation	representation learning distillation

🔬 Pillar 1: Robot Control (3 papers)

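#	Title	One-Sentence Summary	Tags	🔗	⭐
25	Recursive Belief Vision Language Model	Proposes RB-VLA, which uses belief modeling to address partial observability in VLA models during long-horizon manipulation	manipulation diffusion policy world model
26	Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination	MIMIC: uses inner speech as guidance for diverse behavior imitation and steerable control in human-AI coordination	manipulation imitation learning behavior cloning
27	Pressure Reveals Character: Behavioural Alignment Evaluation at Depth	Proposes a stress-test benchmark that reveals alignment problems of language models in complex scenarios	manipulation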
