cs.AI（2026-02-23）

📊 共 17 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (10 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (6) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement	提出ReDAct与FrameShield，通过激活解耦检测大语言模型中的隐蔽越狱攻击。	large language model
2	A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data	提出多模态框架，对齐人类语言描述与视觉感知数据，解决跨模态参照理解问题。	multimodal
3	DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework	DReX：一种基于深度学习的可解释多模态推荐框架	multimodal
4	Can Large Language Models Replace Human Coders? Introducing ContentBench	ContentBench：评估低成本大语言模型在内容分析编码任务中的能力。	large language model
5	Classroom Final Exam: An Instructor-Tested Reasoning Benchmark	提出 Classroom Final Exam (CFE)，用于评估大语言模型在 STEM 领域的推理能力。	large language model multimodal	✅
6	CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching	CausalFlip：提出因果判断基准，评估LLM在语义匹配之外的因果推理能力	large language model chain-of-thought
7	AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization	AdaEvolve：自适应LLM驱动的零阶优化框架，提升进化搜索效率	large language model
8	The LLMbda Calculus: AI Agents, Conversations, and Information Flow	提出LLMbda演算以解决AI代理安全性问题	large language model
9	LLM-enabled Applications Require System-Level Threat Monitoring	LLM应用面临系统级威胁，提出安全监控方案以保障可靠运行	large language model
10	Red-Teaming Claude Opus and ChatGPT-based Security Advisors for Trusted Execution Environments	针对TEE安全，提出TEE-RedBench评估方法，用于评估LLM安全顾问的可靠性	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
11	IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking	IR$^3$：通过对比逆强化学习实现奖励篡改的可解释检测与缓解	reinforcement learning inverse reinforcement learning RLHF
12	Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark	提出人机协作的Agentic AI，用于多模态临床预测，并在AgentDS医疗基准上验证有效性。	MAE multimodal
13	Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis	提出Tri-Subspace Disentanglement框架，解决多模态情感分析中模态间信息融合不充分的问题。	MAE multimodal
14	Ada-RS: Adaptive Rejection Sampling for Selective Thinking	Ada-RS：自适应拒绝采样提升工具型LLM选择性推理效率	DPO large language model chain-of-thought
15	Beyond Mimicry: Toward Lifelong Adaptability in Imitation Learning	面向终身适应性的模仿学习：超越单纯模仿，实现组合泛化	imitation learning
16	Meta-Learning and Meta-Reinforcement Learning - Tracing the Path towards DeepMind's Adaptive Agent	综述元学习与元强化学习，追溯DeepMind自适应Agent的技术演进路径	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
17	Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains	针对Agentic AI运行时供应链的攻击面研究：威胁、利用与防御	manipulation large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页