cs.AI（2026-02-26）

📊 共 34 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (24 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (9 🔗1) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (24 篇)

#	题目	一句话要点	标签	🔗
1	SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy	SPM-Bench：针对扫描探针显微镜的大语言模型权威自动评测基准	large language model multimodal
2	Multi-Agent Large Language Model Based Emotional Detoxification Through Personalized Intensity Control for Consumer Protection	提出基于多Agent LLM的情绪解毒系统MALLET，以个性化强度控制保护消费者免受过度情绪刺激。	large language model
3	SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation	SC-Arena：面向单细胞推理的自然语言基准，采用知识增强评估	large language model foundation model
4	Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction	评估小语言模型在领导者-跟随者交互中的零样本和单样本自适应能力	large language model
5	AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications	提出AMA-Bench评估Agent在长时程记忆应用中的性能，并提出AMA-Agent提升效果。	large language model
6	CourtGuard: A Model-Agnostic Framework for Zero-Shot Policy Adaptation in LLM Safety	CourtGuard：一种零样本策略适应的LLM安全模型无关框架	large language model
7	Utilizing LLMs for Industrial Process Automation	利用大型语言模型加速工业过程自动化软件开发	large language model
8	Mitigating Legibility Tax with Decoupled Prover-Verifier Games	提出解耦的证明者-验证者博弈，缓解大模型输出可验证性与准确性之间的权衡问题。	large language model
9	A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring	提出决策理论视角的隐写术以解决LLM监控问题	large language model
10	Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design	提出基于LLM的自动启发式算法设计AILS-AHD，提升CVRP求解器性能	large language model
11	LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure	LLMServingSim 2.0：异构和解耦LLM Serving基础设施的统一模拟器	large language model
12	Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search	提出CC-BOS框架，利用文言文和果蝇优化算法实现大语言模型的黑盒越狱攻击。	large language model
13	Discovery of Interpretable Physical Laws in Materials via Language-Model-Guided Symbolic Regression	提出语言模型引导的符号回归，用于发现材料科学中可解释的物理定律	large language model
14	MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks	MiroFlow：面向通用深度研究任务的高性能鲁棒开源Agent框架	large language model
15	Distributed LLM Pretraining During Renewable Curtailment Windows: A Feasibility Study	提出基于可再生能源消纳窗口的分布式LLM预训练方法	large language model
16	AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification	AgentSentry：通过时序因果诊断和上下文净化缓解LLM Agent中的间接提示注入攻击	large language model
17	IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation	IMMACULATE：一种通过可验证计算实现LLM审计的实用框架	large language model	✅
18	Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions	综述：面向个性化LLM驱动Agent，探讨其基础、评估及未来方向	large language model
19	MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios	提出 MobilityBench，用于评估真实世界出行场景下的路线规划Agent	large language model	✅
20	Addressing Climate Action Misperceptions with Generative AI	利用生成式AI解决气候行动认知偏差，提升环保行为意愿	large language model
21	Requesting Expert Reasoning: Augmenting LLM Agents with Learned Collaborative Intervention	提出AHCE框架，通过学习协作干预增强LLM Agent在专业领域的推理能力	large language model
22	Generative Agents Navigating Digital Libraries	Agent4DL：利用生成式Agent模拟数字图书馆用户搜索行为	large language model
23	Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents	利用认知模型和AI算法为语言智能体设计提供模板	large language model
24	Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search	提出CC-BOS框架，利用文言文和果蝇优化算法实现大语言模型的黑盒越狱攻击。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

#	题目	一句话要点	标签	🔗
25	FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning	提出FactGuard以解决视频虚假信息检测中的推理不足问题	reinforcement learning large language model multimodal
26	RLHFless: Serverless Computing for Efficient RLHF	提出RLHFless，利用Serverless计算高效训练RLHF，提升资源利用率并降低成本。	reinforcement learning RLHF large language model
27	The Trinity of Consistency as a Defining Principle for General World Models	提出通用世界模型的“一致性三位一体”原则，并构建多帧推理与生成基准CoW-Bench。	world model multimodal
28	Agentic AI for Intent-driven Optimization in Cell-free O-RAN	提出Agentic AI框架，用于Cell-free O-RAN中意图驱动的优化，提升资源利用率。	reinforcement learning deep reinforcement learning DRL
29	Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive	揭示基于优化的AI系统在规范响应上的局限性，提出架构性约束条件。	reinforcement learning RLHF large language model
30	QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning	QSIM：通过动作相似性加权Q学习缓解多智能体强化学习中的过度估计	reinforcement learning	✅
31	Automated Vulnerability Detection in Source Code Using Deep Representation Learning	提出基于卷积神经网络的源代码漏洞自动检测方法，提升C语言漏洞检测召回率。	representation learning
32	Towards LLM-Empowered Knowledge Tracing via LLM-Student Hierarchical Behavior Alignment in Hyperbolic Space	提出L-HAKT，利用LLM和双曲空间对齐学生行为，提升知识追踪效果	contrastive learning large language model
33	Learning to Generate Secure Code via Token-Level Rewards	提出Vul2Safe框架，通过token级奖励学习生成安全代码，解决安全数据稀缺和奖励信号粗糙问题。	reinforcement learning large language model

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
34	The AI Research Assistant: Promise, Peril, and a Proof of Concept	利用人机协作发现Hermite求积法则的新误差表示和界限	manipulation

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2026-02-26）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (24 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理