cs.AI (2025-12-23)

📊 29 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (18 🔗1) | Pillar 2: RL & Architecture (9 🔗1) | Pillar 1: Robot Control (1) | Pillar 3: Perception & Semantics (1)

🔬 Pillar 9: Embodied Foundation Models (18 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
1	Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography	Odysseus: jailbreaks commercial multimodal LLM-integrated systems via dual steganography	large language model multimodal
2	Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent	SAGE: a human-in-the-loop reasoning LLM agent for automated stereotactic radiosurgery planning	large language model chain-of-thought
3	Dual-Encoder Transformer-Based Multimodal Learning for Ischemic Stroke Lesion Segmentation Using Diffusion MRI	Proposes a dual-encoder Transformer-based method for ischemic stroke lesion segmentation that improves segmentation accuracy on DWI and ADC images	multimodal
4	Toward Explaining Large Language Models in Software Engineering Tasks	Proposes FeatureSHAP for explaining large language models in software engineering tasks	large language model	✅
5	Advancing Multimodal Teacher Sentiment Analysis: The Large-Scale T-MED Dataset & The Effective AAM-TSA Model	Builds the T-MED dataset and the AAM-TSA model to improve the accuracy of teacher sentiment analysis	multimodal
6	SynCraft: Guiding Large Language Models to Predict Edit Sequences for Molecular Synthesizability Optimization	SynCraft: guides large language models to predict edit sequences that optimize molecular synthesizability	large language model
7	TongSIM: A General Platform for Simulating Intelligent Machines	TongSIM: a general platform for simulating intelligent machines that supports training and evaluating embodied agents	embodied AI large language model multimodal
8	Concept Generalization in Humans and Large Language Models: Insights from the Number Game	Uses the Number Game to compare how humans and large language models generalize concepts	large language model
9	A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice	Janus-Pro-CXR: a DeepSeek-powered AI system for automated chest radiograph interpretation in clinical practice	large language model multimodal
10	Reason2Decide: Rationale-Driven Multi-Task Learning	Reason2Decide: a rationale-driven multi-task learning framework that improves prediction accuracy and explanation consistency in clinical decision support systems	large language model foundation model
11	Generative Digital Twins: Vision-Language Simulation Models for Executable Industrial Systems	Proposes vision-language simulation models that generate executable digital twins of industrial systems from sketches and text	multimodal
12	Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation	Proposes an automated workflow generation method that addresses the difficulty of moving large language models from tool users to workflow architects	large language model
13	Memory as Resonance: A Biomimetic Architecture for Infinite Context Memory on Ergodic Phonetic Manifolds	Proposes PTM, a resonance-based memory architecture on ergodic phonetic manifolds, to address infinite-context memory in large language models	large language model
14	MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents	MemR³: memory retrieval via reflective reasoning for LLM agents, improving question-answering quality	large language model
15	AXIOM: Benchmarking LLM-as-a-Judge for Code via Rule-Based Perturbation and Multisource Quality Calibration	AXIOM: benchmarks LLM-as-a-judge for code via rule-based perturbation and multisource quality calibration	large language model
16	Enhancing Zero-Shot Time Series Forecasting in Off-the-Shelf LLMs via Noise Injection	Enhances zero-shot time series forecasting in off-the-shelf LLMs via noise injection	large language model
17	On the Effectiveness of Instruction-Tuning Local LLMs for Identifying Software Vulnerabilities	Instruction-tunes local LLMs to identify software vulnerability types effectively, improving security and practicality	large language model
18	S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test	Proposes the S$^3$IT benchmark for evaluating the reasoning of embodied agents in complex social environments	large language model

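🔬 Pillar 2: RL & Architecture (9 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
19	Scaling Reinforcement Learning for Content Moderation with Large Language Models	Uses reinforcement learning with large language models to improve the efficiency and accuracy of large-scale content moderation	reinforcement learning reward shaping large language model
20	Identifying Appropriately-Sized Services with Deep Reinforcement Learning	Proposes Rake, which uses deep reinforcement learning to identify appropriately sized services from implementation artifacts	reinforcement learning deep reinforcement learning
21	Adaptive Financial Sentiment Analysis for NIFTY 50 via Instruction-Tuned LLMs, RAG and Reinforcement Learning Approaches	Proposes an adaptive financial sentiment analysis framework based on instruction-tuned LLMs, RAG, and reinforcement learning for NIFTY 50 index prediction	reinforcement learning PPO large language model
22	Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks	Proposes G-SPEC, a neuro-symbolic framework that enforces safety policies for LLM agents in 5G autonomous networks	reinforcement learning deep reinforcement learning large language model
23	LongVideoAgent: Multi-Agent Reasoning with Long Videos	LongVideoAgent: a multi-agent reasoning framework for long-video question answering that improves temporal localization and the capture of fine-grained details	reinforcement learning multimodal	✅
24	Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information	Applies high-fidelity digital models and reinforcement learning to mission engineering, using aerial firefighting under perfect information as a case study	reinforcement learning
25	Offline Safe Policy Optimization From Heterogeneous Feedback	Proposes the PreSa framework, which optimizes safe policies directly from heterogeneous feedback to address offline safe policy optimization	reinforcement learning preference learning RLHF
26	Evolutionary Neural Architecture Search with Dual Contrastive Learning	Proposes DCL-ENAS, which uses dual contrastive learning to improve the efficiency and accuracy of evolutionary neural architecture search	contrastive learning
27	Discovering Lie Groups with Flow Matching	Proposes a flow matching method for discovering Lie group symmetries	flow matching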

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
28	ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge	ActionFlow: a pipelined action acceleration framework for vision-language models on edge devices	manipulation vision-language-action VLA

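🔬 Pillar 3: Perception & Semantics (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
29	Learning Skills from Action-Free Videos	Proposes SOF, an optical-flow-based skill abstraction framework for learning robot skills from action-free videos	optical flow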
