cs.AI (2025-10-07)

📊 35 papers in total | 🔗 4 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (26 🔗3) | Pillar 2: RL & Architecture (6) | Pillar 1: Robot Control (1 🔗1) | Pillar 3: Perception & Semantics (1) | Pillar 4: Generative Motion (1)

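🔬 Pillar 9: Embodied Foundation Models (26 papers)

#	Title	One-sentence summary	Tags	🔗
1	MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption	MetaVLA: a unified meta co-training framework for efficient embodied adaptation	vision-language-action VLA OpenVLA
2	ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems	Proposes ARM, which discovers agentic reasoning modules for generalizable multi-agent systems	large language model foundation model chain-of-thought
3	BuilderBench -- A benchmark for generalist agents	BuilderBench: a benchmark platform for generalist agents, built around open-ended exploration	generalist agent
4	Domain-Shift-Aware Conformal Prediction for Large Language Models	Proposes domain-shift-aware conformal prediction (DS-CP) to improve uncertainty quantification for large language models under domain shift	large language model
5	PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles	PuzzlePlex: a diverse puzzle benchmark for evaluating the reasoning and planning abilities of embodied agents	foundation model
6	Leveraging Large Language Models for Cybersecurity Risk Assessment -- A Case from Forestry Cyber-Physical Systems	Uses locally hosted large language models to support cybersecurity risk assessment for forestry cyber-physical systems	large language model
7	StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars	StarEmbed: a benchmark of time series foundation models on astronomical observations of variable stars	foundation model
8	Digital Transformation Chatbot (DTchatbot): Integrating Large Language Model-based Chatbot in Acquiring Digital Transformation Needs	Proposes DTchatbot, a large-language-model-based chatbot for eliciting digital transformation needs	large language model
9	Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis	Proposes a multimodal method, based on time-window analysis, for early prediction of cross-lingual meme virality	multimodal
10	Uncovering Representation Bias for Investment Decisions in Open-Source Large Language Models	Uncovers representation bias in open-source large language models for investment decisions, with a focus on Qwen models	large language model
11	Membership Inference Attacks on Tokenizers of Large Language Models	Proposes tokenizer-based membership inference attacks, exposing privacy risks in large language models	large language model
12	Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique	Proposes a text-preserving watermarking framework for auditing the data provenance of fine-tuned large language models	large language model
13	Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography	Uses GPT-4o to extract labels from radiology reports for training multi-label image classification models on upper extremity radiographs	large language model
14	Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences	Reveals the “Moloch's Bargain” that emerges when LLMs compete: pursuing success degrades AI alignment	large language model
15	Domain-Grounded Evaluation of LLMs in International Student Knowledge	Proposes a domain-grounded LLM evaluation method for study-abroad knowledge, addressing hallucination	large language model
16	Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model	Proposes a relative-positioning-based code chunking method that improves code language model performance on repository-level code completion	large language model
17	Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting	Improves large-scale MoE LLM serving performance through data movement forecasting	large language model	✅
18	Impact of LLMs on Team Collaboration in Software Development	Studies the impact of LLMs on team collaboration in software development: gains in efficiency and communication, alongside challenges and security concerns	large language model
19	Automated Program Repair of Uncompilable Student Code	Uses large language models to automatically repair uncompilable student code, improving student modeling	large language model
20	MixReasoning: Switching Modes to Think	MixReasoning: a mixed-reasoning framework that adaptively adjusts reasoning depth	chain-of-thought
21	Training-Free Time Series Classification via In-Context Reasoning with LLM Agents	Proposes FETA, a training-free time series classification framework based on in-context reasoning with LLM agents	large language model	✅
22	Optimizing for Persuasion Improves LLM Generalization: Evidence from Quality-Diversity Evolution of Debate Strategies	DebateQD: optimizing LLMs for persuasion improves generalization and addresses overfitting	large language model
23	VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code	Proposes the VeriEquivBench benchmark for ground-truth-free equivalence evaluation of formally verifiable code	large language model
24	ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming	Proposes ConstraintLLM to tackle industrial-level constraint programming problems	large language model	✅
25	Artificially intelligent agents in the social and behavioral sciences: A history and outlook	Reviews the history and future outlook of intelligent agents in the social and behavioral sciences	large language model
26	From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions	Proposes a self-evolving agentic AI framework to address optimization problems in wireless networks that otherwise depend on manual intervention	large language model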

🔬 Pillar 2: RL & Architecture (6 papers)

#	Title	One-sentence summary	Tags
27	The Safety Challenge of World Models for Embodied AI Agents: A Review	Reviews the safety challenges of world models for embodied AI, analyzing model flaws in autonomous driving and robotics scenarios	world model embodied AI
28	Joint Communication Scheduling and Velocity Control for Multi-UAV-Assisted Post-Disaster Monitoring: An Attention-Based In-Context Learning Approach	Proposes AIC-VDS, an attention-based in-context learning method for jointly optimizing communication scheduling and velocity control in multi-UAV-assisted post-disaster monitoring	reinforcement learning deep reinforcement learning DRL
29	Vul-R2: A Reasoning LLM for Automated Vulnerability Repair	Vul-R2: a reasoning LLM for automated vulnerability repair	reinforcement learning large language model foundation model
30	In-the-Flow Agentic System Optimization for Effective Planning and Tool Use	Proposes AgentFlow, which optimizes the agentic system in the flow to improve planning and tool use	reinforcement learning large language model
31	Towards Reliable and Practical LLM Security Evaluations via Bayesian Modelling	Proposes a Bayesian-modelling-based framework for LLM security evaluation, improving the reliability and practicality of assessing prompt injection vulnerabilities	Mamba large language model
32	TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning	Proposes TaTToo, a tool-grounded PRM for tabular reasoning that improves test-time tabular reasoning	reinforcement learning reward shaping

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-sentence summary	Tags	🔗
33	D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI	D2E framework: pretraining on desktop data to improve the performance of embodied AI robots	manipulation embodied AI large language model	✅

🔬 Pillar 3: Perception & Semantics (1 paper)

#	Title	One-sentence summary	Tags	🔗
34	Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model	Proposes SurgiFlowVid, which uses a dual-prediction video diffusion model to mitigate data imbalance in surgical video	scene understanding optical flow motion prediction

🔬 Pillar 4: Generative Motion (1 paper)

#	Title	One-sentence summary	Tags	🔗
35	AutoPentester: An LLM Agent-based Framework for Automated Pentesting	AutoPentester: an LLM agent-based framework for automated penetration testing	penetration large language model
