cs.AI (2026-02-06)

📊 44 papers in total | 🔗 6 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (27 🔗4) | Pillar 2: RL & Architecture (11) | Pillar 1: Robot Control (3) | Pillar 8: Physics-based Animation (2 🔗1) | Pillar 6: Video Extraction (1 🔗1)

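🔬 Pillar 9: Embodied Foundation Models (27 papers)

#	Title	Key Point	Tags	🔗
1	Trifuse: Enhancing Attention-Based GUI Grounding via Multimodal Fusion	Trifuse: enhances attention-based GUI element grounding through multimodal fusion	large language model, multimodal
2	POP: Online Structural Pruning Enables Efficient Inference of Large Foundation Models	POP: online structural pruning for efficient large-model inference, balancing accuracy and speed	large language model, foundation model
3	Federated Prompt-Tuning with Heterogeneous and Incomplete Multimodal Client Data	Proposes a federated prompt-tuning framework for heterogeneous, incomplete multimodal data, addressing missing data across clients and semantic alignment	multimodal
4	Is there "Secret Sauce" in Large Language Model Development?	Large language model performance is driven mainly by compute, but differences in developer efficiency significantly affect non-frontier models	large language model
5	Sequences as Nodes for Contrastive Multimodal Graph Recommendation	Proposes MuSICRec, which mitigates cold-start and data-sparsity problems through multimodal contrastive graph recommendation	multimodal
6	Multimodal Enhancement of Sequential Recommendation	Proposes MuSTRec, which fuses multimodal information with sequential recommendation to improve recommendation performance	multimodal
7	PreFlect: From Retrospective to Prospective Reflection in Large Language Model Agents	PreFlect: a shift from retrospective to prospective reflection in large language model agents	large language model	✅
8	ShallowJail: Steering Jailbreaks against Large Language Models	Proposes the ShallowJail attack, which exploits shallow alignment to break the safety guardrails of large language models	large language model	✅
9	The Quantum Sieve Tracer: A Hybrid Framework for Layer-Wise Activation Tracing in Large Language Models	Proposes the Quantum Sieve Tracer for layer-wise activation tracing in large language models, revealing differences between model architectures	large language model
10	GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models	GhostCite: a large-scale analysis of citation validity in the age of large language models	large language model
11	Multimodal Generative Retrieval Model with Staged Pretraining for Food Delivery on Meituan	Proposes a multimodal generative retrieval model with staged pretraining for the Meituan food delivery scenario	multimodal
12	LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models	LogicSkills: a structured benchmark for evaluating the formal reasoning abilities of large language models	large language model
13	Same Answer, Different Representations: Hidden instability in VLMs	Reveals instability in the internal representations of vision-language models: same answer, different representations	multimodal
14	How Well Can LLM Agents Simulate End-User Security and Privacy Attitudes and Behaviors?	SP-ABCBench evaluates how well LLM agents simulate users' security and privacy attitudes, and finds room for improvement	large language model
15	TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering	TamperBench: systematically stress-tests LLM safety under fine-tuning and tampering	large language model	✅
16	TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code	TraceCoder: a multi-agent framework based on runtime traces for automatically debugging LLM-generated code	large language model
17	ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training	ScaleEnv: scales environment synthesis from scratch for training generalist interactive tool-use agents	generalist agent
18	Bridging 6G IoT and AI: LLM-Based Efficient Approach for Physical Layer's Optimization Tasks	Proposes the LLM-based PE-RTFV framework for 6G IoT physical-layer optimization	large language model
19	Wild Guesses and Mild Guesses in Active Concept Learning	Studies how query strategies affect neuro-symbolic Bayesian learners in active concept learning, revealing the potential rationality of confirmation bias	large language model
20	Evidence for Daily and Weekly Periodic Variability in GPT-4o Performance	Reveals daily and weekly periodic fluctuations in GPT-4o performance, challenging the assumption of time invariance	large language model
21	AgentStepper: Interactive Debugging of Software Development Agents	AgentStepper: a tool for interactive debugging of software development agents	large language model
22	Lemon Agent Technical Report	Lemon Agent: a multi-agent collaborative system built on the AgentCortex framework that improves efficiency on complex tasks	multimodal
23	HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction	HyPER: bridges exploration and exploitation through hypothesis path expansion and reduction for scalable LLM reasoning	chain-of-thought
24	Evaluating Retrieval-Augmented Generation Variants for Natural Language-Based SQL and API Call Generation	Evaluates retrieval-augmented generation variants for natural-language-to-SQL and API call generation	large language model
25	BEAGLE: Behavior-Enforced Agent for Grounded Learner Emulation	BEAGLE: a behavior-enforced agent for grounded emulation of learners	large language model
26	Rethinking Scientific Modeling: Toward Physically Consistent and Simulation-Executable Programmatic Generation	Proposes a physically consistent programmatic generation framework for automatically creating executable structural modeling code	large language model	✅
27	Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution	Reveals the intrinsic stability limits of autoregressive reasoning and proposes structural governance for long-horizon execution	large language model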

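🔬 Pillar 2: RL & Architecture (11 papers)

#	Title	Key Point	Tags
28	Next-generation cyberattack detection with large language models: anomaly analysis across heterogeneous logs	Uses large language models for next-generation cyberattack detection, enabling anomaly analysis across heterogeneous logs	distillation, large language model
29	Unlocking Noisy Real-World Corpora for Foundation Model Pre-Training via Quality-Aware Tokenization	Proposes QA-Token, which improves pre-trained model performance on noisy data through quality-aware tokenization	reinforcement learning, foundation model
30	LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning	Proposes LatentChem, which improves the efficiency and performance of chemical large language models through latent-space reasoning	latent dynamics, large language model, chain-of-thought
31	Towards Understanding What State Space Models Learn About Code	The first analysis of how SSMs understand code: reveals their strengths and limitations in code modeling	SSM, state space model
32	SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees	Proposes SeeUPO, a sequence-level agentic-RL algorithm with convergence guarantees that addresses training instability in multi-turn interaction	reinforcement learning, PPO, large language model
33	Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions	Proposes a multi-task reinforcement learning method based on semantically labelled automata, addressing generalization under LTL instructions	reinforcement learning
34	Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning	Prism: a multi-agent reinforcement learning framework based on spectral parameter sharing that improves resource efficiency	reinforcement learning
35	Progress Constraints for Reinforcement Learning in Behavior Trees	Proposes a reinforcement learning method for behavior trees based on progress constraints, improving task performance and sample efficiency	reinforcement learning
36	AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents	AgentCPM-Explore: long-horizon deep exploration for edge-scale agents	reinforcement learning, large language model
37	AbFlow: End-to-end Paratope-Centric Antibody Design by Interaction Enhanced Flow Matching	AbFlow: end-to-end paratope-centric antibody design via interaction-enhanced flow matching	flow matching
38	Sample-Efficient Policy Space Response Oracles with Joint Experience Best Response	Proposes Joint Experience Best Response, which improves the sample efficiency of PSRO in multi-agent reinforcement learning	reinforcement learning, offline RL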

🔬 Pillar 1: Robot Control (3 papers)

#	Title	Key Point	Tags
39	Empirical Analysis of Adversarial Robustness and Explainability Drift in Cybersecurity Classifiers	Studies adversarial robustness and explainability drift in cybersecurity classifiers	manipulation
40	Incentive-Aware AI Safety via Strategic Resource Allocation: A Stackelberg Security Games Perspective	Proposes an incentive-aware AI safety framework based on Stackelberg security games, addressing adversarial risks in AI system development and deployment	manipulation
41	Malicious Agent Skills in the Wild: A Large-Scale Security Empirical Study	Builds a dataset of malicious agent skills, revealing security vulnerabilities in the LLM agent ecosystem	manipulation

🔬 Pillar 8: Physics-based Animation (2 papers)

#	Title	Key Point	Tags	🔗
42	SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy	Proposes SAS-Net, which addresses domain shift and geometric distortion in spatiotemporal registration for bidirectional photoacoustic microscopy	spatiotemporal	✅
43	GraFSTNet: Graph-based Frequency SpatioTemporal Network for Cellular Traffic Prediction	GraFSTNet: a graph-based frequency spatiotemporal network for cellular traffic prediction	spatiotemporal

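🔬 Pillar 6: Video Extraction (1 paper)

#	Title	Key Point	Tags	🔗
44	Reasoning-Augmented Representations for Multimodal Retrieval	Proposes a reasoning-augmented representation framework that improves implicit reasoning in general multimodal retrieval	feature matching, multimodal	✅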
