cs.CL（2025-12-25）

📊 共 14 篇论文

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (7) 支柱二：RL算法与架构 (RL & Architecture) (3) 支柱一：机器人控制 (Robot Control) (2) 支柱六：视频提取与匹配 (Video Extraction) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Do Latent Tokens Think? A Causal and Adversarial Analysis of Chain-of-Continuous-Thought	揭示潜在令牌的伪推理本质：对Chain-of-Continuous-Thought进行因果和对抗分析	large language model chain-of-thought
2	Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech	提出基于思维图谱的对话行为推理框架，提升全双工语音交互系统的自然性。	multimodal chain-of-thought
3	Five Years of SciCap: What We Learned and Future Directions for Scientific Figure Captioning	SciCap项目五年回顾：总结科学图像描述经验，展望未来研究方向	large language model
4	Context Discipline and Performance Correlation: Analyzing LLM Performance and Quality Degradation Under Varying Context Lengths	分析上下文长度变化对LLM性能和质量的影响，揭示KV缓存瓶颈	large language model
5	MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles	MoRAgent：基于混合角色（MoR）的参数高效Agent微调框架	large language model
6	Heaven-Sent or Hell-Bent? Benchmarking the Intelligence and Defectiveness of LLM Hallucinations	提出HIC-Bench，用于评估LLM幻觉的智能与缺陷，并探索其在科学创新中的作用。	large language model
7	Beyond Heuristics: A Decision-Theoretic Framework for Agent Memory Management	提出DAM：基于决策论的Agent记忆管理框架，解决启发式方法的局限性	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
8	Compass-Embedding v4: Robust Contrastive Learning for Multilingual E-commerce Embeddings	Compass-Embedding v4：面向东南亚电商场景的鲁棒对比学习多语言嵌入	representation learning contrastive learning
9	A Unified Definition of Hallucination: It's The World Model, Stupid!	统一幻觉定义：核心在于语言模型对世界的建模能力	world model
10	Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards	提出A3PO：一种自适应非对称token级优势塑造策略优化方法，提升基于可验证奖励的强化学习推理能力。	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
11	Break Out the Silverware -- Semantic Understanding of Stored Household Items	提出NOAM模型，解决服务机器人居家环境中物品存储位置的语义理解难题	manipulation scene understanding large language model
12	Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers	研究孟加拉语AI生成释义文本检测，对比零样本与微调Transformer模型性能。	manipulation large language model

🔬 支柱六：视频提取与匹配 (Video Extraction) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
13	Oogiri-Master: Benchmarking Humor Understanding via Oogiri	Oogiri-Master：通过大喜利游戏基准测试语言模型的幽默理解能力	HuMoR large language model
14	WearVox: An Egocentric Multichannel Voice Assistant Benchmark for Wearables	WearVox：面向可穿戴设备的多通道语音助手评测基准	egocentric large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页