cs.CL(2026-03-16)
📊 共 21 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (15 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱一:机器人控制 (Robot Control) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (15 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | MMKU-Bench: A Multimodal Update Benchmark for Diverse Visual Knowledge | 提出MMKU-Bench,用于评估多模态模型在知识更新方面的表现,涵盖已知与未知知识。 | reinforcement learning RLHF multimodal | ||
| 17 | Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models | Fusian:多LoRA融合实现大语言模型中细粒度连续MBTI人格控制 | reinforcement learning large language model | ||
| 18 | Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning | Code-A1:通过强化学习对抗进化代码LLM和测试LLM,提升代码生成质量。 | reinforcement learning | ||
| 19 | Criterion-referenceability determines LLM-as-a-judge validity across physics assessment formats | 研究表明,LLM作为评分者的有效性取决于物理评估任务的标准参照性 | MAE large language model |
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 20 | LLMs as Signal Detectors: Sensitivity, Bias, and the Temperature-Criterion Analogy | 利用信号检测理论分析LLM:揭示温度参数与决策标准的类比及局限性 | manipulation large language model | ||
| 21 | The Impact of Ideological Discourses in RAG: A Case Study with COVID-19 Treatments | 研究意识形态文本对RAG模型输出的影响,以COVID-19治疗为例。 | manipulation large language model |