cs.AI（2025-01-28）

📊 共 12 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (2) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	MCTS-SQL: Light-Weight LLMs can Master the Text-to-SQL through Monte Carlo Tree Search	提出MCTS-SQL框架，利用蒙特卡洛树搜索提升轻量级LLM在Text-to-SQL任务上的性能。	large language model
2	RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings	RadioLLM：通过混合提示和Token重编程将大型语言模型引入认知无线电	large language model
3	Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology	通过微调开源大语言模型提升放射肿瘤任务性能，探索其临床应用潜力	large language model
4	SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model	SafeRAG：构建RAG安全性评测基准，揭示其在对抗知识操纵攻击中的脆弱性	large language model	✅
5	Distilling Large Language Models for Network Active Queue Management	提出AQM-LLM，利用大语言模型提升网络主动队列管理性能	large language model
6	Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation	提出GAP框架，通过图结构优化对抗性提示生成，提升LLM内容审核能力	large language model multimodal	✅
7	From Natural Language to Extensive-Form Game Representations	提出一种基于LLM和上下文学习的框架，将自然语言博弈描述转换为扩展式博弈表示	large language model
8	Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers	提出语义自验证(SSV)方法，提升语言模型在逻辑推理任务中的准确性和可靠性。	large language model
9	Balancing Content Size in RAG-Text2SQL System	研究RAG-Text2SQL系统中检索文档大小与质量的平衡策略，提升查询准确性。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
10	SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training	比较SFT与RL在基础模型后训练中的作用	reinforcement learning foundation model
11	Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding	提出基于群体智慧解码（WOC）的大语言模型（LLM）估算方法，提升世界知识利用率。	world model large language model

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
12	Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care	提出结合强化学习与AI代理的自适应机器人交互系统，用于痴呆症护理。	humanoid humanoid robot reinforcement learning

⬅️ 返回 cs.AI 首页 · 🏠 返回主页