cs.AI (2025-01-31)

📊 21 papers in total

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (14) · Pillar 2: RL & Architecture (6) · Pillar 1: Robot Control (1)

🔬 Pillar 9: Embodied Foundation Models (14 papers)

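#	Title	One-sentence summary	Tags
1	Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play	Proposes an imitation-game-based framework for defending against adversarial attacks, using multimodal generative chain-of-thought role-play	multimodal chain-of-thought
2	Multimodal MRI-Ultrasound AI for Prostate Cancer Detection Outperforms Radiologist MRI Interpretation: A Multi-Center Study	Multimodal MRI-ultrasound AI for prostate cancer detection outperforms radiologists' MRI interpretation	multimodal
3	Augmented Intelligence for Multimodal Virtual Biopsy in Breast Cancer Using Generative Artificial Intelligence	Uses generative AI to augment assisted diagnosis through multimodal virtual biopsy in breast cancer	multimodal
4	Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes	Proposes IoTGen, an LLM-based framework that generates smart-home user behavior sequence data and improves the generalization of downstream models	large language model
5	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Proposes MINDSTORES to address the limited planning ability of existing LLMs	embodied AI large language model
6	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Uses the Chameleon game to evaluate LLMs' abilities in information control, inference, and strategy formation	large language model
7	We're Different, We're the Same: Creative Homogeneity Across LLMs	Reveals homogeneity in LLM creative output: different models, similar results	large language model
8	LLM Cyber Evaluations Don't Capture Real-World Risk	Proposes an LLM cybersecurity risk assessment framework that bridges the gap between capability evaluations and real-world impact	large language model
9	SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling	SETS: uses self-verification and self-correction to improve LLM reasoning at test time	large language model
10	Analysis of LLMs vs Human Experts in Requirements Engineering	Compares LLMs with human experts in requirements engineering: LLMs perform better at requirements elicitation	large language model
11	LLM-RecG: A Semantic Bias-Aware Framework for Zero-Shot Sequential Recommendation	LLM-RecG: a semantic bias-aware framework for zero-shot sequential recommendation	large language model
12	Language Games as the Pathway to Artificial Superhuman Intelligence	Proposes a language-game-based framework for evolving toward ASI that breaks out of the data reproduction trap	large language model
13	SOK: Exploring Hallucinations and Security Risks in AI-Assisted Software Development with Insights for LLM Deployment	Explores hallucinations and security risks in AI-assisted software development, with insights for LLM deployment	large language model
14	Can AI Solve the Peer Review Crisis? A Large Scale Cross Model Experiment of LLMs' Performance and Biases in Evaluating over 1000 Economics Papers	Evaluates LLM performance and biases in reviewing economics papers through a large-scale experiment, exploring AI-assisted peer review	large language model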

🔬 Pillar 2: RL & Architecture (6 papers)

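#	Title	One-sentence summary	Tags
15	Jackpot! Alignment as a Maximal Lottery	Proposes an alignment method based on maximal lotteries that improves LLM robustness when learning from human feedback	reinforcement learning RLHF large language model
16	In Pursuit of Predictive Models of Human Preferences Toward AI Teammates	Investigates predictive models of human preferences toward AI teammates in the cooperative game Hanabi	reinforcement learning predictive model
17	An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents	Proposes a deep reinforcement learning framework for game-theoretic analysis of cyber attack and defense, based on potential-based reward shaping and multiple response oracles	reinforcement learning deep reinforcement learning DRL
18	Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning	Proposes an XRL evaluation method based on objective metrics, aimed at debugging agent behavior and supporting human-AI collaboration	reinforcement learning
19	SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments	SHARPIE: a modular, general-purpose framework for reinforcement learning and human-AI interaction experiments	reinforcement learning
20	Enabling Autonomic Microservice Management through Self-Learning Agents	Proposes ServiceOdyssey, which achieves autonomic microservice management through self-learning agents	curriculum learning large language model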

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-sentence summary	Tags
21	Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning	Proposes Safety Chain-of-Thought to strengthen LLM defenses against adversarial attacks	manipulation large language model chain-of-thought
