cs.AI（2025-05-13）

📊 共 34 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (20 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (12 🔗1) 支柱一：机器人控制 (Robot Control) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

#	题目	一句话要点	标签	🔗
1	Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts	提出基于多模态大语言模型的超声报告生成框架，实现标准化文本输出。	large language model multimodal
2	Decoding Neighborhood Environments with Large Language Models	利用大型语言模型解码社区环境：无需训练，实现高精度环境要素识别。	large language model
3	Optimized Couplings for Watermarking Large Language Models	针对大语言模型，提出优化耦合的水印方案，提升检测能力并降低文本质量损失。	large language model	✅
4	DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models	提出DeepMath-Creative以评估大语言模型的数学创造力	large language model
5	CellTypeAgent: Trustworthy cell type annotation with Large Language Models	CellTypeAgent：利用大语言模型实现可信的细胞类型注释	large language model
6	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	提出TruEDebate，利用多智能体辩论系统与大语言模型提升假新闻检测的解释性和有效性	large language model
7	Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People	针对听障人士语音清晰度预测，揭示语音基础模型应用的最佳实践	foundation model
8	Federated Large Language Models: Feasibility, Robustness, Security and Future Directions	综述联邦大语言模型(FLLM)在可行性、鲁棒性、安全性的挑战与未来方向	large language model
9	TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching	TrialMatchAI：端到端AI临床试验推荐系统，加速患者与试验匹配	large language model chain-of-thought
10	Lost in Transmission: When and Why LLMs Fail to Reason Globally	提出BAPO模型，揭示LLM全局推理失败源于内部通信带宽限制	large language model chain-of-thought
11	Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification	结合CoT、RAG、自洽性和自验证，提升大型语言模型的可靠性	large language model chain-of-thought
12	Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation	WebApp1K：提出测试驱动开发基准，评估LLM从测试用例生成代码的能力	large language model instruction following
13	AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques	综述AI与生成式AI在灾害管理中的应用，聚焦于灾害评估与响应技术。	multimodal
14	AI-Mediated Code Comment Improvement	提出基于AI的代码注释改进方法，利用大语言模型重写注释以提升质量	large language model
15	Securing RAG: A Risk Assessment and Mitigation Framework	提出RAG安全框架，评估并缓解检索增强生成中的安全风险	large language model
16	VizCV: AI-assisted visualization of researchers' publications tracks	VizCV：提出AI辅助的可视化框架，用于分析科研人员的论文发表轨迹。	large language model
17	Resource-Efficient Language Models: Quantization for Fast and Accessible Inference	针对大语言模型，提出后训练量化方法以加速推理并降低资源需求	large language model
18	Evaluating LLM Metrics Through Real-World Capabilities	评估LLM在真实世界能力：弥合基准测试与实际应用差距，Gemini表现突出	large language model
19	Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information	研究LLM与人类专家在健康信息解释中的沟通风格差异及读者偏好	large language model
20	Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions	利用AI革新HPC软件开发：挑战与研究方向探讨	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (12 篇)

#	题目	一句话要点	标签	🔗
21	Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast	提出基于知识蒸馏的电池容量衰退预测框架，提升跨尺度泛化能力并降低计算成本。	teacher-student distillation foundation model
22	Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections	提出基于深度强化学习的纵向控制策略，提升自动驾驶车辆在信号交叉口的安全性、效率和舒适性。	reinforcement learning deep reinforcement learning DRL
23	Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control	提出基于蒙特卡洛束搜索的Actor-Critic强化学习方法，提升连续控制任务的探索效率。	reinforcement learning policy learning PPO
24	Strategy-Augmented Planning for Large Language Models via Opponent Exploitation	提出基于策略增强规划的LLM智能体，通过对手策略挖掘提升博弈性能	reinforcement learning large language model	✅
25	Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation	提出自适应难度课程学习与专家引导自重构，提升LLM数学推理能力	reinforcement learning imitation learning curriculum learning
26	Improved Algorithms for Differentially Private Language Model Alignment	提出差分隐私语言模型对齐算法，提升隐私保护下的对齐效果	reinforcement learning RLHF DPO
27	Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation	提出基于深度强化学习的电力系统多阶段级联故障缓解策略	reinforcement learning deep reinforcement learning
28	Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning	提出分层多智能体强化学习，提升空战战术决策能力	reinforcement learning deep reinforcement learning
29	Hyperbolic Contrastive Learning with Model-augmentation for Knowledge-aware Recommendation	提出基于双曲对比学习和模型增强的知识感知推荐方法，解决层级结构建模和偏好偏移问题。	preference learning contrastive learning
30	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	提出基于Agent强化学习的元提示工程，提升大语言模型与人类心智理论的对齐	reinforcement learning large language model
31	A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization	提出基于Mamba的SpectMamba网络，结合置信度二元正则化，用于半监督的歌声旋律提取。	Mamba
32	A Study of Data-driven Methods for Inventory Optimization	研究数据驱动方法在超市库存优化中的应用，对比时间序列、随机森林和深度强化学习算法。	reinforcement learning deep reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
33	Modeling Unseen Environments with Language-guided Composable Causal Components in Reinforcement Learning	提出WM3C以解决强化学习中的环境泛化问题	manipulation reinforcement learning policy learning
34	A Survey of Deep Learning for Complex Speech Spectrograms	综述深度学习在复数语音语谱图处理中的应用，涵盖网络架构、训练策略及应用。	manipulation

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2025-05-13）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (12 篇)

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理