cs.AI (2025-04-07)

📊 35 papers in total | 🔗 4 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (22 🔗3) | Pillar 2: RL & Architecture (12 🔗1) | Pillar 3: Perception & Semantics (1)

🔬 Pillar 9: Embodied Foundation Models (22 papers)

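#	Title	One-line summary	Tags	🔗
1	SmolVLM: Redefining small and efficient multimodal models	Proposes SmolVLM to address resource efficiency in small multimodal models	multimodal
2	Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering	Survey: using large language models to improve software security, focusing on code analysis, malware analysis, and reverse engineering	large language model
3	The challenge of uncertainty quantification of large language models in medicine	Proposes a comprehensive framework for quantifying the uncertainty of medical large language models, improving the reliability of clinical decision-making	large language model
4	Promoting Security and Trust on Social Networks: Explainable Cyberbullying Detection Using Large Language Models in a Stream-Based Machine Learning Framework	Proposes an explainable cyberbullying detection approach based on a stream-based machine learning framework and large language models, improving social network security	large language model
5	Leveraging Label Potential for Enhanced Multimodal Emotion Recognition	Proposes the LSGMER model, which uses label information to improve the accuracy and stability of multimodal emotion recognition	multimodal
6	CCSK:Cognitive Convection of Self-Knowledge Based Retrieval Augmentation for Large Language Models	Proposes CCSK, which improves retrieval-augmented generation for large language models through cognitive convection of self-knowledge	large language model
7	Multimodal Agricultural Agent Architecture (MA3): A New Paradigm for Intelligent Agricultural Decision-Making	Proposes MA3, a multimodal agricultural agent architecture for intelligent agricultural decision-making, addressing production optimization and sustainability challenges under climate change	multimodal
8	Prism: Dynamic and Flexible Benchmarking of LLMs Code Generation with Monte Carlo Tree Search	Prism: a dynamic and flexible benchmarking framework for LLM code generation based on Monte Carlo Tree Search	large language model
9	On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions	Uses race representations for debiasing, though generalization remains challenging	large language model
10	AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design	AccLLM: accelerates long-context LLM inference through algorithm-hardware co-design	large language model
11	SciSciGPT: Advancing Human-AI Collaboration in the Science of Science	SciSciGPT: uses large language models to support scientific research and promote human-AI collaboration	large language model
12	A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions	Maps the capabilities of conversational agents and analyzes challenges and future directions, in support of artificial general intelligence	large language model	✅
13	Frontier AI's Impact on the Cybersecurity Landscape	Frontier AI widens the attack-defense imbalance in cybersecurity, with offensive capabilities outpacing defense; new benchmarks and defensive AI are urgently needed	foundation model
14	EduPlanner: LLM-Based Multi-Agent Systems for Customized and Intelligent Instructional Design	EduPlanner: an LLM-based multi-agent system for customized and intelligent instructional design	large language model	✅
15	Utility-Focused LLM Annotation for Retrieval and Retrieval-Augmented Generation	Uses large language models to annotate document utility, improving the performance of retrieval and RAG systems	large language model
16	Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource Home Clusters	Prima.cpp: fast 30-70B LLM inference on heterogeneous, low-resource home clusters	large language model
17	Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning	Proposes the DOWN framework, which uses adaptive debate to improve LLM reasoning efficiency and reduce computational cost	large language model
18	The Dream Within Huang Long Cave: AI-Driven Interactive Narrative for Family Storytelling and Emotional Reflection	Proposes an AI-based interactive narrative system for family storytelling and emotional reflection	large language model
19	Debate-Feedback: A Multi-Agent Framework for Efficient Legal Judgment Prediction	Proposes the Debate-Feedback framework, which uses multi-agent debate to predict legal judgments efficiently	large language model
20	BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents	BIASINSPECTOR: automatically detects bias in structured data using LLM agents	large language model
21	ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines	Proposes ELT-Bench for evaluating AI agents' ability to build end-to-end ELT pipelines	large language model	✅
22	Generalising from Self-Produced Data: Model Training Beyond Human Constraints	Proposes a new framework in which AI autonomously generates data and trains models, going beyond the limits of human data and levels of abstraction	large language model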

🔬 Pillar 2: RL & Architecture (12 papers)

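#	Title	One-line summary	Tags	🔗
23	R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation	R2Vul: combines reinforcement learning with structured reasoning distillation to improve code LLMs' software vulnerability detection	reinforcement learning distillation large language model
24	Deep Reinforcement Learning Algorithms for Option Hedging	Compares deep reinforcement learning algorithms for option hedging; the MCPG algorithm performs best	reinforcement learning deep reinforcement learning DRL
25	Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Proposes a beam prediction method for mmWave communications based on cross-modal relational knowledge distillation, improving resource efficiency	distillation multimodal
26	Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning	Proposes an LLM evolutionary search algorithm fine-tuned with reinforcement learning, accelerating the discovery of combinatorial optimization algorithms	reinforcement learning large language model
27	VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks	VAPO: an efficient and reliable reinforcement learning framework for advanced reasoning tasks	reinforcement learning chain-of-thought
28	GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network	Proposes GAMDTP to address dynamic trajectory prediction	Mamba SSM
29	Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Proposes Step-Wise RL, which uses synthetic data and multi-step reinforcement learning to improve language model performance on reasoning and tool use	reinforcement learning RLHF large language model
30	HypRL: Reinforcement Learning of Control Policies for Hyperproperties	HypRL: proposes a multi-agent reinforcement learning framework for control policies guided by HyperLTL specifications	reinforcement learning reward shaping
31	Interactive Explanations for Reinforcement-Learning Agents	Proposes ASQ-IT, an interactive explanation system that improves users' understanding of reinforcement learning agent behavior and their ability to locate problems	reinforcement learning
32	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Proposes the LLM-QL model, which uses query likelihood modeling to improve LLM performance in dense retrieval	contrastive learning large language model
33	Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors	Proposes the W4S framework, which uses a weak meta-agent to optimize workflows and improve the performance of strong executors	reinforcement learning large language model
34	GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision	Proposes the GOTHAM framework for class-incremental learning on graph data under weak supervision	teacher-student distillation	✅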

🔬 Pillar 3: Perception & Semantics (1 paper)

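#	Title	One-line summary	Tags	🔗
35	How to evaluate control measures for LLM agents? A trajectory from today to superintelligence	Proposes a control evaluation framework for LLM agents that adapts red-team adversarial strategy as agent capabilities evolve	affordance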
