cs.AI (2024-11-20)

📊 21 papers in total

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (15) · Pillar 2: RL & Architecture (5) · Pillar 4: Generative Motion (1)

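🔬 Pillar 9: Embodied Foundation Models (15 papers)

#	Title	Key Point	Tags
1	AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations	AdaptAgent: rapidly adapts multimodal web agents using a small number of human demonstrations	large language model, multimodal
2	Are Large Language Models Memorizing Bug Benchmarks?	Evaluates memorization effects of large language models on bug benchmarks, revealing data leakage risks	large language model
3	Existential Conversations with Large Language Models: Content, Community, and Culture	Explores existential conversations with large language models: content, community, and cultural impact	large language model
4	The Information Security Awareness of Large Language Models	Proposes an automated method for assessing the information security awareness of large language models, revealing their security weaknesses	large language model
5	"It was 80% me, 20% AI": Seeking Authenticity in Co-Writing with Large Language Models	Explores preserving authorship in human-AI co-writing: design and evaluation of personalized AI writing assistance tools	large language model
6	Transforming the Hybrid Cloud for Emerging AI Workloads	Proposes full-stack co-design to handle the complexity of AI workloads	foundation model, multimodal
7	SoK: A Systems Perspective on Compound AI Threats and Countermeasures	Systematically analyzes compound AI threats and countermeasures, offering guidance for secure deployment	large language model
8	BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices	BetterBench: assesses AI benchmarks, uncovers issues, and establishes best practices	foundation model
9	When IoT Meet LLMs: Applications and Challenges	Explores the integration of LLMs and IoT: improving decision-making and optimizing resource utilization	large language model
10	AI-Driven Agents with Prompts Designed for High Agreeableness Increase the Likelihood of Being Mistaken for a Human in the Turing Test	AI agents with prompts designed for high agreeableness are more likely to be mistaken for humans in the Turing test	large language model
11	MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices	MAS-Attention: a memory-aware attention acceleration scheme for resource-constrained edge devices	foundation model
12	ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs	ToolScan: a new benchmark for characterizing errors in tool-use LLMs	large language model
13	CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability Detection	CryptoFormalEval: integrates LLMs with formal verification for automated detection of cryptographic protocol vulnerabilities	large language model
14	DMQR-RAG: Diverse Multi-Query Rewriting for RAG	Proposes the DMQR-RAG framework, which improves RAG retrieval and generation performance through diverse multi-query rewriting	large language model
15	MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Cultural Learning	MindForge: equips embodied agents with theory of mind for lifelong cultural learning	large language model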

🔬 Pillar 2: RL & Architecture (5 papers)

#	Title	Key Point	Tags
16	DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback	DrugGen: accelerates drug discovery using large language models and reinforcement learning feedback	reinforcement learning, large language model
17	Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning	Proposes an explainable LLM-driven multi-dimensional distillation framework that improves e-commerce search relevance learning	distillation, large language model, chain-of-thought
18	DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs	DSTC: direct preference learning using only self-generated tests and code to improve code LM performance	preference learning, DPO, direct preference optimization
19	BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games	BALROG: a new benchmark for evaluating the reasoning of agentic LLMs and VLMs in game environments	reinforcement learning, large language model
20	NumCoKE: Ordinal-Aware Numerical Reasoning over Knowledge Graphs with Mixture-of-Experts and Contrastive Learning	Proposes the NumCoKE framework, which strengthens numerical reasoning over knowledge graphs through mixture-of-experts and contrastive learning	contrastive learning

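🔬 Pillar 4: Generative Motion (1 paper)

#	Title	Key Point	Tags
21	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Proposes a heuristically adaptive diffusion-model evolutionary strategy that improves the exploration ability and convergence efficiency of evolutionary algorithms	classifier-free guidance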
