cs.AI(2024-11-20)

📊 共 21 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (15) 支柱二:RL算法与架构 (RL & Architecture) (5) 支柱四:生成式动作 (Generative Motion) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (15 篇)

#题目一句话要点标签🔗
1 AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations AdaptAgent:利用少量人类演示,实现多模态Web Agent的快速适应 large language model multimodal
2 Are Large Language Models Memorizing Bug Benchmarks? 评估大型语言模型在缺陷基准测试中的记忆效应,揭示数据泄露风险 large language model
3 Existential Conversations with Large Language Models: Content, Community, and Culture 探索大型语言模型的存在主义对话:内容、社群与文化影响 large language model
4 The Information Security Awareness of Large Language Models 提出自动化方法评估大语言模型信息安全意识,揭示其安全漏洞 large language model
5 "It was 80% me, 20% AI": Seeking Authenticity in Co-Writing with Large Language Models 探索人机共创中作者身份的保持:个性化AI写作辅助工具的设计与评估 large language model
6 Transforming the Hybrid Cloud for Emerging AI Workloads 提出全栈协同设计以应对AI工作负载的复杂性 foundation model multimodal
7 SoK: A Systems Perspective on Compound AI Threats and Countermeasures 系统性分析复合AI威胁与对策,为安全部署提供指导 large language model
8 BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices BetterBench:评估AI基准测试,揭示问题并建立最佳实践 foundation model
9 When IoT Meet LLMs: Applications and Challenges 探索LLM与IoT融合:提升决策能力与优化资源利用 large language model
10 AI-Driven Agents with Prompts Designed for High Agreeableness Increase the Likelihood of Being Mistaken for a Human in the Turing Test 通过设计高亲和力提示词的AI Agent,提升图灵测试中被误判为人类的可能性 large language model
11 MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices MAS-Attention:面向资源受限边缘设备的内存感知注意力加速方案 foundation model
12 ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs ToolScan:用于表征工具使用LLM中错误的新基准测试 large language model
13 CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability Detection CryptoFormalEval:集成LLM与形式化验证,实现密码协议漏洞自动检测 large language model
14 DMQR-RAG: Diverse Multi-Query Rewriting for RAG 提出DMQR-RAG框架,通过多样化多查询重写提升RAG检索和生成性能 large language model
15 MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Cultural Learning MindForge:赋予具身智能体心智理论,实现终身文化学习 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
16 DrugGen: Advancing Drug Discovery with Large Language Models and Reinforcement Learning Feedback DrugGen:利用大语言模型和强化学习反馈加速药物发现 reinforcement learning large language model
17 Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning 提出可解释LLM驱动的多维蒸馏框架,提升电商搜索相关性学习效果 distillation large language model chain-of-thought
18 DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs DSTC:仅用自生成测试与代码进行直接偏好学习,提升代码大模型性能 preference learning DPO direct preference optimization
19 BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games BALROG:用于评估Agentic LLM/VLM在游戏环境中推理能力的新基准 reinforcement learning large language model
20 NumCoKE: Ordinal-Aware Numerical Reasoning over Knowledge Graphs with Mixture-of-Experts and Contrastive Learning 提出NumCoKE框架,通过混合专家模型和对比学习增强知识图谱数值推理能力。 contrastive learning

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
21 Heuristically Adaptive Diffusion-Model Evolutionary Strategy 提出启发式自适应扩散模型进化策略,提升进化算法的探索能力和收敛效率。 classifier-free guidance

⬅️ 返回 cs.AI 首页 · 🏠 返回主页