cs.AI(2024-09-14)
📊 共 15 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (7)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱五:交互与反应 (Interaction & Reaction) (2)
支柱四:生成式动作 (Generative Motion) (1)
支柱八:物理动画 (Physics-based Animation) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | From Text to Multimodality: Exploring the Evolution and Impact of Large Language Models in Medical Practice | 综述医学领域多模态大语言模型:演进、应用与挑战 | large language model multimodal | ||
| 2 | Prevailing Research Areas for Music AI in the Era of Foundation Models | 综述音乐AI在基石模型时代的研究前沿与未来方向 | foundation model multimodal | ||
| 3 | StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly? | StressPrompt:探究压力对大语言模型与人类表现的相似影响 | large language model instruction following | ||
| 4 | Synergistic Simulations: Multi-Agent Problem Solving with Large Language Models | 提出基于LLM的多智能体协同框架,模拟解决现实和编程问题 | large language model | ||
| 5 | What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing | 提出SemSlicer,利用语义切片识别机器学习模型中的系统性问题。 | large language model | ||
| 6 | ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration | ESPnet-EZ:纯Python ESPnet,简化语音模型微调与集成 | foundation model | ||
| 7 | On the limits of agency in agent-based models | 提出LLM原型方法,高效集成LLM到ABM中,用于大规模自适应Agent仿真。 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution | Wave-U-Mamba:一种高质量、高效率的语音超分辨率端到端框架 | Mamba SSM | ||
| 9 | Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation | 提出AGDC模块,增强强化学习在源项估计中自主检测目标和停止的能力 | reinforcement learning | ||
| 10 | Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility | 提出MaskSR2模型,结合知识蒸馏和掩码声学建模,提升全频带语音恢复的清晰度 | distillation | ||
| 11 | Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models | 提出基于步级Q值模型的LLM Agent决策增强方法,显著提升多步决策任务性能。 | DPO large language model |
🔬 支柱五:交互与反应 (Interaction & Reaction) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling | 提出基于穿戴式传感器和Ego4D预训练的儿童自闭症患者人际互动中说话人分类方法 | dyadic interaction egocentric Ego4D | ||
| 13 | Federated Learning with Quantum Computing and Fully Homomorphic Encryption: A Novel Computing Paradigm Shift in Privacy-Preserving ML | 提出基于全同态加密和量子计算的联邦学习框架,增强隐私保护机器学习的安全性。 | OMOMO |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | Hacking, The Lazy Way: LLM Augmented Pentesting | 提出基于LLM增强的渗透测试方法,提升自动化程度和效率 | penetration large language model chain-of-thought |
🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Multiscale fusion enhanced spiking neural network for invasive BCI neural signal decoding | 提出多尺度融合增强型脉冲神经网络,用于侵入式脑机接口神经信号解码。 | spatiotemporal |