cs.AI (2024-05-20)
📊 12 papers in total | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (7 🔗1)
Pillar 2: RL & Architecture (3 🔗1)
Pillar 1: Robot Control (2)
🔬 Pillar 9: Embodied Foundation Models (7 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
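| 1 | Eliciting Problem Specifications via Large Language Models | Uses large language models to automatically generate problem specifications usable by cognitive systems | large language model | | |
| 2 | Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation | Proposes the Reindex-Then-Adapt framework to improve large language model performance in conversational recommendation and address the difficulty of controlling the recommendation distribution | large language model | | |
| 3 | Counterfactual Explanation-Based Badminton Motion Guidance Generation Using Wearable Sensors | Proposes a framework for generating badminton motion guidance based on wearable sensors and counterfactual explanations | multimodal | | |
| 4 | Semantic Trajectory Data Mining with LLM-Informed POI Classification | Uses LLMs for POI classification to improve semantic trajectory data mining performance | large language model | | |
| 5 | Recommender Algorithm for Supporting Self-Management of CVD Risk Factors in an Adult Population at Home | Proposes a recommender algorithm that combines a knowledge base with a large language model to support at-home self-management of cardiovascular disease risk factors in adults | large language model | | |
| 6 | Can Github issues be solved with Tree Of Thoughts? | Explores applying the Tree of Thoughts (ToT) framework to solving GitHub issues, analyzing its limitations and directions for improvement | large language model | | |
| 7 | Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities | Proposes a social intelligence evaluation benchmark, revealing the gap between LLMs and humans in inverse reasoning and planning | large language model | ✅ | |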
🔬 Pillar 2: RL & Architecture (3 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
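| 8 | OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework | OpenRLHF: an easy-to-use, scalable, high-performance RLHF framework that accelerates LLM alignment | reinforcement learning RLHF large language model | ✅ | |
| 9 | A Study on Optimization Techniques for Variational Quantum Circuits in Reinforcement Learning | Studies optimization techniques for variational quantum circuits to improve the performance and stability of reinforcement learning in the NISQ era | reinforcement learning | | |
| 10 | Alternators For Sequence Modeling | Proposes Alternators, a sequence modeling method suited to generation, forecasting, and imputation tasks on complex sequence data | latent dynamics Mamba | | |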
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation | JailMine: a logit-based, token-level manipulation method for breaking through LLM jailbreak defenses | manipulation large language model | | |
| 12 | A Metric-based Principal Curve Approach for Learning One-dimensional Manifold | Proposes a metric-based principal curve method for learning one-dimensional manifolds of spatial data | MPC | | |