cs.LG(2025-03-18)
📊 共 13 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (6 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱一:机器人控制 (Robot Control) (1)
支柱五:交互与反应 (Interaction & Reaction) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection | 提出BayFlood,结合零样本视觉语言模型与贝叶斯模型用于城市洪水检测。 | foundation model | ||
| 2 | Rethinking the Evaluation of Secure Code Generation | 重新评估安全代码生成:现有方法在安全性和功能性上存在权衡 | large language model | ||
| 3 | EnvBench: A Benchmark for Automated Environment Setup | EnvBench:用于自动化环境配置的综合基准测试,涵盖Python和JVM项目。 | large language model | ✅ | |
| 4 | COPA: Comparing the incomparable in multi-objective model evaluation | COPA:通过相对排序比较多目标模型评估中的不可比指标。 | foundation model | ||
| 5 | Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation | 提出IMPRINT框架,通过神经崩塌现象指导的代理聚类,提升迁移学习中的权重印刻性能。 | foundation model | ✅ | |
| 6 | Speculative Decoding for Verilog: Speed and Quality, All in One | 针对Verilog代码生成,提出思辨解码方法,兼顾速度与质量。 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 7 | DashCLIP: Leveraging multimodal models for generating semantic embeddings for DoorDash | DashCLIP:利用多模态模型为DoorDash生成语义嵌入,提升产品理解和用户意图识别 | contrastive learning multimodal | ||
| 8 | Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | 提出Tiled Flash线性注意力(TFLA),加速线性RNN和xLSTM内核,提升长序列建模效率。 | Mamba linear attention | ||
| 9 | DAPO: An Open-Source LLM Reinforcement Learning System at Scale | DAPO:一个大规模LLM强化学习开源系统,在AIME 2024上达到50分 | reinforcement learning | ||
| 10 | LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers | LLM-FE:利用LLM作为进化优化器,实现表格数据自动化特征工程 | predictive model large language model | ||
| 11 | SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas | SocialJax:用于序贯社会困境中多智能体强化学习的评估套件 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Temporal Context Awareness: A Defense Framework Against Multi-turn Manipulation Attacks on Large Language Models | 提出TCA框架,防御大语言模型上的多轮对话操纵攻击 | manipulation large language model |
🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | Higher-Order Graphon Neural Networks: Approximation and Cut Distance | 提出不变Graphon网络(IWN),用于高阶图神经网络的逼近和割距离研究。 | OMOMO |