cs.LG(2025-04-21)

📊 共 9 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5) 支柱九:具身大模型 (Embodied Foundation Models) (4)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling 提出SPANE以解决多NUMA环境下动态虚拟机调度问题 reinforcement learning deep reinforcement learning
2 In-context Ranking Preference Optimization 提出IRPO框架,通过上下文排序偏好优化LLM,提升排序任务性能。 DPO direct preference optimization large language model
3 Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL Think2SQL:通过强化LLM推理能力提升Text2SQL性能 reinforcement learning large language model
4 Integrating Response Time and Attention Duration in Bayesian Preference Learning for Multiple Criteria Decision Aiding 提出一种融合反应时间和注意力时长的贝叶斯偏好学习框架,用于多标准决策辅助。 preference learning
5 Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment 提出动态对比技能学习(DCSL),解决强化学习中技能学习的灵活性和泛化性问题。 reinforcement learning contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
6 Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark 提出病毒学能力测试(VCT),用于评估LLM在复杂病毒学实验流程故障排除中的能力。 large language model multimodal
7 Scaling and Beyond: Advancing Spatial Reasoning in MLLMs Requires New Recipes MLLM空间推理能力不足,需变革现有架构与训练方法 large language model multimodal
8 Combating Toxic Language: A Review of LLM-Based Strategies for Software Engineering 综述:基于LLM的软件工程有毒语言检测与缓解策略 large language model
9 Compute-Optimal LLMs Provably Generalize Better With Scale 提出基于Freedman不等式的LLM泛化界限,揭示计算最优模型扩展优势 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页