cs.LG(2024-08-23)

📊 共 7 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (1 🔗1) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving 提出基于知识图谱建模的LLM OS,用于流程工程问题自动化 teacher-student large language model
2 The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities LLM微调终极指南:从基础到突破的技术、研究、实践与挑战全面综述 PPO DPO direct preference optimization
3 Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations 提出基于分段线性空间衰减的局部观测抽象方法,加速战斗模拟中强化学习智能体的训练。 reinforcement learning deep reinforcement learning spatial relationship
4 Hierarchical Spatio-Temporal State-Space Modeling for fMRI Analysis 提出基于Mamba的FST-Mamba模型,用于fMRI分析中的神经生物标志物发现。 Mamba state space model spatiotemporal
5 Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning 提出基于分层强化学习的智能作战模拟Agent,用于兵棋推演。 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
6 LLaVaOLMoBitnet1B: Ternary LLM goes Multimodal! 提出LLaVaOLMoBitnet1B:首个三元多模态大语言模型,支持图文输入。 large language model multimodal

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
7 Extraction of Typical Operating Scenarios of New Power System Based on Deep Time Series Aggregation 提出基于深度时间序列聚合的新型电力系统典型运行场景提取方法 spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页