cs.LG (2025-07-18)

📊 14 papers in total

🎯 Interest Area Navigation

Pillar 2: RL Algorithms & Architecture (9) · Pillar 9: Embodied Foundation Models (4) · Pillar 1: Robot Control (1)

🔬 Pillar 2: RL Algorithms & Architecture (9 papers)

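| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 1 | A million-scale dataset and generalizable foundation model for nanomaterial-protein interactions | Proposes the NanoPro-3M dataset and the NanoProFormer model for predicting nanomaterial-protein interactions. | representation learning, foundation model, multimodal |
| 2 | LLaPipe: LLM-Guided Reinforcement Learning for Automated Data Preparation Pipeline Construction | LLaPipe: uses LLM-guided reinforcement learning to build automated data preparation pipelines. | reinforcement learning, distillation, large language model |
| 3 | Reframing attention as a reinforcement learning problem for causal discovery | Proposes the Causal Process Model, which reframes the attention mechanism as a reinforcement learning problem for causal discovery. | reinforcement learning, deep reinforcement learning, representation learning |
| 4 | SoftPipe: A Soft-Guided Reinforcement Learning Framework for Automated Data Preparation | Proposes the SoftPipe framework to address the search-space problem in data preparation. | reinforcement learning, large language model |
| 5 | State Space Models Naturally Produce Traveling Waves, Time Cells, and Scale to Abstract Cognitive Functions | Proposes a framework based on state space models (SSMs) that unifies neuronal dynamics with cognitive functions and explains the emergence of time cells. | reinforcement learning, SSM, state space model |
| 6 | Preference-based Multi-Objective Reinforcement Learning | Proposes preference-based multi-objective reinforcement learning to address the difficulty of designing reward functions for complex tasks. | reinforcement learning, reward design |
| 7 | Toward Temporal Causal Representation Learning with Tensor Decomposition | Proposes the CaRTeD framework, which combines tensor decomposition with temporal causal representation learning to handle high-dimensional time series of unequal length. | representation learning |
| 8 | BikeVAE-GNN: A Variational Autoencoder-Augmented Hybrid Graph Neural Network for Sparse Bicycle Volume Estimation | Proposes BikeVAE-GNN to address sparse traffic volume estimation in urban bicycle networks. | MAE, spatial relationship |
| 9 | Dual-Center Graph Clustering with Neighbor Distribution | Proposes a dual-center graph clustering method based on neighbor distribution to improve graph clustering performance. | representation learning, contrastive learning |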

🔬 Pillar 9: Embodied Foundation Models (4 papers)

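| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 10 | Prompt Smart, Pay Less: Cost-Aware APO for Real-World Applications | Proposes a hybrid APE-OPRO framework that achieves cost-effective automatic prompt optimization in real-world commercial settings. | large language model, multimodal |
| 11 | Solo Connection: A Parameter Efficient Fine-Tuning Technique for Transformers | Proposes Solo Connection, a parameter-efficient fine-tuning technique for Transformers that improves natural language generation performance. | large language model |
| 12 | DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration | Proposes the DPMT framework, which improves real-time human-AI collaboration through a dual-process, multi-scale theory of mind. | large language model |
| 13 | Bi-GRU Based Deception Detection using EEG Signals | Uses a Bi-GRU with EEG signals for deception detection, reaching 97% accuracy on the Bag-of-Lies dataset. | multimodal |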

🔬 Pillar 1: Robot Control (1 paper)

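| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 14 | FuSeFL: Fully Secure and Scalable Cross-Silo Federated Learning | FuSeFL: a fully secure, scalable cross-silo federated learning scheme. | MPC, OMOMO |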
