cs.CL (2024-11-19)
📊 17 papers in total | 🔗 4 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (12 🔗2)
Pillar 2: RL & Architecture (4 🔗1)
Pillar 1: Robot Control (1 🔗1)
🔬 Pillar 9: Embodied Foundation Models (12 papers)
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning | Proposes GRL-Prompt, which uses reinforcement learning and knowledge graphs to optimize prompts for large language models. | reinforcement learning, reward shaping, large language model | | |
| 14 | ACING: Actor-Critic for Instruction Learning in Black-Box LLMs | ACING: an actor-critic method for instruction learning in black-box LLMs. | reinforcement learning, large language model, chain-of-thought | ✅ | |
| 15 | HNCSE: Advancing Sentence Embeddings via Hybrid Contrastive Learning with Hard Negatives | HNCSE: improves sentence embeddings through hybrid contrastive learning with hard negatives. | representation learning, contrastive learning | | |
| 16 | ProSec: Fortifying Code LLMs with Proactive Security Alignment | ProSec: strengthens the security of code LLMs through proactive security alignment. | preference learning, large language model | | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements | Proposes a method for predicting lexical-semantic disagreement based on model ensembling and anisotropy removal. | manipulation | ✅ | |