cs.AI（2024-09-26）

📊 共 31 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (21 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (8) 支柱七：动作重定向 (Motion Retargeting) (1 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (21 篇)

#	题目	一句话要点	标签	🔗
1	MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark	提出MMMT-IF基准测试，用于评估多模态多轮对话中指令遵循能力。	multimodal instruction following
2	Compositional Hardness of Code in Large Language Models -- A Probabilistic Perspective	揭示LLM代码生成中组合任务的内在困难，提出多智能体分解策略	large language model chain-of-thought
3	Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy	提出AMMORE数据集，并利用CoT提示提升LLM在数学形成性评估中边缘案例的评分准确率	large language model chain-of-thought
4	Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey	综述有害微调攻击与防御，应对大语言模型安全风险	large language model	✅
5	A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios	提出SiBraR单分支嵌入网络，解决推荐系统中冷启动和模态缺失问题。	multimodal
6	MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models	MaskLLM：面向大语言模型的可学习半结构化稀疏方法	large language model	✅
7	Development and Validation of a Large Language Model for Generating Fully-Structured Radiology Reports	提出动态模板约束解码的LLM，用于生成高质量、结构化的肺癌筛查报告。	large language model
8	Trustworthy AI: Securing Sensitive Data in Large Language Models	提出面向大语言模型的信任框架，保障敏感数据安全	large language model
9	A Scalable Data-Driven Framework for Systematic Analysis of SEC 10-K Filings Using Large Language Models	提出一种可扩展的数据驱动框架，利用大型语言模型系统分析SEC 10-K文件。	large language model
10	Infer Human's Intentions Before Following Natural Language Instructions	提出FISER框架，通过社交推理预测人类意图，提升具身协作任务中的指令跟随性能。	instruction following chain-of-thought
11	A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications	综述时空脑电图数据分析方法及其应用	large language model foundation model	✅
12	Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors	提出Policy Maps，引导LLM行为空间，辅助AI策略设计。	large language model
13	Data-Prep-Kit: getting your data ready for LLM application development	提出Data Prep Kit (DPK)，用于大规模语言模型应用开发的数据准备。	large language model
14	MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks	提出MoJE：一种基于专家混合和朴素表格分类器的LLM越狱攻击防御方法	large language model
15	Heuristics and Biases in AI Decision-Making: Implications for Responsible AGI	评估LLM认知偏差：揭示GPT-4o、Gemma 2和Llama 3.1的决策缺陷	large language model
16	The Nexus of AR/VR, AI, UI/UX, and Robotics Technologies in Enhancing Learning and Social Interaction for Children with Autism Spectrum Disorders: A Systematic Review	系统综述：AR/VR、AI、UI/UX与机器人技术融合，提升自闭症儿童的学习与社交互动	large language model
17	AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure	提出双重关注的AI代理，保障隐私与策略性自我披露，用于社交互动。	large language model
18	Dr. GPT in Campus Counseling: Understanding Higher Education Students' Opinions on LLM-assisted Mental Health Services	探索LLM在校园心理咨询中的应用：理解大学生对AI辅助心理健康服务的观点	large language model
19	Multi-Designated Detector Watermarking for Language Models	提出多指定检测器水印（MDDW）技术，用于保护大型语言模型的知识产权。	large language model
20	From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with Reflection	提出融合事件分析与LLM的时间序列预测方法，提升预测精度。	large language model
21	Human Mobility Modeling with Household Coordination Activities under Limited Information via Retrieval-Augmented LLMs	提出检索增强LLM框架，利用有限信息建模包含家庭协同的人类出行模式	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签
22	Navigation in a simplified Urban Flow through Deep Reinforcement Learning	提出基于PPO+LSTM的深度强化学习方法，优化无人机在城市环境中的自主导航。	reinforcement learning deep reinforcement learning DRL
23	Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning	提出结合BEV感知与深度强化学习的端到端自动驾驶框架，提升驾驶性能。	reinforcement learning deep reinforcement learning DRL
24	DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning	DRL-STNet：通过解耦表征学习实现跨模态医学图像分割的无监督域自适应	DRL representation learning
25	Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles	提出Role-RL，通过角色强化学习实现LLM在线长文本处理中的最优角色分配	reinforcement learning large language model
26	FactorSim: Generative Simulation via Factorized Representation	FactorSim：通过分解表示生成模拟环境，用于训练智能体。	reinforcement learning zero-shot transfer
27	Autonomous Network Defence using Reinforcement Learning	提出基于强化学习的自主网络防御方法，有效应对高级持续性威胁	reinforcement learning
28	Breaking PEFT Limitations: Leveraging Weak-to-Strong Knowledge Transfer for Backdoor Attacks in LLMs	提出FAKD：利用弱到强知识迁移增强LLM中基于PEFT的后门攻击	distillation large language model
29	Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimization	提出仅提示自奖励在线偏好优化算法，提升小模型在线RLHF性能。	reinforcement learning RLHF

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
30	A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction	提出异构混合专家模型TITAN，用于解决交通流量预测中变量中心学习不足的问题。	spatial relationship	✅

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
31	Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming	针对直播视频流提出主客观QoE评估方法，并构建TaoLive QoE数据集。	optical flow

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2024-09-26）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (21 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理