cs.AI（2024-08-20）

📊 共 28 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (18) 支柱二：RL算法与架构 (RL & Architecture) (7) 支柱一：机器人控制 (Robot Control) (2 🔗1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (18 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches	针对边缘设备LLM微调与部署，综述高效微调与压缩技术	large language model foundation model
2	Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification	提出注意力头掩码(AHM)方法，用于多模态文档分类中的OOD检测。	multimodal
3	From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis	GluFormer：基于连续血糖监测数据的通用基础模型，用于预测健康结果	foundation model
4	What can Large Language Models Capture about Code Functional Equivalence?	SeqCoBench：评估代码大语言模型对代码功能等价性理解能力的基准	large language model
5	LeCov: Multi-level Testing Criteria for Large Language Models	LeCov：面向大语言模型的多层次测试准则，提升模型可信度。	large language model
6	Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles	提出基于载体文章的黑盒越狱方法，提升大语言模型安全性	large language model
7	Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research	利用RAG-LLM作为新手研究助理，提升人才管理研究中定性数据分析效率。	large language model
8	Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models	提出Dr.Academy基准，评估大型语言模型在教育领域的问题生成能力	large language model
9	Large Language Model Driven Recommendation	利用大型语言模型驱动的推荐系统，实现个性化和交互式推荐。	large language model
10	Flexora: Flexible Low Rank Adaptation for Large Language Models	Flexora：一种灵活的低秩自适应方法，用于提升大语言模型在下游任务上的性能。	large language model
11	Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology	通过本地微调LLaMA-3大语言模型，实现辐射肿瘤科的自动化隐私保护型医生信函生成。	large language model
12	Investigating Context Effects in Similarity Judgements in Large Language Models	研究大型语言模型在相似性判断中受上下文效应的影响	large language model
13	Probing the Safety Response Boundary of Large Language Models via Unsafe Decoding Path Generation	提出JVD方法，通过生成不安全解码路径探测并利用大语言模型的安全漏洞。	large language model
14	How Well Do Large Language Models Serve as End-to-End Secure Code Agents for Python?	评估大型语言模型作为端到端安全Python代码生成器的能力，并提出迭代修复工具。	large language model
15	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Pluto and Charon：一种时间与内存高效的边缘AI协同框架，用于个人LLM微调	large language model
16	Automated Prompt Engineering for Cost-Effective Code Generation Using Evolutionary Algorithm	提出EPiC：利用进化算法进行低成本代码生成提示工程	large language model
17	Towards Efficient Formal Verification of Spiking Neural Network	提出基于时间编码的高效SNN形式化验证方法，提升对抗鲁棒性验证的可扩展性。	large language model
18	AI-Based IVR	提出基于AI的IVR系统，提升呼叫中心效率并适配哈萨克语	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
19	OCTCube-M: A 3D multimodal optical coherence tomography foundation model for retinal and systemic diseases with cross-cohort and cross-device validation	OCTCube-M：用于视网膜和全身疾病的3D多模态OCT基础模型	contrastive learning foundation model multimodal
20	QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning	提出QPO以解决查询依赖的提示优化问题	reinforcement learning offline reinforcement learning large language model
21	Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks	Hokoff：王者荣耀真实游戏数据集及其离线强化学习基准	reinforcement learning offline RL offline reinforcement learning
22	MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling	提出MambaDS模型，利用地形约束选择性状态空间建模实现近地面气象场降尺度	Mamba state space model
23	Minor SFT loss for LLM fine-tune to increase performance and reduce model deviation	提出MinorSFT损失函数，提升SFT微调效果并降低LLM模型偏移	PPO RLHF DPO
24	Strategist: Self-improvement of LLM Decision Making via Bi-Level Tree Search	STRATEGIST：基于双层树搜索的LLM决策自提升方法	reinforcement learning large language model
25	Hologram Reasoning for Solving Algebra Problems with Geometry Diagrams	提出基于全息推理的HGR方法，解决几何图代数问题	reinforcement learning deep reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
26	Dynamic Analysis and Adaptive Discriminator for Fake News Detection	提出动态分析与自适应判别器（DAAD）用于解决虚假新闻检测问题。	manipulation large language model multimodal	✅
27	DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation	DisMix：解耦乐器混合音源，实现音高和音色的源级别操控	manipulation

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
28	Trajectory Imputation in Multi-Agent Sports with Derivative-Accumulating Self-Ensemble	提出MIDAS，利用导数累积自集成方法解决多智能体运动轨迹插补问题	spatiotemporal

⬅️ 返回 cs.AI 首页 · 🏠 返回主页