cs.AI（2024-12-20）

📊 共 20 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (15) 支柱二：RL算法与架构 (RL & Architecture) (5 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (15 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Aria-UI: Visual Grounding for GUI Instructions	Aria-UI：提出纯视觉GUI指令理解模型，无需HTML/AXTree输入，实现更强的任务自动化。	multimodal visual grounding
2	Social Science Is Necessary for Operationalizing Socially Responsible Foundation Models	强调社会科学在负责任的基础模型落地中的必要性，构建社会技术系统框架。	foundation model
3	Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation	利用LLM生成混淆汇编代码：MetamorphASM基准测试与系统分析	large language model
4	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	提出Flab-Pruner，通过统一结构剪枝实现绿色代码大语言模型	large language model
5	VirusT5: Harnessing Large Language Models to Predicting SARS-CoV-2 Evolution	VirusT5：利用大型语言模型预测SARS-CoV-2病毒进化	large language model
6	AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI	探讨AI驱动下生物医学可视化分析的未来，强调“AI-in-the-loop”的人机协作模式。	large language model foundation model
7	AutoLife: Automatic Life Journaling with Smartphones and LLMs	AutoLife：利用智能手机和LLM自动生成生活日志	large language model multimodal
8	Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage	提出多模态Agent调优方法，构建VLM驱动的工具高效使用Agent	large language model
9	Formal Mathematical Reasoning: A New Frontier in AI	倡导形式化数学推理以推动AI4Math发展	large language model
10	The Evolution of LLM Adoption in Industry Data Curation Practices	探索LLM在工业界数据治理实践中的演进：从启发式到洞察驱动	large language model
11	MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design	MetaScientist：一种人机协同的自动化机械超材料设计框架	foundation model
12	Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring	在IDE中实现信任校准，促进AI重构的广泛应用	large language model
13	Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration	提出Collaborative Gym框架，用于人机协作Agent的开发与评估	large language model
14	Level-Navi Agent: A Framework and benchmark for Chinese Web Search Agents	提出Level-Navi Agent框架与Web24基准，用于评估中文Web搜索Agent能力	large language model
15	JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs	提出JailPO，通过偏好优化实现针对对齐LLM的黑盒越狱攻击	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
16	APIRL: Deep Reinforcement Learning for REST API Fuzzing	提出APIRL以解决REST API模糊测试中的性能与精度问题	reinforcement learning deep reinforcement learning
17	Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback	提出Align-Anything框架，利用语言反馈对齐多模态模型与人类意图	reinforcement learning RLHF large language model	✅
18	AIR: Unifying Individual and Collective Exploration in Cooperative Multi-Agent Reinforcement Learning	提出AIR，统一个体与集体探索，提升合作多智能体强化学习效果	reinforcement learning
19	Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning	提出基于自适应信息选择的隐式学习框架，解决通信受限多智能体强化学习问题	reinforcement learning
20	Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning	提出一种持续分层强化学习和规划的自主选项发明方法，提升样本效率。	reinforcement learning

⬅️ 返回 cs.AI 首页 · 🏠 返回主页