cs.AI(2025-12-08)
📊 共 4 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | AI-Powered Annotation Pipelines for Stabilizing Large Language Models: A Human-AI Synergy Approach | 提出AI驱动的标注流水线,稳定大语言模型并提升可靠性 | reinforcement learning RLHF large language model | ||
| 2 | Meta Hierarchical Reinforcement Learning for Scalable Resource Management in O-RAN | 提出Meta-HRL框架,用于O-RAN中可扩展的资源管理与网络切片联合优化。 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning | ReasonBENCH:评估LLM推理不稳定性的基准测试框架 | large language model chain-of-thought | ✅ | |
| 4 | ThinkTrap: Denial-of-Service Attacks against Black-box LLM Services via Infinite Thinking | ThinkTrap:针对黑盒LLM服务的无限思考拒绝服务攻击 | large language model |