cs.AI(2025-12-08)

📊 共 4 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (2) 支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
1 AI-Powered Annotation Pipelines for Stabilizing Large Language Models: A Human-AI Synergy Approach 提出AI驱动的标注流水线,稳定大语言模型并提升可靠性 reinforcement learning RLHF large language model
2 Meta Hierarchical Reinforcement Learning for Scalable Resource Management in O-RAN 提出Meta-HRL框架,用于O-RAN中可扩展的资源管理与网络切片联合优化。 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
3 ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning ReasonBENCH:评估LLM推理不稳定性的基准测试框架 large language model chain-of-thought
4 ThinkTrap: Denial-of-Service Attacks against Black-box LLM Services via Infinite Thinking ThinkTrap:针对黑盒LLM服务的无限思考拒绝服务攻击 large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页