cs.CL (2025-05-06)

📊 16 papers in total

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (14) · Pillar 2: RL & Architecture (2)

🔬 Pillar 9: Embodied Foundation Models (14 papers)

#	Title	One-line summary	Tags
1	Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models	Proposes LS-Mixture SFT, which addresses LLM overthinking introduced by SFT and improves reasoning efficiency.	large language model chain-of-thought
2	Advancing Conversational Diagnostic AI with Multimodal Reasoning	AMIE: improves the performance of conversational diagnostic AI through multimodal reasoning.	large language model multimodal
3	SLOT: Structuring the Output of Large Language Models	SLOT: converts LLM output into structured formats via post-processing, improving downstream task reliability.	large language model
4	Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions	Survey: LLM-driven methods for scientific hypothesis generation and validation.	large language model multimodal symbolic grounding
5	MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks	MedArabiQ: builds a benchmark of Arabic medical tasks to evaluate and improve LLMs in the medical domain.	large language model
6	TeleEval-OS: Performance evaluations of large language models for operations scheduling	TeleEval-OS: the first benchmark for evaluating LLM performance on telecom operations scheduling.	large language model
7	Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction	Combines large language models with traditional deep learning to predict social determinants of health.	large language model
8	BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models	Proposes BadLingual, a task-agnostic lingual-backdoor attack against large language models.	large language model
9	Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis	ConfiDx: an uncertainty-aware large language model for explainable disease diagnosis.	large language model
10	Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation	Proposes a lightweight clinical decision support system based on QLoRA-fine-tuned LLaMA 3.2-3B and RAG.	large language model foundation model
11	Faster MoE LLM Inference for Extremely Large Models	Proposes a faster inference method for extremely large MoE LLMs, improving efficiency and optimizing performance.	large language model
12	A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient	Proposes the Relative Danger Coefficient (RDC) for comparing ethical and safety gaps across LLMs.	large language model
13	Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework	AUGMENT: a user-behavior-driven automated paraphrasing framework for reliable LLM auditing.	large language model
14	Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback	Proposes Ψ-Arena, which interactively assesses and optimizes LLM-based psychological counselors using tripartite feedback.	large language model

🔬 Pillar 2: RL & Architecture (2 papers)

#	Title	One-line summary	Tags
15	Recall with Reasoning: Chain-of-Thought Distillation for Mamba's Long-Context Memory and Extrapolation	Proposes Recall with Reasoning, a chain-of-thought distillation method that improves Mamba's long-context memory and extrapolation.	Mamba distillation chain-of-thought
16	SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation	SepALM: uses audio language models for error correction to make speech separation more robust.	distillation large language model chain-of-thought
