| 1 |
The Thinking Therapist: Training Large Language Models to Deliver Acceptance and Commitment Therapy using Supervised Fine-Tuning and Odds Ratio Policy Optimization |
利用监督微调和优势比策略优化训练大语言模型进行接受与承诺疗法 |
large language model chain-of-thought |
|
|
| 2 |
Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector |
提出EnergyGPT,一个针对能源领域的专业大型语言模型,通过微调LLaMA 3.1-8B实现。 |
large language model |
|
|
| 3 |
Toward Purpose-oriented Topic Model Evaluation enabled by Large Language Models |
提出基于大语言模型的面向目的性主题模型评估框架,解决传统指标语义理解不足的问题。 |
large language model |
✅ |
|
| 4 |
MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations |
MedBench-IT:首个意大利医学入学考试LLM综合评测基准 |
large language model |
|
|
| 5 |
EPT Benchmark: Evaluation of Persian Trustworthiness in Large Language Models |
提出EPT基准,评估大型语言模型在波斯语环境下的可信度 |
large language model |
✅ |
|
| 6 |
A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs |
提出风机维护日志标注的LLM基准测试框架,加速运维数据分析。 |
large language model |
|
|
| 7 |
LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade |
利用LLM分析德国议会百年辩论,揭示从战后团结到反团结的转变 |
large language model |
|
|
| 8 |
Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data |
提出基于RNN编码器和Gemma 2B的分类器-LLM架构,用于脑电信号文本生成。 |
large language model |
|
|
| 9 |
On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts |
提出基于Wavelength的评估框架,衡量语言模型在广泛概念上的语用推理能力。 |
chain-of-thought |
|
|
| 10 |
Proof-Carrying Numbers (PCN): A Protocol for Trustworthy Numeric Answers from LLMs via Claim Verification |
提出Proof-Carrying Numbers以解决LLMs数值可信性问题 |
large language model |
|
|
| 11 |
COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens |
COMPACT:面向通道和Token的通用Token优化模型剪枝,提升小模型性能。 |
large language model |
|
|
| 12 |
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem |
利用饱和驱动的数据集生成方法,提升LLM在TPTP生态中的数学推理能力 |
large language model |
✅ |
|
| 13 |
MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security |
MoGU V2:提升LLM可用性与安全性帕累托前沿的框架 |
large language model |
|
|
| 14 |
Anchoring Refusal Direction: Mitigating Safety Risks in Tuning via Projection Constraint |
提出ProCon方法,通过投影约束缓解指令微调中大语言模型的安全性风险。 |
large language model |
|
|