| 1 |
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning |
BoostStep:通过改进单步推理提升大语言模型的数学能力 |
large language model chain-of-thought |
|
|
| 2 |
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases |
LangFair:用于评估大型语言模型偏见和公平性的Python软件包 |
large language model |
|
|
| 3 |
Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text |
提出基于图感知的少样本上下文学习方法,用于SQL查询到自然语言描述的生成。 |
large language model |
✅ |
|
| 4 |
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models |
PRMBench:一个用于过程级奖励模型细粒度和挑战性的评测基准 |
large language model |
|
|
| 5 |
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification |
提出GORAG:一种基于图检索增强生成框架,用于动态少样本文本分类 |
large language model |
|
|
| 6 |
Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity |
利用机器学习和可解释AI检测教育内容中AI生成文本,提升学术诚信 |
large language model |
|
|
| 7 |
Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text |
利用可解释AI进行LLM文本溯源,区分人类撰写和多种LLM生成文本 |
large language model |
|
|
| 8 |
CLIX: Cross-Lingual Explanations of Idiomatic Expressions |
提出CLIX任务,利用跨语言解释成语,辅助语言学习者词汇扩展。 |
large language model |
|
|
| 9 |
VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity |
VicSim:通过情感和语言保真度增强受害者模拟,用于情景训练。 |
large language model |
|
|
| 10 |
Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling |
提出EmpRes,一种情感引导的常识感知回复生成方法,用于心理健康咨询。 |
foundation model |
|
|
| 11 |
ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning |
提出ADePT,通过自适应分解Prompt调整,提升参数高效微调性能。 |
large language model |
✅ |
|