| 1 |
Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models |
提出 Marco-Bench-MIF,用于评估大语言模型的多语言指令遵循能力。 |
large language model instruction following |
✅ |
|
| 2 |
Improving Drug Identification in Overdose Death Surveillance using Large Language Models |
利用大型语言模型改进药物过量死亡监测中的药物识别 |
large language model |
|
|
| 3 |
Improving Contextual ASR via Multi-grained Fusion with Large Language Models |
提出一种多粒度融合的上下文ASR方法,利用大型语言模型提升关键词识别。 |
large language model |
|
|
| 4 |
A Comparative Approach to Assessing Linguistic Creativity of Large Language Models and Humans |
提出一种评估大型语言模型和人类语言创造力的通用测试方法 |
large language model |
|
|
| 5 |
Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness |
基于价值的大语言模型智能体模拟,用于互评估信任和人际亲密度 |
large language model |
|
|
| 6 |
Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker |
利用大语言模型和眼动追踪生物标记,提出基于图表示的阅读理解分析方法。 |
large language model |
|
|
| 7 |
Tracing Facts or just Copies? A critical investigation of the Competitions of Mechanisms in Large Language Models |
探究大语言模型中机制竞争:事实追踪还是简单复制? |
large language model |
|
|
| 8 |
Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data |
提出一种高级RAG框架,用于处理结构化企业内部数据,提升问答性能。 |
large language model multimodal |
✅ |
|
| 9 |
DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning |
DyG-RAG:提出事件中心动态图检索增强生成框架,解决时序推理难题。 |
large language model chain-of-thought |
✅ |
|
| 10 |
Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning |
MEGA:结合苏格拉底教学法和LLM的数学解释方法,提升学生学习效果 |
large language model chain-of-thought |
|
|
| 11 |
PARAM-1 BharatGen 2.9B Model |
PARAM-1:一个以印度语言多样性为核心的29亿参数语言模型 |
large language model foundation model |
|
|
| 12 |
A Survey of Deep Learning for Geometry Problem Solving |
深度学习赋能几何问题求解:综述与前瞻 |
large language model multimodal |
✅ |
|
| 13 |
Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models |
提出基于CoT激活的线性探针,用于提前预测推理模型对齐状态 |
large language model |
|
|
| 14 |
Probing for Arithmetic Errors in Language Models |
利用语言模型内部激活探测算术错误并指导模型自纠错 |
chain-of-thought |
|
|
| 15 |
Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker |
提出一种可扩展的视觉增强问答系统,利用可扩展的视觉嵌入检索和后期交互重排序器。 |
multimodal |
|
|
| 16 |
Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate |
提出多代理辩论框架以增强LLM对请求歧义的检测能力 |
large language model |
|
|
| 17 |
Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization |
提出Chain-of-Descriptions方法,提升代码大模型在VHDL代码生成与摘要任务上的性能 |
large language model |
|
|
| 18 |
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding |
Text-ADBench:基于LLM嵌入的文本异常检测基准,揭示嵌入质量是关键。 |
large language model |
✅ |
|
| 19 |
Identifying Algorithmic and Domain-Specific Bias in Parliamentary Debate Summarisation |
提出多阶段总结框架,评估LLM在议会辩论总结中的算法和领域偏差。 |
large language model |
|
|
| 20 |
Iterative Augmentation with Summarization Refinement (IASR) Evaluation for Unstructured Survey data Modeling and Analysis |
提出IASR框架,用于评估和优化LLM在非结构化调查数据建模中的增广效果。 |
large language model |
|
|
| 21 |
TopicImpact: Improving Customer Feedback Analysis with Opinion Units for Topic Modeling and Star-Rating Prediction |
TopicImpact:利用观点单元改进客户反馈分析,提升主题建模和星级预测 |
large language model |
|
|
| 22 |
Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation |
提出一种毒性感知的少样本提示框架,用于低资源Singlish翻译,提升毒性内容翻译质量。 |
large language model |
|
|
| 23 |
BlockBPE: Parallel BPE Tokenization |
提出BlockBPE以解决GPU批量推理中的BPE瓶颈问题 |
large language model |
|
|