| 1 |
LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess |
提出LLM CHESS框架以评估LLMs的推理与指令遵循能力 |
large language model instruction following |
|
|
| 2 |
Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback |
提出Chain-of-Ground框架,通过迭代推理和反馈提升GUI定位精度 |
large language model multimodal |
|
|
| 3 |
OntoMetric: An Ontology-Driven LLM-Assisted Framework for Automated ESG Metric Knowledge Graph Generation |
OntoMetric:一种本体驱动的LLM辅助框架,用于自动生成ESG指标知识图谱 |
large language model |
|
|
| 4 |
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs |
发现并分析LLM中与幻觉相关的神经元(H-Neurons),揭示其影响与起源 |
large language model |
|
|