Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing

作者: Nicholas Skytland, Lauren Parsons, Alicia Llewellyn, Steele Billings, Peter Larson, John Anderson, Sean Boisen, Steve Runge

分类: cs.AI

发布日期: 2026-04-07

💡 一句话要点

提出基于基督教人类繁荣理解的AI评估框架

🎯 匹配领域: 支柱九：具身大模型 (Embodied Foundation Models)

关键词: 人工智能 道德推理 基督教 人类繁荣 评估框架 价值对齐 大型语言模型

📋 核心要点

当前AI系统在道德和神学一致性方面存在显著不足，表现出程序世俗主义的倾向。
提出基督教单轮繁荣AI基准（FAI-C-ST），用于评估AI模型在基督教人类繁荣七个维度的表现。
实验结果显示，现有AI系统在信仰与灵性维度的表现下降31分，整体表现下降约17分，揭示了价值对齐的深层问题。

📝 摘要（中文）

人工智能（AI）对齐问题不仅是安全问题，更是形成问题。随着大型语言模型（LLMs）在道德思考和精神探究中的作用日益增强，它们不仅提供信息，还积极塑造人类的理解、决策和道德反思。为使这种形成性影响可见且可测，我们提出了基督教单轮繁荣AI基准（FAI-C-ST），旨在评估前沿模型在七个维度上对基督教人类繁荣的响应。通过将20个前沿模型与多元和基督教特定标准进行比较，我们发现当前AI系统并非世界观中立，而是倾向于程序世俗主义，缺乏维持神学一致性的基础，导致所有繁荣维度的系统性表现下降约17分，尤其在信仰与灵性维度下降31分。这些发现表明，价值对齐的表现差距并非技术限制，而是源于优先考虑广泛接受性和安全性的训练目标。

🔬 方法详解

问题定义：本论文旨在解决AI系统在道德和神学一致性方面的不足，现有方法未能有效评估AI对人类繁荣的影响，尤其是在基督教视角下的表现。

核心思路：论文提出基督教单轮繁荣AI基准（FAI-C-ST），通过七个维度评估AI模型的响应，强调AI的形成性影响而非仅仅是信息提供。

技术框架：FAI-C-ST框架包括对20个前沿模型的评估，比较其在基督教和多元标准下的表现，涵盖信仰、灵性等七个维度。

关键创新：最重要的创新在于将基督教的繁荣理解作为评估标准，揭示了AI系统在道德推理上的缺陷，与现有方法的世界观中立性形成鲜明对比。

关键设计：在评估过程中，设置了多元和基督教特定标准，采用了系统的评分机制，以量化AI在各个维度的表现，确保评估的全面性和深度。

📊 实验亮点

实验结果表明，当前AI系统在基督教人类繁荣的评估中表现不佳，整体得分下降约17分，尤其在信仰与灵性维度下降31分。这一发现强调了AI系统在价值对齐方面的深层问题，提示未来研究需关注道德和神学一致性。

🎯 应用场景

该研究的潜在应用领域包括道德AI开发、教育和宗教指导等。通过提供一个基于基督教人类繁荣的评估框架，能够帮助开发者更好地理解和调整AI系统的道德和伦理表现，促进AI在社会中的积极应用，推动人类的全面发展。

📄 摘要（原文）

Artificial intelligence (AI) alignment is fundamentally a formation problem, not only a safety problem. As Large Language Models (LLMs) increasingly mediate moral deliberation and spiritual inquiry, they do more than provide information; they function as instruments of digital catechesis, actively shaping and ordering human understanding, decision-making, and moral reflection. To make this formative influence visible and measurable, we introduce the Flourishing AI Benchmark: Christian Single-Turn (FAI-C-ST), a framework designed to evaluate Frontier Model responses against a Christian understanding of human flourishing across seven dimensions.By comparing 20 Frontier Models against both pluralistic and Christian-specific criteria, we show that current AI systems are not worldview-neutral. Instead, they default to a Procedural Secularism that lacks the grounding necessary to sustain theological coherence, resulting in a systematic performance decline of approximately 17 points across all dimensions of flourishing. Most critically, there is a 31-point decline in the Faith and Spirituality dimension. These findings suggest that the performance gap in values alignment is not a technical limitation, but arises from training objectives that prioritize broad acceptability and safety over deep, internally coherent moral or theological reasoning.

Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing

💡 一句话要点

📋 核心要点

📝 摘要（中文）

🔬 方法详解

📊 实验亮点

🎯 应用场景

📄 摘要（原文）

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理