Large Language Models for Cyber Security: A Systematic Literature Review

作者: Hanxiang Xu, Shenao Wang, Ningke Li, Kailong Wang, Yanjie Zhao, Kai Chen, Ting Yu, Yang Liu, Haoyu Wang

分类: cs.CR, cs.AI

发布日期: 2024-05-08 (更新: 2025-09-22)

备注: Accepted by ACM Transactions on Software Engineering and Methodology (TOSEM)

💡 一句话要点

综述性研究：大型语言模型在网络安全领域的应用与趋势分析

🎯 匹配领域: 支柱九：具身大模型 (Embodied Foundation Models)

关键词: 大型语言模型 网络安全 文献综述 漏洞检测 恶意软件分析 入侵检测 自主代理 提示工程

📋 核心要点

网络安全威胁日益复杂，传统方法难以有效应对，亟需智能化系统自动检测漏洞和分析恶意软件。
本研究系统性地回顾了大型语言模型在网络安全领域的应用，分析了不同模型架构的应用趋势。
论文关注LLM在网络安全领域的适配技术，如微调、提示工程和外部增强，并展望了自主代理的应用。

📝 摘要（中文）

大型语言模型（LLMs）的快速发展为在包括网络安全在内的各个应用领域利用人工智能开辟了新的机遇。随着网络威胁的数量和复杂性持续增长，越来越需要能够自动检测漏洞、分析恶意软件和响应攻击的智能系统。本综述全面回顾了LLMs在网络安全中的应用（LLM4Security）的相关文献。通过全面收集超过4万篇相关论文，并系统分析来自顶级安全和软件工程会议的185篇论文，旨在全面了解LLMs如何被用于解决网络安全领域的各种问题。通过分析，我们确定了几个关键发现。首先，我们观察到LLMs正被应用于越来越广泛的网络安全任务，包括漏洞检测、恶意软件分析和网络入侵检测。其次，我们分析了不同LLM架构（如encoder-only、encoder-decoder和decoder-only）在安全领域的应用趋势。第三，我们确定了越来越多用于将LLMs适配于网络安全的复杂技术，如高级微调、提示工程和外部增强策略。一个重要的新兴趋势是基于LLM的自主代理的使用，这代表了一种从单任务执行到编排复杂的多步骤安全工作流程的范式转变。

🔬 方法详解

问题定义：当前网络安全领域面临着日益增长和复杂的威胁，传统的安全方法在漏洞检测、恶意软件分析和入侵检测等方面存在局限性，需要更智能化的解决方案。现有方法难以有效处理大规模数据，且缺乏对新型攻击模式的适应能力。

核心思路：本研究的核心思路是系统性地调研和分析大型语言模型（LLMs）在网络安全领域的应用现状和发展趋势，从而为研究人员和从业者提供一个全面的视角。通过分析LLMs在不同安全任务中的表现，揭示其优势和局限性，并探讨未来的研究方向。

技术框架：该研究采用系统性文献综述的方法，首先通过关键词检索等方式收集了超过4万篇相关论文，然后筛选出185篇来自顶级安全和软件工程会议的论文进行深入分析。分析内容包括LLMs在不同安全任务中的应用（如漏洞检测、恶意软件分析、入侵检测），不同LLM架构（encoder-only, encoder-decoder, decoder-only）的应用趋势，以及LLMs的适配技术（微调、提示工程、外部增强）。

关键创新：本研究的关键创新在于对LLMs在网络安全领域的应用进行了全面的、系统的梳理和分析，并识别出了该领域的一些新兴趋势，例如基于LLM的自主代理。该研究为理解LLMs在网络安全中的潜力以及未来的研究方向提供了重要的参考。

关键设计：该研究侧重于对现有文献的分析和归纳，没有涉及具体的模型设计或参数设置。研究关注LLMs在不同安全任务中的应用，并分析了不同LLM架构的优劣。此外，研究还关注了LLMs的适配技术，例如如何通过微调、提示工程和外部增强来提高LLMs在网络安全任务中的性能。

🖼️ 关键图片

📊 实验亮点

该综述分析了超过4万篇相关论文，并深入研究了185篇顶级安全会议论文，揭示了LLMs在漏洞检测、恶意软件分析和网络入侵检测等任务中的应用趋势。研究还强调了LLM微调、提示工程和外部知识增强等关键技术，并预测了基于LLM的自主代理在网络安全领域的巨大潜力。

🎯 应用场景

该研究成果可应用于指导网络安全从业者和研究人员更好地利用大型语言模型来提升安全防护能力，例如开发更智能的漏洞检测工具、恶意软件分析系统和入侵检测系统。此外，基于LLM的自主代理有望实现自动化安全响应，减轻人工负担。

📄 摘要（原文）

The rapid advancement of Large Language Models (LLMs) has opened up new opportunities for leveraging artificial intelligence in a variety of application domains, including cybersecurity. As the volume and sophistication of cyber threats continue to grow, there is an increasing need for intelligent systems that can automatically detect vulnerabilities, analyze malware, and respond to attacks. In this survey, we conduct a comprehensive review of the literature on the application of LLMs in cybersecurity~(LLM4Security). By comprehensively collecting over 40K relevant papers and systematically analyzing 185 papers from top security and software engineering venues, we aim to provide a holistic view of how LLMs are being used to solve diverse problems across the cybersecurity domain. Through our analysis, we identify several key findings. First, we observe that LLMs are being applied to an expanding range of cybersecurity tasks, including vulnerability detection, malware analysis, and network intrusion detection. Second, we analyze application trends of different LLM architectures (such as encoder-only, encoder-decoder, and decoder-only) across security domains. Third, we identify increasingly sophisticated techniques for adapting LLMs to cybersecurity, such as advanced fine-tuning, prompt engineering, and external augmentation strategies. A significant emerging trend is the use of LLM-based autonomous agents, which represent a paradigm shift from single-task execution to orchestrating complex, multi-step security workflows.

Large Language Models for Cyber Security: A Systematic Literature Review

💡 一句话要点

📋 核心要点

📝 摘要（中文）

🔬 方法详解

🖼️ 关键图片

📊 实验亮点

🎯 应用场景

📄 摘要（原文）

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理