Multimodal Stock Price Prediction: A Case Study of the Russian Securities Market

作者: Kasymkhan Khubiev, Mikhail Semenov

分类: q-fin.ST, cs.LG, q-fin.CP

发布日期: 2025-03-05

备注: NSCF-2024, PROGRAM SYSTEMS: THEORY AND APPLICATIONS

期刊: http://psta.psiras.ru:8081/ru/2025/1_83-130

DOI: 10.25209/2079-3316-2025-16-1-83-130

💡 一句话要点

提出一种融合新闻文本和时间序列的多模态方法，用于提升俄罗斯股市价格预测精度。

🎯 匹配领域: 支柱九：具身大模型 (Embodied Foundation Models)

关键词: 多模态预测 股票价格预测 时间序列分析 自然语言处理 金融新闻 LSTM RuBERT Vikhr-Qwen2.5-0.5b-Instruct

📋 核心要点

传统资产价格预测方法主要依赖数值数据，忽略了新闻流对价格形成的重要影响。
本文提出一种多模态方法，结合K线时间序列和新闻文本数据，提升金融资产价格预测的准确性。
实验结果表明，融合文本模态后，预测的平均绝对百分比误差（MAPE）降低了55%。

📝 摘要（中文）

本文研究了使用多模态方法预测金融资产价格的问题，该方法结合了K线时间序列和新闻文本数据。作者收集了一个独特的数据集，包含莫斯科交易所176只俄罗斯股票的时间序列数据和79555篇俄语金融新闻文章。文本数据处理使用了预训练模型RuBERT和Vikhr-Qwen2.5-0.5b-Instruct，时间序列和向量化文本数据则使用LSTM循环神经网络进行处理。实验比较了基于单一模态（仅时间序列）和双模态的模型，以及不同的文本向量表示聚合方法。使用准确率（价格变动方向预测）和平均绝对百分比误差（MAPE）评估预测质量。实验结果表明，加入文本模态可将MAPE值降低55%。该多模态数据集对金融领域语言模型的进一步适配具有价值。未来的研究方向包括优化文本模态参数，例如时间窗口、情感和新闻消息的时间顺序。

🔬 方法详解

问题定义：论文旨在解决金融资产价格预测问题，传统方法主要依赖数值数据，忽略了新闻文本信息。现有方法的痛点在于无法有效利用新闻等非结构化数据来提升预测精度，导致预测结果与实际市场情况存在偏差。

核心思路：论文的核心思路是将新闻文本数据作为补充信息，与传统的K线时间序列数据进行融合，构建多模态预测模型。通过结合两种模态的信息，模型能够更全面地理解市场动态，从而提高预测的准确性。

技术框架：整体框架包括数据收集、文本数据处理、时间序列数据处理和模型融合四个主要阶段。首先，收集俄罗斯股票的K线时间序列数据和相关金融新闻文本数据。然后，使用预训练语言模型（RuBERT和Vikhr-Qwen2.5-0.5b-Instruct）对新闻文本进行向量化表示。接着，使用LSTM循环神经网络分别处理时间序列数据和向量化的文本数据。最后，将两种模态的特征进行融合，输入到最终的预测模型中。

关键创新：论文的关键创新在于将预训练的俄语语言模型应用于金融新闻文本的向量化，并将其与时间序列数据进行有效融合。此外，论文还比较了不同的文本向量表示聚合方法，并分析了文本模态对预测结果的影响。

关键设计：在文本数据处理方面，论文使用了RuBERT和Vikhr-Qwen2.5-0.5b-Instruct两种预训练模型，并比较了它们在金融新闻文本向量化方面的性能。在模型融合方面，论文采用了LSTM循环神经网络，并探索了不同的融合策略。此外，论文还使用了准确率（Accuracy）和平均绝对百分比误差（MAPE）作为评估指标，以全面衡量模型的预测性能。

🖼️ 关键图片

📊 实验亮点

实验结果表明，与仅使用时间序列数据的模型相比，融合新闻文本数据的多模态模型显著提高了预测精度，MAPE值降低了55%。这表明新闻文本信息对于股票价格预测具有重要价值。此外，实验还比较了不同的文本向量表示聚合方法，为后续研究提供了参考。

🎯 应用场景

该研究成果可应用于金融投资决策支持系统，帮助投资者更准确地预测股票价格走势，降低投资风险。此外，该方法还可以扩展到其他金融资产的预测，例如外汇、期货等。未来，该研究可以促进金融领域语言模型的进一步发展，为金融市场的智能化提供技术支持。

📄 摘要（原文）

Classical asset price forecasting methods primarily rely on numerical data, such as price time series, trading volumes, limit order book data, and technical analysis indicators. However, the news flow plays a significant role in price formation, making the development of multimodal approaches that combine textual and numerical data for improved prediction accuracy highly relevant. This paper addresses the problem of forecasting financial asset prices using the multimodal approach that combines candlestick time series and textual news flow data. A unique dataset was collected for the study, which includes time series for 176 Russian stocks traded on the Moscow Exchange and 79,555 financial news articles in Russian. For processing textual data, pre-trained models RuBERT and Vikhr-Qwen2.5-0.5b-Instruct (a large language model) were used, while time series and vectorized text data were processed using an LSTM recurrent neural network. The experiments compared models based on a single modality (time series only) and two modalities, as well as various methods for aggregating text vector representations. Prediction quality was estimated using two key metrics: Accuracy (direction of price movement prediction: up or down) and Mean Absolute Percentage Error (MAPE), which measures the deviation of the predicted price from the true price. The experiments showed that incorporating textual modality reduced the MAPE value by 55%. The resulting multimodal dataset holds value for the further adaptation of language models in the financial sector. Future research directions include optimizing textual modality parameters, such as the time window, sentiment, and chronological order of news messages.

Multimodal Stock Price Prediction: A Case Study of the Russian Securities Market

💡 一句话要点

📋 核心要点

📝 摘要（中文）

🔬 方法详解

🖼️ 关键图片

📊 实验亮点

🎯 应用场景

📄 摘要（原文）

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理