CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning

Authors

  • Peiyuan Liu Tsinghua Shenzhen International Graduate School
  • Hang Guo Tsinghua Shenzhen International Graduate School
  • Tao Dai Shenzhen University
  • Naiqi Li Tsinghua Shenzhen International Graduate School
  • Jigang Bao Tsinghua Shenzhen International Graduate School
  • Xudong Ren Tsinghua Shenzhen International Graduate School
  • Yong Jiang Tsinghua Shenzhen International Graduate School
  • Shu-Tao Xia Tsinghua Shenzhen International Graduate School Pengcheng Laboratory

DOI:

https://doi.org/10.1609/aaai.v39i18.34082

Abstract

Deep learning (e.g., Transformer) has been widely and successfully used in multivariate time series forecasting (MTSF). Unlike existing methods that focus on training models from a single modal of time series input, large language models (LLMs) based MTSF methods with cross-modal text and time series input have recently shown great superiority, especially with limited temporal data. However, current LLM-based MTSF methods usually focus on adapting and fine-tuning LLMs, while neglecting the distribution discrepancy between textual and temporal input tokens, thus leading to sub-optimal performance. To address this issue, we propose a novel Cross-Modal LLM Fine-Tuning (CALF) framework for MTSF by reducing the distribution discrepancy between textual and temporal data, which mainly consists of the temporal target branch with temporal input and the textual source branch with aligned textual input. To reduce the distribution discrepancy, we develop the cross-modal match module to first align cross-modal input distributions. Additionally, to minimize the modality distribution gap in both feature and output spaces, feature regularization loss is developed to align the intermediate features between the two branches for better weight updates, while output consistency loss is introduced to allow the output representations of both branches to correspond effectively. Thanks to the modality alignment, CALF establishes state-of-the-art performance for both long-term and short-term forecasting tasks with low computational complexity, and exhibits favorable few-shot and zero-shot abilities similar to that in LLMs.

Downloads

Published

2025-04-11

How to Cite

Liu, P., Guo, H., Dai, T., Li, N., Bao, J., Ren, X., … Xia, S.-T. (2025). CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(18), 18915–18923. https://doi.org/10.1609/aaai.v39i18.34082

Issue

Section

AAAI Technical Track on Machine Learning IV