It’s not the first time that researchers have tried to apply natural language processing (NLP) techniques to the field of time series.
For example, the Transformer architecture was a significant milestone in NLP, but its performance in time series forecasting remained average until PatchTST was proposed.
As we know, large language models (LLMs) are being actively developed and have demonstrated impressive generalization and reasoning capabilities in NLP.
Thus, it is worth exploring the idea of repurposing an LLM for time series forecasting, so that we can benefit from the capabilities of these large pre-trained models.
To that end, Time-LLM was proposed. In the original paper, the researchers propose a framework to reprogram an existing LLM to perform time series forecasting.
In this article, we explore the architecture of Time-LLM and how it can effectively allow an LLM to predict time series data. Then, we implement the model and apply it in a small forecasting project.
For more details, make sure to read the original paper.
Let’s get started!
Time-LLM should be considered more as a framework than an actual model with a specific architecture.
The general structure of Time-LLM is shown below.
The entire idea behind Time-LLM is to reprogram an embedding-visible language foundation model, like LLaMA or GPT-2.
Note that this is different from fine-tuning the LLM. Instead, we teach the LLM to take an input sequence of time steps and output forecasts over a certain horizon. This means that the LLM itself remains unchanged.
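To make this concrete, here is a minimal sketch (assuming PyTorch and the Hugging Face transformers library; the GPT-2 backbone and the `input_proj` layer are illustrative choices, not the paper's exact setup) of what keeping the backbone frozen looks like: the pre-trained weights are never updated, and only newly added layers receive gradients.

```python
import torch
from transformers import GPT2Model

# Load a pre-trained LLM backbone and freeze it: its weights are never updated.
backbone = GPT2Model.from_pretrained("gpt2")
for param in backbone.parameters():
    param.requires_grad = False

# Only newly added components are trainable. "input_proj" is a hypothetical
# example of such a layer, mapping a patch of 16 time steps to the LLM's
# hidden size so the frozen backbone can process it.
input_proj = torch.nn.Linear(16, backbone.config.n_embd)
trainable_params = [p for p in input_proj.parameters() if p.requires_grad]
print(f"Trainable tensors: {len(trainable_params)}")  # 2: weight and bias
```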
At a high level, Time-LLM starts by tokenizing the input time series sequence with a customized patch embedding layer. These patches are then sent through…
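As an illustration of that patching step, here is a minimal sketch (assuming PyTorch; the series length, patch length, stride, and hidden size are hypothetical values chosen for illustration, not the paper's settings) of splitting a univariate series into overlapping patches and embedding each one:

```python
import torch

# Hypothetical example: a univariate series of 512 time steps, split into
# overlapping patches of length 16 with a stride of 8.
series = torch.randn(1, 512)   # (batch, sequence length)
patch_len, stride = 16, 8

# Each patch becomes one "token" for the frozen LLM backbone.
patches = series.unfold(dimension=-1, size=patch_len, step=stride)  # (1, 63, 16)

# A linear patch embedding projects each patch to the backbone's hidden size.
embed = torch.nn.Linear(patch_len, 768)
patch_tokens = embed(patches)  # (1, 63, 768), ready for the backbone
```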