Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale

Created by
  • Haebom

Author

Minjong Cheon

Outline

This paper addresses the stability issue of long-term forecasts using large-scale weather models (LWMs). Existing models, such as SFNO and DLWP-HPX, achieve stable long-term forecasts by transforming input data into non-standard spatial domains, such as spherical harmonic functions or HEALPix meshes. This has been considered essential for physical consistency and long-term stability. In this paper, we challenge this assumption and investigate whether similar long-term forecast performance can be achieved on a standard latitude-longitude grid. To achieve this, we propose AtmosMJ, a deep convolutional neural network that directly processes ERA5 data. AtmosMJ utilizes a novel Gated Residual Fusion (GRF) mechanism to prevent error accumulation and ensures stability in long-term recursive simulations. Experimental results demonstrate that AtmosMJ generates stable and physically plausible forecasts for approximately 500 days, and is competitive with models such as Pangu-Weather and GraphCast in terms of 10-day forecast accuracy. It is also noteworthy that these results were achieved in a short training time of 5.7 days using a V100 GPU. In conclusion, this paper suggests that efficient architecture design, rather than non-standard data representations, is the key to ensuring the stability and computational efficiency of long-term weather forecasting.

Takeaways, Limitations

Takeaways:
It shows that long-term stable weather forecasting is possible even on a standard latitude-longitude grid.
Emphasizes that efficient architecture design is important for the stability and computational efficiency of long-term weather forecasting.
Achieve competitive prediction performance with low training cost compared to existing models.
The GRF mechanism of the AtmosMJ model presents a new method that can contribute to improving the stability of long-term prediction models.
Limitations:
AtmosMJ's long-term prediction performance is limited to 500 days. Further research is needed to achieve longer-term predictions.
Since it was only applied to ERA5 data, generalization performance verification on other datasets is required.
Further research is needed on the physical interpretation of the model.
Comparative analysis of forecast performance for various weather phenomena may be lacking.
👍