Research · Frontier Lab & Model News

Back to sweep

Research sweep · deep · 2015 – 2026

AI in Weather and Climate Prediction

AI in weather and climate prediction across the 2015 to June 2026 machine-learning era, with historical context from mid-twentieth-century numerical weather prediction and Lorenz's chaos theory: the shift from physics-based NWP and statistical post-processing (MOS) to data-driven models (GraphCast, GenCast, Pangu-Weather, FourCastNet, Aurora, NeuralGCM, ECMWF AIFS), how forecasters at ECMWF, NOAA, and the Met Office have operationalised them, measured accuracy versus the IFS, and the predictability limits imposed by chaos, the Lorenz attractor, and the butterfly effect.

  • Claude Opus 4.8
  • financial
  • frontier
  • academic
  • vc
  • blogs

Synthesised 2026-06-26

Narrative

The wave of deep-learning weather models that emerged from 2022 onwards represents the most consequential shift in meteorology since ensemble numerical weather prediction. The first significant contact between big-tech AI methods and operational meteorology came in 2022, when NVIDIA released FourCastNet (Pathak et al.), Huawei published Pangu-Weather (Bi et al.), and Google DeepMind circulated an early GraphCast preprint. FourCastNet used an Adaptive Fourier Neural Operator architecture on a Vision Transformer backbone, training on ERA5 at 0.25-degree resolution and achieving inference speeds five orders of magnitude faster than conventional NWP. Pangu-Weather followed with a 3D Earth-Specific Transformer, becoming the first model to outperform ECMWF's deterministic IFS on all tested variables at that resolution. Both were trained primarily on ERA5 reanalysis, which provided the decades of consistent global atmospheric state data that made supervised learning feasible at scale.

The landmark publication came in November 2023, when Remi Lam and the Google DeepMind team published GraphCast in Science. The model used a message-passing graph neural network on a multi-scale spherical mesh, training directly from ERA5 and fine-tuning on ECMWF HRES analyses, and demonstrated superior accuracy on 90 percent of 1,380 verification targets against HRES, completing a 10-day global forecast in under one minute. GenCast, described in a 2024 Nature paper, extended the architecture to probabilistic ensemble forecasting using a conditional diffusion transformer, outperforming ECMWF's 51-member ENS on 97.2 percent of verification targets and producing physically coherent ensemble members with appropriate spherical harmonic power spectra. Microsoft Research then published Aurora in Nature in May 2025, a 1.3-billion-parameter foundation model pre-trained on over one million hours of geophysical data that beat existing numerical and AI models across 91 percent of forecasting targets and extended to air quality, ocean wave, and hurricane prediction through fine-tuning. Google's NeuralGCM, published in Nature in August 2024, took a different path, embedding learned ML physics parameterisations inside a classical differentiable dynamical core, achieving accurate 1-to-15-day ensemble weather forecasts while also running stable multi-decade climate simulations, the only hybrid model to bridge weather and climate timescales credibly.

Operationalisation moved from research demonstration to production across 2024 and 2025. ECMWF launched AIFS 1.0 on 25 February 2025, running the GNN-based model alongside the physics-based IFS and reporting tropical cyclone track improvements of up to 20 percent and an approximately 1,000-fold reduction in energy use per forecast. A deterministic AIFS Single and a probabilistically trained AIFS ENS were both upgraded to version 2 by May 2026. NOAA declared three AI-driven models operational in December 2025: AIGFS (deterministic), AIGEFS (31-member ensemble), and HGEFS (a 62-member hybrid of AI and physics traces), with AIGFS completing a 16-day global forecast using only 0.3 percent of the computing resources of the traditional GFS. Early AIGEFS skill scores showed forecast lead-time extensions of 18 to 24 hours against the traditional Global Ensemble Forecast System, though tropical cyclone intensity forecasts remained a known weakness.

The fundamental question of whether ML models alter or merely approach the intrinsic predictability limit imposed by atmospheric chaos remains scientifically contested. A 2023 Geophysical Research Letters paper (Selz and Craig) showed that current AI models cannot simulate the butterfly effect and incorrectly suggest unlimited atmospheric predictability, because their error growth rate and structure differ fundamentally from those of high-resolution physics-based models. A 2025 arxiv study explored whether ML models could extend predictability beyond 30 days by using backpropagation to optimise initial conditions, exploiting the differentiability that physics models lack. The consensus is that ML models can approach the theoretical two-week deterministic limit more cheaply than NWP, but do not extend it; the Lorenz chaos horizon remains intact. Ensemble-based ML models such as GenCast address uncertainty quantification more credibly than deterministic counterparts, but challenges around physical conservation, out-of-distribution extreme events, and climate-timescale generalisation persist across all current architectures.


Sources

ID Title Outlet Date Significance
t1 Learning skillful medium-range global weather forecasting Science 2023-11 Peer-reviewed publication of GraphCast by Google DeepMind, demonstrating superior accuracy on 90% of verification targets versus ECMWF HRES and establishing the graph neural network approach as a benchmark for the field.
t2 GenCast: Diffusion-based ensemble forecasting for medium-range weather Nature / arXiv 2024-12 Introduces GenCast, a diffusion-model ensemble forecaster that outperforms ECMWF ENS on 97.2% of verification targets, directly addressing probabilistic forecasting and ensemble spread without MSE-driven blurring.
t3 A foundation model for the Earth system Nature 2025-05 Peer-reviewed publication of Microsoft Aurora, a 1.3-billion-parameter model trained on over one million hours of geophysical data, covering weather, air quality, ocean waves, and cyclone tracks in a single foundation-model framework.
t4 Neural general circulation models for weather and climate Nature 2024-08 Nature paper on NeuralGCM, the Google hybrid model that embeds ML parameterisations inside a classical atmospheric dynamical core, bridging medium-range weather forecasting and multi-decade climate simulation.
t5 AIFS Single 1.1.0: an update to ECMWF's machine-learned weather forecast model AIFS Geoscientific Model Development (Copernicus) 2026-05 Peer-reviewed technical description of ECMWF's operational AIFS, including physical consistency constraints, updated training schedule, and verification against IFS on the full operational forecast portfolio.
t6 ECMWF's AI forecasts become operational ECMWF 2025-02 Official ECMWF announcement that AIFS moved to operational status on 25 February 2025, running alongside the physics-based IFS with tropical cyclone track gains of up to 20% and ~1,000-fold energy reduction.
t7 AIFS Machine Learning data ECMWF 2026-05 ECMWF data page documenting the operational timeline of AIFS Single and AIFS ENS versions, including the May 2026 v2 upgrade, providing primary source evidence for operational deployment status.
t8 Another year of rapid development in machine-learning-based forecasting ECMWF Annual Report 2024-12 Official ECMWF review documenting 2024 upgrades to AIFS: resolution increase from 1 degree to 0.25 degree, forecast frequency from 2 to 4 times daily, and lead time extension to 15 days.
t9 AIFS: a new ECMWF forecasting system ECMWF Newsletter 2024-01 Describes AIFS architecture (GNN encoder-decoder with sliding window transformer processor), training data, and context of ECMWF's decision to build an in-house ML model alongside proprietary physics-based IFS.
t10 AIFS -- ECMWF's data-driven forecasting system arXiv 2024-06 Primary technical paper by the ECMWF team on AIFS architecture, training methodology on ERA5 and operational IFS analyses, and initial verification results versus IFS.
t11 An update to ECMWF's machine-learned weather forecast model AIFS arXiv / Geoscientific Model Development 2025-09 Describes AIFS 1.1.0 operational release, the introduction of physical consistency bounding layers, expanded variable set, and the Anemoi open framework now adopted by multiple ECMWF member states.
t12 GraphCast: AI model for faster and more accurate global weather forecasting Google DeepMind 2023-11 Official Google DeepMind blog post accompanying the Science paper, providing public documentation of GraphCast's architecture, ERA5 training, Hurricane Lee forecasting case study, and ECMWF collaboration.
t13 Accurate medium-range global weather forecasting with 3D neural networks Nature 2023-09 Peer-reviewed publication of Pangu-Weather by Huawei, the first AI model demonstrated to outperform ECMWF IFS on all tested weather variables, establishing the 3D Earth-Specific Transformer as a competitive architecture.
t14 FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators arXiv 2022-02 NVIDIA paper introducing FourCastNet, the first deep-learning global weather model to reach 0.25-degree resolution and comparable short-range skill to IFS, using Adaptive Fourier Neural Operators on ERA5 training data.
t15 FourCastNet: Accelerating Global High-Resolution Weather Forecasting using Adaptive Fourier Neural Operators arXiv / PASC Conference 2022-08 Extended NVIDIA FourCastNet conference paper demonstrating five orders of magnitude inference speedup versus NWP and enabling large ensemble generation for extreme event probability estimation.
t16 Introducing Aurora: The first large-scale foundation model of the atmosphere Microsoft Research 2024-06 Microsoft Research announcement of Aurora as a 1.3-billion-parameter foundation model using a 3D Swin Transformer with Perceiver encoders, establishing the pretrain-then-fine-tune paradigm for atmospheric prediction.
t17 NOAA deploys new generation of AI-driven global weather models NOAA 2025-12 Official NOAA press release announcing operational deployment of AIGFS, AIGEFS, and HGEFS in December 2025, with AIGFS completing a 16-day forecast using 0.3% of GFS computing resources.
t18 NOAA says its new AI-driven weather models improve forecast speed and accuracy CBS News 2025-12 Reports NOAA estimates of 91-99% compute reduction for AI models and a 18-24 hour extension of effective forecast skill from AIGEFS versus the traditional GEFS, with NOAA confirmation that AI models augment rather than replace physics-based systems.
t19 Can Artificial Intelligence-Based Weather Prediction Models Simulate the Butterfly Effect? Geophysical Research Letters 2023-10 Peer-reviewed study by Selz and Craig demonstrating that current AI models cannot simulate the butterfly effect, with error growth differing structurally from physics-based models, challenging claims of unlimited atmospheric predictability.
t20 Testing the Limit of Atmospheric Predictability with a Machine Learning Weather Model arXiv 2025-04 Recent study using ML backpropagation to probe the Lorenz predictability horizon, examining whether data-driven models can approach or extend the classical two-week deterministic forecast limit.
t21 Generative AI to quantify uncertainty in weather forecasting Google Research 2024-03 Google Research explanation of how generative diffusion models address the Lorenz chaos problem by sampling ensemble members from a learned probability distribution rather than propagating a single deterministic state.
t22 Evaluation of five global AI models for predicting weather in Eastern Asia and Western Pacific npj Climate and Atmospheric Science 2024-09 Independent homogeneous evaluation of Pangu-Weather, FourCastNet v2, GraphCast, FuXi, and FengWu on identical ERA5 initial conditions, including typhoon track verification, providing third-party skill comparisons across architectures.
t23 NeuralGCM harnesses AI to better simulate long-range global precipitation Google Research 2026-01 Google follow-up on NeuralGCM progress in precipitation simulation, the most difficult variable for global-scale models, illustrating how hybrid physics-ML approaches address out-of-distribution and tail-event forecasting challenges.
t24 Machine learning for numerical weather prediction ECMWF 2025 ECMWF internal review of ML history at the centre, covering the 2021 roadmap, AIFS ensemble development (AIFS ENS 1 implemented July 2025), and CRPS-optimised training versus MSE-trained deterministic models.
t25 Deep Learning and Foundation Models for Weather Prediction: A Survey arXiv 2025-01 Comprehensive taxonomy of transformer, GNN, physics-AI, and domain-specific weather models, situating all major architectures (FourCastNet, GraphCast, AIFS, NeuralGCM, Aurora, ClimaX) within a unified framework for comparative analysis.

We use analytics cookies to understand site usage and improve the service. We do not use marketing cookies.