Research · Frontier Lab & Model News
Back to sweepResearch sweep · deep · 2015 – 2026
AI in Weather and Climate Prediction
AI in weather and climate prediction across the 2015 to June 2026 machine-learning era, with historical context from mid-twentieth-century numerical weather prediction and Lorenz's chaos theory: the shift from physics-based NWP and statistical post-processing (MOS) to data-driven models (GraphCast, GenCast, Pangu-Weather, FourCastNet, Aurora, NeuralGCM, ECMWF AIFS), how forecasters at ECMWF, NOAA, and the Met Office have operationalised them, measured accuracy versus the IFS, and the predictability limits imposed by chaos, the Lorenz attractor, and the butterfly effect.
- Claude Opus 4.8
- financial
- frontier
- academic
- vc
- blogs
Synthesised 2026-06-26
Narrative
The wave of deep-learning weather models that emerged from 2022 onwards represents the most consequential shift in meteorology since ensemble numerical weather prediction. The first significant contact between big-tech AI methods and operational meteorology came in 2022, when NVIDIA released FourCastNet (Pathak et al.), Huawei published Pangu-Weather (Bi et al.), and Google DeepMind circulated an early GraphCast preprint. FourCastNet used an Adaptive Fourier Neural Operator architecture on a Vision Transformer backbone, training on ERA5 at 0.25-degree resolution and achieving inference speeds five orders of magnitude faster than conventional NWP. Pangu-Weather followed with a 3D Earth-Specific Transformer, becoming the first model to outperform ECMWF's deterministic IFS on all tested variables at that resolution. Both were trained primarily on ERA5 reanalysis, which provided the decades of consistent global atmospheric state data that made supervised learning feasible at scale.
The landmark publication came in November 2023, when Remi Lam and the Google DeepMind team published GraphCast in Science. The model used a message-passing graph neural network on a multi-scale spherical mesh, training directly from ERA5 and fine-tuning on ECMWF HRES analyses, and demonstrated superior accuracy on 90 percent of 1,380 verification targets against HRES, completing a 10-day global forecast in under one minute. GenCast, described in a 2024 Nature paper, extended the architecture to probabilistic ensemble forecasting using a conditional diffusion transformer, outperforming ECMWF's 51-member ENS on 97.2 percent of verification targets and producing physically coherent ensemble members with appropriate spherical harmonic power spectra. Microsoft Research then published Aurora in Nature in May 2025, a 1.3-billion-parameter foundation model pre-trained on over one million hours of geophysical data that beat existing numerical and AI models across 91 percent of forecasting targets and extended to air quality, ocean wave, and hurricane prediction through fine-tuning. Google's NeuralGCM, published in Nature in August 2024, took a different path, embedding learned ML physics parameterisations inside a classical differentiable dynamical core, achieving accurate 1-to-15-day ensemble weather forecasts while also running stable multi-decade climate simulations, the only hybrid model to bridge weather and climate timescales credibly.
Operationalisation moved from research demonstration to production across 2024 and 2025. ECMWF launched AIFS 1.0 on 25 February 2025, running the GNN-based model alongside the physics-based IFS and reporting tropical cyclone track improvements of up to 20 percent and an approximately 1,000-fold reduction in energy use per forecast. A deterministic AIFS Single and a probabilistically trained AIFS ENS were both upgraded to version 2 by May 2026. NOAA declared three AI-driven models operational in December 2025: AIGFS (deterministic), AIGEFS (31-member ensemble), and HGEFS (a 62-member hybrid of AI and physics traces), with AIGFS completing a 16-day global forecast using only 0.3 percent of the computing resources of the traditional GFS. Early AIGEFS skill scores showed forecast lead-time extensions of 18 to 24 hours against the traditional Global Ensemble Forecast System, though tropical cyclone intensity forecasts remained a known weakness.
The fundamental question of whether ML models alter or merely approach the intrinsic predictability limit imposed by atmospheric chaos remains scientifically contested. A 2023 Geophysical Research Letters paper (Selz and Craig) showed that current AI models cannot simulate the butterfly effect and incorrectly suggest unlimited atmospheric predictability, because their error growth rate and structure differ fundamentally from those of high-resolution physics-based models. A 2025 arxiv study explored whether ML models could extend predictability beyond 30 days by using backpropagation to optimise initial conditions, exploiting the differentiability that physics models lack. The consensus is that ML models can approach the theoretical two-week deterministic limit more cheaply than NWP, but do not extend it; the Lorenz chaos horizon remains intact. Ensemble-based ML models such as GenCast address uncertainty quantification more credibly than deterministic counterparts, but challenges around physical conservation, out-of-distribution extreme events, and climate-timescale generalisation persist across all current architectures.
Sources
| ID | Title | Outlet | Date | Significance |
|---|---|---|---|---|
| t1 | Learning skillful medium-range global weather forecasting | Science | 2023-11 | Peer-reviewed publication of GraphCast by Google DeepMind, demonstrating superior accuracy on 90% of verification targets versus ECMWF HRES and establishing the graph neural network approach as a benchmark for the field. |
| t2 | GenCast: Diffusion-based ensemble forecasting for medium-range weather | Nature / arXiv | 2024-12 | Introduces GenCast, a diffusion-model ensemble forecaster that outperforms ECMWF ENS on 97.2% of verification targets, directly addressing probabilistic forecasting and ensemble spread without MSE-driven blurring. |
| t3 | A foundation model for the Earth system | Nature | 2025-05 | Peer-reviewed publication of Microsoft Aurora, a 1.3-billion-parameter model trained on over one million hours of geophysical data, covering weather, air quality, ocean waves, and cyclone tracks in a single foundation-model framework. |
| t4 | Neural general circulation models for weather and climate | Nature | 2024-08 | Nature paper on NeuralGCM, the Google hybrid model that embeds ML parameterisations inside a classical atmospheric dynamical core, bridging medium-range weather forecasting and multi-decade climate simulation. |
| t5 | AIFS Single 1.1.0: an update to ECMWF's machine-learned weather forecast model AIFS | Geoscientific Model Development (Copernicus) | 2026-05 | Peer-reviewed technical description of ECMWF's operational AIFS, including physical consistency constraints, updated training schedule, and verification against IFS on the full operational forecast portfolio. |
| t6 | ECMWF's AI forecasts become operational | ECMWF | 2025-02 | Official ECMWF announcement that AIFS moved to operational status on 25 February 2025, running alongside the physics-based IFS with tropical cyclone track gains of up to 20% and ~1,000-fold energy reduction. |
| t7 | AIFS Machine Learning data | ECMWF | 2026-05 | ECMWF data page documenting the operational timeline of AIFS Single and AIFS ENS versions, including the May 2026 v2 upgrade, providing primary source evidence for operational deployment status. |
| t8 | Another year of rapid development in machine-learning-based forecasting | ECMWF Annual Report | 2024-12 | Official ECMWF review documenting 2024 upgrades to AIFS: resolution increase from 1 degree to 0.25 degree, forecast frequency from 2 to 4 times daily, and lead time extension to 15 days. |
| t9 | AIFS: a new ECMWF forecasting system | ECMWF Newsletter | 2024-01 | Describes AIFS architecture (GNN encoder-decoder with sliding window transformer processor), training data, and context of ECMWF's decision to build an in-house ML model alongside proprietary physics-based IFS. |
| t10 | AIFS -- ECMWF's data-driven forecasting system | arXiv | 2024-06 | Primary technical paper by the ECMWF team on AIFS architecture, training methodology on ERA5 and operational IFS analyses, and initial verification results versus IFS. |
| t11 | An update to ECMWF's machine-learned weather forecast model AIFS | arXiv / Geoscientific Model Development | 2025-09 | Describes AIFS 1.1.0 operational release, the introduction of physical consistency bounding layers, expanded variable set, and the Anemoi open framework now adopted by multiple ECMWF member states. |
| t12 | GraphCast: AI model for faster and more accurate global weather forecasting | Google DeepMind | 2023-11 | Official Google DeepMind blog post accompanying the Science paper, providing public documentation of GraphCast's architecture, ERA5 training, Hurricane Lee forecasting case study, and ECMWF collaboration. |
| t13 | Accurate medium-range global weather forecasting with 3D neural networks | Nature | 2023-09 | Peer-reviewed publication of Pangu-Weather by Huawei, the first AI model demonstrated to outperform ECMWF IFS on all tested weather variables, establishing the 3D Earth-Specific Transformer as a competitive architecture. |
| t14 | FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators | arXiv | 2022-02 | NVIDIA paper introducing FourCastNet, the first deep-learning global weather model to reach 0.25-degree resolution and comparable short-range skill to IFS, using Adaptive Fourier Neural Operators on ERA5 training data. |
| t15 | FourCastNet: Accelerating Global High-Resolution Weather Forecasting using Adaptive Fourier Neural Operators | arXiv / PASC Conference | 2022-08 | Extended NVIDIA FourCastNet conference paper demonstrating five orders of magnitude inference speedup versus NWP and enabling large ensemble generation for extreme event probability estimation. |
| t16 | Introducing Aurora: The first large-scale foundation model of the atmosphere | Microsoft Research | 2024-06 | Microsoft Research announcement of Aurora as a 1.3-billion-parameter foundation model using a 3D Swin Transformer with Perceiver encoders, establishing the pretrain-then-fine-tune paradigm for atmospheric prediction. |
| t17 | NOAA deploys new generation of AI-driven global weather models | NOAA | 2025-12 | Official NOAA press release announcing operational deployment of AIGFS, AIGEFS, and HGEFS in December 2025, with AIGFS completing a 16-day forecast using 0.3% of GFS computing resources. |
| t18 | NOAA says its new AI-driven weather models improve forecast speed and accuracy | CBS News | 2025-12 | Reports NOAA estimates of 91-99% compute reduction for AI models and a 18-24 hour extension of effective forecast skill from AIGEFS versus the traditional GEFS, with NOAA confirmation that AI models augment rather than replace physics-based systems. |
| t19 | Can Artificial Intelligence-Based Weather Prediction Models Simulate the Butterfly Effect? | Geophysical Research Letters | 2023-10 | Peer-reviewed study by Selz and Craig demonstrating that current AI models cannot simulate the butterfly effect, with error growth differing structurally from physics-based models, challenging claims of unlimited atmospheric predictability. |
| t20 | Testing the Limit of Atmospheric Predictability with a Machine Learning Weather Model | arXiv | 2025-04 | Recent study using ML backpropagation to probe the Lorenz predictability horizon, examining whether data-driven models can approach or extend the classical two-week deterministic forecast limit. |
| t21 | Generative AI to quantify uncertainty in weather forecasting | Google Research | 2024-03 | Google Research explanation of how generative diffusion models address the Lorenz chaos problem by sampling ensemble members from a learned probability distribution rather than propagating a single deterministic state. |
| t22 | Evaluation of five global AI models for predicting weather in Eastern Asia and Western Pacific | npj Climate and Atmospheric Science | 2024-09 | Independent homogeneous evaluation of Pangu-Weather, FourCastNet v2, GraphCast, FuXi, and FengWu on identical ERA5 initial conditions, including typhoon track verification, providing third-party skill comparisons across architectures. |
| t23 | NeuralGCM harnesses AI to better simulate long-range global precipitation | Google Research | 2026-01 | Google follow-up on NeuralGCM progress in precipitation simulation, the most difficult variable for global-scale models, illustrating how hybrid physics-ML approaches address out-of-distribution and tail-event forecasting challenges. |
| t24 | Machine learning for numerical weather prediction | ECMWF | 2025 | ECMWF internal review of ML history at the centre, covering the 2021 roadmap, AIFS ensemble development (AIFS ENS 1 implemented July 2025), and CRPS-optimised training versus MSE-trained deterministic models. |
| t25 | Deep Learning and Foundation Models for Weather Prediction: A Survey | arXiv | 2025-01 | Comprehensive taxonomy of transformer, GNN, physics-AI, and domain-specific weather models, situating all major architectures (FourCastNet, GraphCast, AIFS, NeuralGCM, Aurora, ClimaX) within a unified framework for comparative analysis. |