IFS upgrade provides more skilful ensemble forecasts

On 11 May 2021, ECMWF implemented a substantial upgrade of its Integrated Forecasting System (IFS). As with almost all upgrades, this involved contributions from many teams within the Centre. IFS Cycle 47r2 includes changes to the forecast model, but not to the data assimilation system. The upgrade is neutral for the medium-range deterministic high-resolution (HRES) forecast but brings benefits to the medium- and extended-range ensemble forecasts (ENS). Cycle 47r2 is the culmination of two strands of work:

A change from double precision to single precision in HRES and ENS forecasts
An increase in the number of model levels from 91 to 137 in ENS forecasts

Forecast model

Previous versions of the IFS have used ‘double precision’, where each number is stored using 64 bits of memory. This is often more accurate than required when we consider observational errors and model approximations. Single precision, in which each number is stored with 32 bits of memory, offers the prospect of freeing up memory and, importantly, increasing processing speeds. Single precision of the IFS started as a research project in collaboration with the University of Oxford and as part of the OpenIFS effort. Similar lines of research were pursued in the COSMO (Consortium for Small-scale Modelling) model. It then became a collaborative project across many ECMWF teams. With more people working on the project, forecast skill became increasingly neutral over time with respect to double precision, up to a point where it could be incorporated into our operational forecasts. This allows computational savings to be made which can be used to achieve skill improvements. Figure 1 shows the computational changes to the ensemble forecast. Faster core processing (green circles) of single-precision data permits a 50% increase in ENS model levels from 91 to 137. Even with this increase in levels, data transferred (red arrows) between the memory on each node (yellow boxes) is reduced because it is now in single precision.

%3Cstrong%3EFIGURE%201%3C/strong%3E%20The%20computational%20change%20from%20(a)%20IFS%20Cycle%2047r1%20with%20double%20precision%20(DP)%20and%2091%20levels%20in%20the%20ensemble%20forecast%20to%20(b)%20IFS%20Cycle%2047r2%20with%20single%20precision%20(SP)%20and%20137%20levels%20in%20the%20ensemble%20forecast%20allows%20faster%20core%20processing%20(green%20circles)%20and%20reduced%20data%20transfer%20(red%20arrows)%20between%20the%20memory%20on%20each%20node%20(yellow%20boxes).

Double precision is still used throughout the data assimilation process, and some calculations in the forecast do still require double precision. The most expensive of those, such as the calculation of the associated Legendre polynomials and the finite-element integral operators of the vertical discretisation, are only done once and are not repeated during time-stepping. Hence, there is minimal impact on computational efficiency. Further detailed experimentation helped us to identify a few other calculations in parts of dynamics and physics and the stochastic physics perturbations that need to be secured with double precision. However, those also represent a very small part of the total computational load. Note also that GRIB encoding is unchanged, so archived files remain the same size.

The change to 137 levels brings us one step closer to a more seamless ensemble data assimilation and forecasting system. The need for vertical interpolation when generating ensemble initial conditions is now greatly reduced as the ensemble of data assimilations (EDA) is already run with 137 levels. The consistency with the HRES vertical resolution should also aid the evaluation process of future cycles. Technical changes to the ensemble include the calculation of singular vector perturbations with 137 levels.

%3Cstrong%3EFIGURE%202%3C/strong%3E%20HRES%20scorecard%20of%20IFS%20Cycle%2047r2%20versus%20IFS%20Cycle%2047r1,%20verified%20by%20the%20respective%20analyses%20and%20observations%20at%2000%20and%2012%20UTC,%20based%20on%20619%20forecast%20runs%20in%20the%20period%20May%202020%20to%20April%202021.%20The%20HRES%20is%20run%20at%20TCo1279%20resolution,%20corresponding%20to%20a%20horizontal%20grid%20spacing%20of%20about%209%20km.

Impact on medium- and extended-range forecasts

The goal for the implementation of single precision was neutrality in HRES scores, together with major computational cost savings. Neutrality would be demonstrated in an HRES scorecard (Figure 2) with approximately a third of the boxes being grey, a third red and a third blue, and with little more than 5% of the red and blue boxes being statistically significant at the 5% significance level (indicated by triangles). As can be seen in Figure 2, this has largely been achieved. A possible exception is a degradation (typically less than 1%) in stratospheric extratropical geopotential height scores.

The neutrality for the HRES is illustrated in Figure 3 by track forecasts of Hurricane Laura. While agreement cannot be perfect for a chaotic system, the medium-range track differences between single and double precision are much smaller than the spread of the ensemble, which represents the impacts of initial and model uncertainty. More generally, the impact of single precision on HRES tropical cyclone track and intensity scores is neutral.

%3Cstrong%3EFIGURE%203%3C/strong%3E%20Eight-day%20tracks%20of%20Hurricane%20Laura%20from%2012%20UTC%20on%2022%20August%202020%20in%20high-resolution%20deterministic%20forecasts%20with%20double%20precision%20(red)%20and%20single%20precision%20(blue)%20along%20with%20those%20from%20the%20operational%20ensemble%20at%20the%20time%20(grey).

The increase in vertical resolution from 91 to 137 levels has been introduced to all ENS forecasts in the medium to the extended range. The ENS scorecard is shown in Figure 4. The change leads to statistically significant improvements to many ENS scores of about 0.5–2% throughout most of the free atmosphere. Stratospheric temperature scores are greatly improved, typically by 5–20%. This is, among other things, due to a weaker growth of temperature biases because the ENS can better resolve gravity waves in the vertical. Figure 5 shows this improvement at day 10, but it persists into the extended range. The mean cooling difference below 600 hPa (Figure 5 bottom panel) acts to decrease the warm bias around 850 hPa. It improves tropical medium-range scores at that level by over 6%. It does also slightly increase the tropical near-surface cool bias, and this is reflected in the 2‑metre temperature scores in Figure 4, which are degraded by up to 1% by day 14. Ten-metre wind scores are also slightly degraded by 0.1–0.3%.

The extra levels mean that sharper inversions can be resolved. For example, the ensemble vertical profile product now uses 34 model levels below 700 hPa instead of the previous 22. The Cycle 47r2 test profile in Figure 6, which uses the new mapping of model levels, shows a slightly sharper thermal inversion at around 850 hPa than the Cycle 47r1 profile. Users will need to ensure that they extract the correct model levels when creating their own forecast products.

%3Cstrong%3EFIGURE%204%3C/strong%3E%20ENS%20scorecard%20of%20IFS%20Cycle%2047r2%20versus%20IFS%20Cycle%2047r1%20for%20medium-range%20forecasts%20up%20to%20forecast%20day%2015,%20verified%20by%20the%20respective%20analyses%20and%20observations%20at%2000%20UTC%20based%20on%20408%20ENS%20forecast%20runs%20in%20the%20period%20November%202019%20to%20April%202021.%20The%20ENS%20is%20run%20at%20TCo639,%20corresponding%20to%20a%20horizontal%20grid%20spacing%20of%20about%2018%20km.

Tropical cyclones show reduced intensity errors (see Figure 7a). This is largely associated with reduced bias. There is a mean reduction of about 2 hPa in central pressure in the medium range, increased spread, and improved reliability as measured by the spread-error agreement. The cycle is neutral in terms of track errors (Figure 7b). Along with the increased tropical cyclone intensity, other tropical activity is increased: calculating anomalies from the operational extended-range re‑forecasts may be advisable.

A key source of sub-seasonal predictability is the Madden–Julian Oscillation (MJO). Out to the extended range, the amplitude of the MJO is better sustained: the amplitude loss by day 15 is now about 15% rather than the previous value of about 20% (Figure 8). There is also an increase in MJO spread, improved reliability and better scores. Changes come mostly from improvements in tropical zonal winds at 200 hPa.

%3Cstrong%3EFIGURE%205%3C/strong%3E%20Zonal%20means%20of%20mean%20temperature%20errors%20at%20a%20lead%20time%20of%2010%20days%20in%20the%20ensemble%20control%20forecast%20for%20(a)%20IFS%20Cycle%2047r1,%20(b)%20IFS%20Cycle%2047r2,%20and%20(c)%20the%20difference%20between%20IFS%20Cycles%2047r2%20and%2047r1.%20More%20saturated%20colours%20indicate%20statistical%20significance%20at%20the%205%25%20level%20using%20a%20t-test%20accounting%20for%20temporal%20correlation.%20Evaluated%20over%20all%20forecasts%20between%2025%20November%202019%20and%2028%20February%202020%20as%20well%20as%2010%20May%202020%20and%207%20November%202020.

Forecast outputs

More frequent tropical cyclone track updates are now available with the inclusion of forecasts from 6 and 18 UTC initial times, alongside those of the 0 and 12 UTC forecasts. More snowfall Extreme Forecast Index (EFI) and Shift of Tails (SOT) products are now available with the inclusion of 3-, 5-, 10- and 15‑day accumulation periods, in addition to the previous 1‑day accumulations. A selection of new specialist climatological model parameters includes some which describe the characteristics of topographic features smaller than the model grid box, some which are used within radiation calculations, and the ‘Logarithm of surface roughness length for heat’.

%3Cstrong%3EFIGURE%206%3C/strong%3E%20The%20vertical%20structure%20of%20temperature%20(red)%20and%20moisture%20(dewpoint,%20green)%20in%20tephigram%20format%20in%20(a)%20IFS%20Cycle%2047r1%20in%20a%2060-hour%20forecast%20from%2021%20January%202021%2000%20UTC%20at%2020.03%C2%B0S%2090%C2%B0W%20and%20(b)%20the%20same%20forecast%20in%20IFS%20Cycle%2047r2.%20Shaded%20bands%20denote%20the%20minimum,%2025th%20and%2075th%20percentiles%20and%20maximum%20for%20temperature%20and%20dewpoint%20ENS%20distributions%20at%20each%20level.%20The%20median%20dewpoint%20value%20is%20shown%20by%20the%20solid%20line%20within%20the%20dark%20green%20shading.%20The%20other%20solid%20line%20represents%20HRES%20and%20a%20thick%20dashed%20line%20represents%20the%20Control%20forecast.

%3Cstrong%3EFIGURE%207%3C/strong%3E%20Root-mean-square%20errors%20(RMSE)%20in%20the%20ensemble%20mean%20of%20(a)%20tropical%20cyclone%20(TC)%20intensities%20along%20with%20the%20standard%20deviation%20(spread)%20among%20ensemble%20members%20and%20(b)%20tropical%20cyclone%20locations%20along%20with%20the%20standard%20deviation%20(spread)%20among%20ensemble%20members.%20Results%20are%20based%20on%20all%20TC%20basins%20for%20the%20periods%2025%20November%202019%20to%2028%20February%202020%20and%2010%20May%202020%20to%2030%20November%202020.%20The%20numbers%20at%20the%20top%20of%20the%20panels%20indicate%20the%20number%20of%20TCs%20which%20could%20be%20evaluated%20at%20each%20lead%20time.%20The%20bars%20indicate%2095%25%20confidence%20intervals.

%3Cstrong%3EFIGURE%208%3C/strong%3E%20Score%20differences%20(Cycle%2047r2%20with%20137%20levels%20minus%20Cycle%2047r2%20with%2091%20levels)%20for%20the%20bivariate%20real-time%20multivariate%20Madden%E2%80%93Julian%20Oscillation%20(RMM)%20index%20based%20on%20re-forecasts%20initialised%20at%20the%20start%20of%20each%20month%20for%20the%20period%201989%E2%80%932016.%20The%20differences%20shown%20are%20for%20(i)%20the%20fair%20version%20of%20the%20continuous%20rank%20probability%20skill%20score%20(FCRPSS),%20(ii)%20the%20bivariate%20anomaly%20correlation,%20(iii)%20the%20bivariate%20root-mean-square%20error%20(RMSE),%20(iv)%20the%20bivariate%20spread%20of%20the%20ensemble%20with%20respect%20to%20ensemble%20mean,%20and%20(v)%20the%20amplitude%20of%20the%20RMM%20index.%20Bivariate%20scores%20are%20calculated%20and%20verified%20against%20the%20RMM%20index%20constructed%20from%20the%20ERA5%20reanalysis.%20Triangles%20indicate%20increased%20(pointing%20up)%20and%20decreased%20(pointing%20down)%20values,%20which%20are%20significant%20at%20the%201%25%20level%20when%20the%20shading%20is%20more%20saturated.

Summary

The change to single precision in forecast mode for the HRES and ENS systems has freed up computing resources to be used to enhance forecast skill. In IFS Cycle 47r2, the choice has been made to use these resources to make the model levels used in the ENS match those of the EDA and HRES systems. This represents an important step within ECMWF’s ten‐year Strategy 2021–2030, which highlights “work towards a seamless integration from the ensemble of data assimilations to the ensemble forecast system”. The fact that the ENS and HRES now have the same model levels should also facilitate future cycle development. The new cycle increases ENS forecast skill by typically 0.5–2% in the free atmosphere, but by 5–20% for stratospheric temperatures at 50 hPa and by 6% in the tropical troposphere. It also intensifies tropical cyclones, thus reducing intensity errors and improving reliability, and it helps to better sustain the amplitude of the Madden–Julian Oscillation into the extended range.

News

Meteorology

Editorial

Newsletter

IFS upgrade provides more skilful ensemble forecasts

Forecast model

Impact on medium- and extended-range forecasts

Forecast outputs

Summary

Further reading