Meteorological Model Basics

How Does a Meteorological Model Work ?
Accuracy and Resolution
Dr. John W. (Jack) Glendening, Meteorologist
Updated: Dec 31, 2007

This webpage gives a basic description of how meteorological models generate their predictions, with an emphasis on model errors and the influence of model resolution. It is intended to provide a framework for discussions of weather prediction models and their errors. Additional information on errors of particular importance to BLIPMAP predictions is given in the Model Notes webpage.

The Grid

Numerical meteorological models produce predictions by solving equations at grid points created by dividing the atmosphere into cells and forecasting the average of each predicted variable over that cell. Generally cells are regularly spaced in the horizontal but vary in the vertical, with vertical spacings becoming smaller near the surface since atmospheric conditions change more rapidly with height there. Similarly, vertical cell spacings in general are much smaller than horizontal spacings since conditions change much more rapidly in the vertical than in the horizontal. (See the Grid Cell Diagram)

The Equations of Motion

A meterological model produces forecasts by solving the "equations of motion" for a fluid, which are derived from fundamental physical laws: conservation of mass (both of air and moisture), conservation of momentum (Newton's laws), and conservation of thermal energy (thermodynamics). These equations predict the change in variables such as temperature or velocity resulting from the physical phenomena which affect them. These equations strongly interact with each other - for example, a change in thermal energy (affecting temperature) will change atmospheric density (affecting mass), in turn changing pressure differences (affecting velocity). These feedback interactions tend to counter a forced change, e.g. to oppose an atmospheric heating by introducing a cooling effect. If forcing were to remain constant then eventually the feedbacks would produce an "equilibrium" in which changes with time would become increasingly smaller - but the forcing of the atmosphere is never constant, as solar heating is constantly changing, so the atmosphere is in a state of constantly adjusting "quasi-equilibrium".

Solution Methodology

What is being solved is a set of "Partial Differential Equations", where the PDEs predict the time-change of a variable based upon conditions in each central cell and its horizontal/vertical neighbors. Those who have taken calculus will remember that differential equations only apply to "infinitesimally" small spacings so we are assuming that a "finite difference" over the finite grid spacing is a reasonable approximation to a differential. The equations are solved in a "marching-forward-in-time" fashion, starting from the assumed 3D initial conditions (which are usually based on available observations at that time), and predicting how the variables change at each time step at each grid point to give forecasts at a given time. PDEs are solved for the "prognostic variables" of temperature, humidity, the three components of velocity, condensed moisture (clouds), etc. The model also calculates "diagnostic" variables which depend solely upon conditions at the present time (so they depend upon the prognostic variables but do not have to be solved in a "time-marching" fashion). Since all the equations are inter-related, an error in any one will to some degree affect the others. These PDEs are not empirical equations, since they are fundamentally derived from well-established physical laws of conservation of energy and momentum and mass.

Model Initialization

Since the equations of motion predict changes there must be an initially prescribed 3D atmosphere for them to start from. This is obtained from atmospheric observations (baloon, satellite, and aircraft data) at a limited number of points which are then interpolated to complete the 3D grid. There will of course be errors in the interpolated initial state (it is not a solution of the equations of motion), so initially the equations predict relatively large changes as they evolve towards the equation of motion quasi-equilibrium. Forecasts during this "spinup" period are subject to large errors and should be taken with a grain of salt, since they reflect the initial "interpolated" guess. Spinup inaccuracies decrease with time - within the BL, the minimum-to-maximum spinup time is roughly 1-to-4 hours, the longer time being for larger grid spacings and larger initial errors (spinup times are longer above the BL).

Overall Prediction Accuracy

Forecast accuracy of the many parameters predicted by a meteorological model can be generally ordered, from most accurate to least accurate, as: (1) Winds, (2) Thermal parameters, (3) Moisture parameters, (4) Cloud parameters, (5) Rainfall.

Model Errors

The accuracy of the a model depends on many factors, which can be roughly grouped as:

Time Step Errors:
Since the equations represent the change over an "infinitesimal" time but the actual time step is finite, errors will occur with larger time steps creating larger errors. The actual time step used is, like much else in numerical modeling, a compromise between the ideal and the practical.
Initial Condition Errors:
The model requires "Initial Conditions" to start from, needing "knowledge" of the full three-dimensional atmosphere i.e. all prognostic variables at every grid point! This is obtained by combining available observations - notably, but not limited to, observed soundings - with previous model predictions to produce an initial "analysis". This analysis must be done carefully while respecting the dynamic constraints of the equations, i.e. is not a matter of simple interpolation of observations, since a large error introduced between observing points would cause large and unrealistic changes immediately after startup as the equations tried to adjust to an unrealistic initial imbalance. Correct initial conditions can be important to model predictions, but are less important at later times and closer to the surface. Initial conditions are most important for predicting the progress of disturbances above the BL, such as frontal movement.
Lateral Boundary Condition Errors:
A 20 km resolution model cannot cover the entire world, so such a "fine-mesh" model is "nested" inside a "coarse-mesh" model, having larger-spacing and larger domain, which provides "neighboring" cells along the lateral (outside) boundary of the fine-mesh model. The coarse-grid model typically covers the entire globe (though sometimes there will be multiple "nestings" to get to the global model). Errors in the lateral boundary conditions will, over time, reach further into the fine-mesh grid so lateral BC errors can be reduced by increasing the size of the fine-mesh domain - but of course that takes additional grid points and computational time and given finite constraints on each the choice of a lateral boundary is always a trade off. The model domain used by the RAP model can be viewed here. By comparison, the NAM grid is much larger, reaching from the North Pole to the Equator!
Surface Boundary Condition Errors:
The model must know the type of surface the atmosphere is interacting with since its roughness, vegetation type, soil type and water content, etc. all affect model predictions. This is the one true boundary the atmosphere has and creates the "Boundary Layer" (BL) so of course the predicted BL is especially affected by surface boundary errors! A major difficulty is that "average" conditions for all of surface quantities must be known for each of the surface model grid cells while in reality these quantities vary over much smaller scales so trying to, first, know what the actual surface consists of and, secondly, create a meaningful average for each grid cell are both subject to much error. Additionally, certain variables such as vegetation and soil moisture vary with time. Further, trying to predict soil moisture in detail would require calculations as intensive as that for the atmosphere itself, so greatly simplified equations are used instead.
Inadequate Spatial Resolution Errors:
The use of differential equations assumes/requires that a feature is "resolved" by the grid resolution. If the effect of a lake or mountain ridge or whatever upon the atmosphere is to be predicted, the model must adequately "know" about the existence of that lake or ridge or whatever. Realistic resolution requires a minimum of four grid points inside such a feature. Modelers keep trying, within the bounds of available computer power, to use finer and finer grid resolutions since with present grid spacings there is still much that is not being resolved, particularly if the forcing is controlled by topography. There are also many atmospheric features such as convergence-created upward motions which are smaller than can be resolved with present model grids and so are not well predicted. This error can be thought of as error that occurs because the PDE equations assume changes over an infinitesimally small distance whereas the model can only estimate changes over a finite distance - if in reality changes occurs over a smaller distance than the grid can effectively resolve, the finite-difference equations then cannnot accurately represent the actual differences existing in the atmosphere.
Model "Noise" Errors:
One result of the limited resolution resulting from use of a finite-spacing grid is that model "noise" develops, particularly at the smallest resolveable scales. Physically this is because energy in the atmosphere tends to be generated at relatively large scales and then break down into successively smaller eddies. The "differential" PDEs can and do simulate this behavior but only when resolution of atmospheric eddies is adequate, which is not true when the eddy size becomes comparable to the grid spacing - so in the model energy breaks down until it reaches the smallest model scale (i.e. the smallest resolvable eddy size) and is then trapped there. As a result, model predictions often vary at the smallest model scale in a saw-tooth manner, e.g. a forecast variable will be too large at one cell, then too small at the neighboring cell, and then again too large at the next cell, etc. This is typically reduced by "numerical filtering", but too much filtering also throws away part of the true signal so a compromise is required (as is typical in numerical modeling!). Often one can see evidence of this model noise not being fully controlled when a "bullseye" pattern appears in a BLIPMAP, with the value at one grid point being much larger/smaller than its surrounding neighbors. Decreasing model grid spacing helps to reduce model noise, but it will always exist to some degree.
Model Topography Errors:
Generally model conditions represent "average" conditions over the extent of its grid cell, but the effect of surface elevation on the model is somewhat different since the topography used by a model is typically smoothed to a coarser resolution than that of the model grid spacing and can differ significantly from the actual topography, particularly when the actual surface elevation changes abruptly. The reason for this degradation is the model noise problem discussed above: if the topography were to be fully resolved then much noise would be generated at the very smallest scale, aggravating the normal model noise build-up problem at that scale, so to avoid this the very smallest scales are filtered out of the topography. Note that this means that 8 model points are now required to resolve a ridge, so resolution of surface elevation influences requires a finer grid spacing than for many other atmospheric influences. Another terrain factor is that models often use an "envelope topography" to produce better velocity predictions - but that can result in worsened BL predictions, such as for BL top. The idea behind an "envelope topography" is that in reality velocities on either side of a mountain ridge, the Sierras being a good example, are separated by a relatively high ridge; but if one uses elevations averaged over 20 km (or larger!) cells then the ridge largely disappears, so flows at a level which are not interacting in reality will be interacting in the model. Therefore a weighting is employed which pushs model topography toward the higher elevations that actually exist over each grid cell rather than to a simple average. However, other parameters such as surface temperature and the BL driven by it do depend upon the average elevation over a grid cell, so use of an envelope topography makes those predictions less accurate! Sometimes a compromise solution is attempted - for example, the RAP model has two topographies, a "normal" (envelope) topography used for most calculations and a "minimum" topography used for surface temperature adjustments.
Additional discussion of differences between the smoothed model topography and the real topography, with two illustrations, can be found on the Grid Orientation webpage.
Parameterization Errors:
"Parameterization" refers to model terms which cannot be obtained from fundamental principles so instead are computed from approximated equations. For example, a model which has a 20 km resolution cannot resolve many small clouds, yet those clouds affect the atmosphere through effects such as release of heat aloft in condensation, reduction of solar radiation reaching the surface, etc. Since these effects are important but can't be predicted explicitly by model equations/resolution, they must be "parameterized". Parametrization is the "voodoo" part of numerical modeling since it tries to predict complex processes using necessarily over-simplified assumptions. Many cloud forecast terms fall into this category. [In one sense this might be described as an "inadequate resolution" error, but here the resolution increase required to obtain fundamnetally correct equations is so large that it cannot be achieved in the foreseeable future, if ever, so is a unique problem. An example familiar to soaring pilots is the formation of small puffy cumulus clouds - these start out is small wisps of visible vapor and even the cell resolution needed to resolve such wisps is well beyond anything presently possible but to predict this truly correctly one would have to resolve down to the cloud droplet scale! A non-cloud example which is theoretically more do-able but still impossible in practice is predicting the upward transfer of heat in the BL - in reality this occurs though eddies such as thermals and downdrafts which would need to be resolved, but that would take a grid spacing of less than 50 m so instead it is parameterized as a grid-average vertical transfer.]
End User Errors:
I cannot resist adding this after noting that some users misuse the predictions by incorrectly applying the model-produced forecasts, apparently due to a lack of appropriate knowledge.

Accuracy and Model Resolution

Many of the errors listed above are reduced when the spatial resolution of a model is increased, notably those errors resulting from surface averaging effects, from lack of resolved topography, and from the model's finite-difference equations lacking proper resolution of actual atmospheric differences. Therefore meteorologists are constantly striving to obtain more powerful computers so that they can use ever smaller grid spacings (in both the horizontal and vertical) to reduce these errors and provide better predictions. Of course, there will continue to be errors due to other factors, such as initial condition errors, which cannot be reduced by using a finer resolution - but it is best to at least eliminate the errors that one can control. BL predictions are particularly affected by grid resolution errors, and often much less affected by other errors, so better model resolution is especially valuable for soaring predictions. Some parameterization errors are reduced as resolution is improved, but other are not so parameterziation error will often continue to be a significant source of BL prediction error, particularly for cloud forecasts.
I should point out that the computer power needed for improved model resolution increases as the fourth power of the resolution increase, e.g. halving the grid spacing requires the computer power to be 16 times larger! So doubling a model's resolution is a big accomplishment! [This is because in addition to a factor of 8 increase corresponding to the increased number of horizontal and vertical grid points, since the finer resolution should occur in all directions, there is an additional factor of two because the time step must now be halved to accommodate the smaller grid size.]