**Figure:** *A comparison of correlation maps between* $\Phi _{500hPa}(60^\circ N,5^\circ E)$ *and* $\Phi _{500hPa}(x,y)$ , based on EOFs (a) and conventional calculations (b). The confidence limit for panel b was estimated using an MC-test on the point of maximum correlation. Note that the confidence limits are generally higher for the conventional method, whereas only the lowest correlations in panel a are insignificant. [stats_uib_8_4.m]

By computing the EOFs and retaining only the first few leading EOFs ( $n_{eofs} \ll n_t$ ), one can compress the data from $n_x \times n_y \times n_t$ numbers to $n_x \times n_y \times n_{eofs} + (n_t+1) \times n_{eofs}$ numbers with minimal loss of information; the truncation also filters away much of the small-scale noise. If a $\Phi _{500hPa}$ record, such as the one in Fig. 8.4, is stored as 100 time slices on a $50 \times 30$ grid and we retain the 10 leading EOFs, then the data size is reduced from 150,000 numbers to just 16,010 numbers while still accounting for about 90% of the variance in $\Phi _{500hPa}$ .
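The storage count above can be checked with a few lines of Python. This is only a sketch: the function name `eof_storage` is made up for illustration, and reading the "+1" in $(n_t+1)$ as one eigenvalue stored per EOF is an assumption, since the text does not say what the extra number is.

```python
def eof_storage(n_x, n_y, n_t, n_eofs):
    """Return (raw, compressed) counts of stored numbers (hypothetical helper)."""
    raw = n_x * n_y * n_t
    # n_eofs spatial patterns of n_x * n_y values each, plus for every EOF
    # a PC of n_t values and one extra number (assumed to be its eigenvalue).
    compressed = n_x * n_y * n_eofs + (n_t + 1) * n_eofs
    return raw, compressed


raw, compressed = eof_storage(n_x=50, n_y=30, n_t=100, n_eofs=10)
print(raw, compressed)  # 150000 16010
```

The spatial patterns dominate the compressed size, so the saving grows with the length of the record rather than with the size of the grid.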

In addition to reducing the data size, the EOFs can save computation time, since the field is then described by only $n_{eofs}$ independent time series (the PCs). For instance, correlation analysis can be applied to the $n_{eofs}$ PCs, weighted by the EOF patterns and their variance, instead of to the time series from all $n_x \times n_y$ grid points. The subscripts used below are the location vector $\vec{r}$ , the EOF index $k$ and the time index $t$ ; a repeated index $k$ implies summation over the retained EOFs.

$\begin{displaymath}r({\bf X}_{\vec{r},t},y_t)= \frac{\sum_t \left[ {\bf X}'_{\vec{r},t} y'_t \right]}{\sqrt{\sum_t ({\bf X'}_{\vec{r},t})^2 \times \sum_t (y'_t)^2}} \end{displaymath}$

(8.17)

$\begin{displaymath}r({\bf X}_{\vec{r},t},y_t)= \frac{\sum_t {\bf W}_k \left[ {\bf U}_{\vec{r},k} {\bf V}_{k,t}^T y'_t \right]}{\sqrt{\sum_t ({\bf U}_{\vec{r},k} {\bf W}_k {\bf V}_{k,t}^T)^2 \times \sum_t (y'_t)^2}} \end{displaymath}$

$\begin{displaymath}r({\bf X}_{\vec{r},t},y_t)= \frac{{\bf W}_k {\bf U}_{\vec{r},k} \sum_t \left[ {\bf V}_{k,t}^T y'_t \right]}{\sqrt{{\bf U}_{\vec{r},k}^2 {\bf W}_k^2\sum_t ({\bf V}_{k,t}^2) \times \sum_t (y'_t)^2}} \end{displaymath}$

(8.18)
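The equivalence of the conventional and the EOF-based correlation can be checked numerically. The sketch below uses NumPy on synthetic data; the field, the reference series and the helper `corr_from_eofs` are all made up for illustration and are not taken from stats_uib_8_4.m. It assumes that the sum over $k$ is implied and that the PCs are mutually orthogonal, which is what allows the cross terms in the denominator to vanish.

```python
import numpy as np

rng = np.random.default_rng(0)
n_r, n_t = 40, 200                       # flattened grid points, time steps

# Synthetic field: two large-scale modes plus weak noise (illustrative only).
t = np.arange(n_t)
modes = np.vstack([np.sin(2 * np.pi * t / 50), np.cos(2 * np.pi * t / 23)])
X = rng.standard_normal((n_r, 2)) @ modes + 0.1 * rng.standard_normal((n_r, n_t))
y = X[5] + 0.5 * rng.standard_normal(n_t)    # reference time series

Xa = X - X.mean(axis=1, keepdims=True)       # anomalies X'
ya = y - y.mean()                            # anomalies y'

# EOF decomposition X' = U W V^T (rows of Vt are the orthonormal PCs).
U, w, Vt = np.linalg.svd(Xa, full_matrices=False)

# Conventional correlation at every grid point.
r_conv = (Xa @ ya) / np.sqrt((Xa**2).sum(axis=1) * (ya**2).sum())


def corr_from_eofs(U, w, Vt, ya, n_eofs):
    """Correlation map from the n_eofs leading EOF products (hypothetical helper)."""
    Uk, wk, Vk = U[:, :n_eofs], w[:n_eofs], Vt[:n_eofs]
    c = Vk @ ya                                      # sum_t V_{k,t} y'_t, one per EOF
    num = Uk @ (wk * c)                              # sum_k W_k U_{r,k} c_k
    var = (Uk**2) @ (wk**2 * (Vk**2).sum(axis=1))    # sum_k U^2 W^2 sum_t V^2
    return num / np.sqrt(var * (ya**2).sum())


r_full = corr_from_eofs(U, w, Vt, ya, n_eofs=len(w))   # all EOFs retained
r_trunc = corr_from_eofs(U, w, Vt, ya, n_eofs=2)       # two leading EOFs only
print(np.max(np.abs(r_full - r_conv)))
print(np.max(np.abs(r_trunc - r_conv)))
```

With all EOFs retained the two maps agree to rounding error. With a truncated set, the result is the correlation between $y_t$ and the EOF-filtered field, so it differs from the conventional map wherever the discarded EOFs carry a noticeable share of the local variance.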

The EOFs also allow better confidence limit estimates, as a confidence interval can be computed for each of the $n_{eofs}$ PCs. The geographically distributed confidence levels may then be computed with a spatial weighting similar to that used for the correlations themselves. For MC-tests, using EOF products greatly reduces the computation time.
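The time saving in an MC-test can be illustrated as follows. This is a sketch under stated assumptions, not the test used for the figure: the surrogate series are random permutations of $y_t$ (which ignores any autocorrelation), the test statistic is the largest absolute correlation on the map, and all names and sizes are hypothetical. The point is that each surrogate only requires projecting onto the $n_{eofs}$ PCs, rather than correlating with every grid-point series.

```python
import numpy as np

rng = np.random.default_rng(2)
n_r, n_t, n_eofs, n_mc = 60, 150, 4, 500

# Synthetic, unrelated field and series (illustrative only).
X = rng.standard_normal((n_r, n_t))
y = rng.standard_normal(n_t)
Xa = X - X.mean(axis=1, keepdims=True)
ya = y - y.mean()

U, w, Vt = np.linalg.svd(Xa, full_matrices=False)
Uk, wk, Vk = U[:, :n_eofs], w[:n_eofs], Vt[:n_eofs]

# Everything that does not depend on the ordering of y' is computed once.
UW = Uk * wk                                           # shape (n_r, n_eofs)
norm = np.sqrt((UW**2).sum(axis=1) * (ya**2).sum())    # PCs have unit norm


def corr_map(y_anom):
    """EOF-based correlation map; costs about n_eofs * (n_t + n_r) multiplications."""
    return UW @ (Vk @ y_anom) / norm


r_obs = corr_map(ya)

# Null distribution of the largest |r| on the map, from shuffled copies of y'.
max_r = np.array([np.abs(corr_map(rng.permutation(ya))).max()
                  for _ in range(n_mc)])
r_crit = np.quantile(max_r, 0.95)                      # 95% confidence limit
print(r_crit, np.abs(r_obs).max())
```

A permutation leaves $\sum_t (y'_t)^2$ unchanged, which is why `norm` can be reused for every surrogate.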

Maps of linear trends can be estimated in a simpler fashion: ${\bf X}_{\mbox{\tiny trend}} = {\bf U}_{\vec{r},k}{\bf W}_k \mbox{\bf trend}_{k,t}^T$ . The EOF products are also often used in regression analysis and CCA.
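A minimal NumPy sketch of the trend-map shortcut follows. It interprets $\mbox{\bf trend}_{k,t}$ as the least-squares linear trend fitted to PC $k$ and maps the slopes rather than the full trend lines; both choices are assumptions, as are the synthetic data and the helper `slope`.

```python
import numpy as np

rng = np.random.default_rng(1)
n_r, n_t = 30, 120
t = np.arange(n_t, dtype=float)

# Synthetic field: a spatially varying linear trend plus noise (illustrative only).
X = np.outer(rng.standard_normal(n_r), 0.02 * t) + rng.standard_normal((n_r, n_t))
Xa = X - X.mean(axis=1, keepdims=True)

U, w, Vt = np.linalg.svd(Xa, full_matrices=False)


def slope(series, t):
    """Least-squares slope per time step for each row of `series`."""
    ta = t - t.mean()
    return (series @ ta) / (ta @ ta)


# Conventional approach: one regression per grid point.
trend_conv = slope(Xa, t)

# EOF approach: one regression per retained PC, then weight by W_k and U_{r,k}.
n_eofs = 5
trend_eof = U[:, :n_eofs] @ (w[:n_eofs] * slope(Vt[:n_eofs], t))

# With all EOFs retained the two maps are identical, because the fit is linear.
trend_all = U @ (w * slope(Vt, t))
print(np.max(np.abs(trend_all - trend_conv)))
print(np.max(np.abs(trend_eof - trend_conv)))
```

Only `n_eofs` regressions are needed instead of one per grid point, and the truncated map differs from the conventional one only through the trend contained in the discarded EOFs.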

Example of use: Spatial correlation maps based on EOF products