U.S. DEPARTMENT OF COMMERCE NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION ENVIRONMENTAL MODELING CENTER TECHNICAL NOTE A NEW TRANSFER FUNCTION FOR SSM/I BASED ON AN EXPANDED NEURAL NETWORK ARCHITECTURE by Vladimir M. Krasnopolsky General Sciences Corporation, Laurel, MD 20707 William H. Gemmill and Laurence C. Breaker Ocean Modeling Branch Environmental Modeling Center National Centers for Environmental Prediction Washington, D.C. 20233 NATIONAL CENTERS FOR ENVIRONMENTAL PREDICTION WASHINGTON, D.C. NOVEMBER 1996

______________________________________________________________________________

OMB Contribution No. 137

==========================================================================

LIST OF ABBREVIATIONS

BT: brightness temperature

C: degrees Celsius

CC: correlation coefficient

cal/val: calibration/validation

FXX: SSM/I instrument number XX

GHz: 10⁹ cycles/second

GSW: Goodberlet, Swift and Wilkerson (1989) - see References

H: horizontal polarization

K: degrees Kelvin

KBG: Krasnopolsky, Breaker and Gemmilll (1995) - see References

L: columnar liquid water

LIMA: European oceanic weather ship

MIKE: European oceanic weather ship

NDBC: National Data Buoy Center

NN: neural network

NRL: Naval Research Laboratory

OMBNNX: Ocean Modeling Branch Neural Network number X

OWS: oceanic weather ship

SBB: Stogryn, Butler and Bartolac (1994) - see References

SD: standard deviation

SSM/I: Special Sensor Microwave / Imager

SST: sea surface temperature

TAO: tropical atmosphere ocean

TOGA: tropical ocean global atmosphere

V: vertical polarization

V: columnar water vapor

=======================================================================

ABSTRACT

A new neural network (NN) SSM/I transfer function (OMBNN3) which retrieves wind speed (W), columnar water vapor (V), columnar liquid water (L), and SST, using only satellite data (five SSM/I brightness temperatures (BTs)) is introduced and compared with the current operational (GSW) algorithm and NN algorithms developed earlier (OMBNN1 and OMBNN2). The new NN algorithm systematically outperforms all algorithms considered for all SSM/I instruments (F8, F10, F11 and F13), under all weather conditions where retrievals are possible, and for all wind speeds. It also retrieves V and L with an accuracy close to that of cal/val (for V) and Weng and Grody (for L) algorithms, and produces low resolution SSTs with moderate accuracy. OMBNN3 demonstrates significantly better performance at higher wind speeds (and higher latitudes) than previous NN-based algorithms. It generates wind speeds up to 23 m/s for the available test data, and has a theoretical upper limit of about 32 m/s. The retrieval accuracy for OMBNN3 does not depend significantly on the satellite and/or instrument.

1. INTRODUCTION

This report contains a description of a new neural network (NN) SSM/I transfer function (OMBNN3) which retrieves wind speed (W), columnar water vapor (V), columnar liquid water (L), and SST, using only satellite data (five SSM/I brightness temperatures (BTs)). Also contained is a detailed comparison of the new algorithm with the current operational (GSW) algorithm (Goodberlet, et al., 1989) and NN algorithms developed earlier (Krasnopolsky et al., 1995a, 1995b). It is shown that our new NN algorithm outperforms all other algorithms in terms of wind speed retrievals. It also retrieves V and L with an accuracy close to that of cal/val (Alishouse, 1990) and WG (Weng and Grody, 1994) algorithms, and produces low resolution SSTs with moderate accuracy.

SSM/I wind retrieval algorithms encounter two problems: (1) atmospheric moisture and (2) high wind speeds. It was shown (Stogryn et al., 1994; Krasnopolsky et al., 1994, 1995a), that an adaptive nonlinear approach such as NNs can successfully handle the nonlinearity of the SSM/I transfer function caused by atmospheric moisture, extending the retrieval capability under cloudy atmospheric conditions. However, it is not yet clear to what extent retrievals can be extended under cloudy conditions. Although an upper limit for retrievals (0.5 mm in terms of columnar liquid water) has been suggested, it is clear that in particular situations this limit may be significantly lower (e.g., in rain). Because high moisture events are relatively rare, they are poorly represented in development data sets which makes this problem even more difficult. The new OMBNN3 algorithm which estimates two moisture criteria, V and L together with the wind speed, provides an additional control on the level of moisture and on the accuracy of wind speed retrievals.

Several issues contribute to the problems at high wind speed (see Krasnopolsky et al., 1996a): (1) saturation of BT at high wind speeds due to saturation of the area of the ocean surface covered by the persistent fraction of whitecap foam, (2) increasing noise in BT from the transient part of whitecap foam fraction at high wind speeds, and (3) very few buoy observations for higher wind speeds (W > 15 m/s). The linear GSW retrieval algorithm can, in principle, generate high wind speeds; however, validation of this algorithm using buoy observations shows that it has high scatter at high wind speeds and generates high wind speeds in some cases even when observed wind speeds are low. The first NN algorithms, SBB NN (Stogryn et al., 1994 ) and OMBNN1 (originally called SER NN in Krasnopolsky et al., 1994 ), demonstrated retrieval accuracies which were significantly better than that for GSW, however, they were not able to generate high wind speeds (higher than 16-18 m/s). An improved high wind speed NN algorithm was developed, OMBNN2 (Krasnopolsky et al., 1995b), which is capable of generating higher wind speeds (up to 20-21 m/s without a bias correction). It uses a bias correction to extend retrievals to higher wind speeds (up to 25 -26). However, this bias correction is instrument and/or satellite dependent. Here we introduce a new NN algorithm which generates wind speeds up to 23-24 m/s on available data sets without any bias correction (theoretical high wind speed limit for OMBNN3 is about 32 m/s) and whose accuracy does not depend significantly on the instrument and/or satellite.

The purpose of this report is to document the development and validation of the new OMBNN3 algorithm. This new development was possible due to (1) new matchup data, and (2) a new approach for empirical retrievals using NNs. Problems mentioned above together with some mathematical and physical ideas which led us to this new algorithm will be described in Krasnopolsky et al. (1996b). In Section 2 of this report, the architecture of the new OMBNN3 algorithm is described. Section 3 describes the data we use and preprocessing procedures. Section 4 describes the NN training process. In Section 5 we perform a detailed validation of the OMBNN3 algorithm, using different criteria and matchups for all SSM/I instruments. Section 6 summarizes our results, and the FORTRAN program which implements OMBNN3 algorithm, is available upon request ⁽¹⁾.

2. NEW ALGORITHM ARCHITECTURE

The first-generation wind speed retrieval algorithms, including the GSW algorithm (Goodberlet, et al., 1989), SBB algorithm (Stogryn et al., 1994), OMBNN1 (Krasnopolsky et al., 1994, 1995a) and OMBNN2 (Krasnopolsky et al., 1995b) followed a standard empirical approach. They retrieved only one value (e.g., wind speed) regressing it on the satellite measurements (e.g., BTs), as

W = f (BT) (1)

where BT is the brightness temperature vector and f is a regression function (NN in our particular case). Representation (1) assumes (usually by default) that the data set which is used is complete (representative) enough to eliminate dependencies of W on other physical parameters (liquid water, water vapor, SST, etc.) through averaging. This assumption and, hence, representation (1), is obviously not correct at W > 10 - 15 m/s where the buoy/SSMI matchup data are sparse, and dependencies of the wind speed on V, L, and SST are not removed through averaging. These dependencies create additional noise with respect to wind speed at higher wind speeds. In this case, (1) gives a biased estimate for the wind speed with a large scatter (large bias and standard deviation).

NNs allow us to solve this problem without including V, L and SST as additional arguments in (1), which is the standard solution, that is not suitable for an operational algorithm. The new NN algorithm (OMBNN3) can be symbolically written as,

Y = g (BT) (2)

where the output vector is Y = {W, V, L, SST}, the input vector is BT = {T19V, T19H, T22V, T37V, T37H} and g is a NN. The NN, g, which implements (2) has 5 inputs and 4 outputs, it also has one hidden layer with 12 nodes. The architecture of OMBNN3 together with those for OMBNN1 and OMBNN2 are shown in Fig. 1. Including additional outputs in the NN architecture improves the training process, decreases the number of local minima in the error function, and stabilizes and accelerates convergence in the training process.

Fig. 1 Evolution of the NN architecture from OMBNN1 to OMBNN3

The NN was trained, using the weighting scheme for high wind speed data described in Krasnopolsky et al., (1995b), where the weighting function was inversely proportional to the square root of the wind speed distribution.

3. THE MATCHUP DATA

For algorithm development and validation several databases were used:

a. A raw SSMI/buoy matchup database, created by NRL was provided to us by G. Poe (NRL). This database contains 3,144 F8/buoy matchups for the period 9/91 to 6/93, 12,013 F10/buoy matchups for the same period, and 10,195 F11/buoy matchups for the period 12/91 to 6/93. NDBC buoys and TOGA-TAO buoys have been used in creating these matchups. We carefully quality controlled the matchups extracted from the NRL database. More than 30 different criteria have been applied to both the buoy and the SSM/I data for quality control and to remove missing and noisy data. Daily locations for TOGA-TAO buoys have been corrected using information from the TAO Web Home page. As a result 2,994 F8/buoy matchups, 11,705 F10/buoy matchups, and 9,948 F11/buoy matchups were extracted. As a second step, we selected matchups where the satellite data are collocated with the buoy data in space for R_s 15 km and in time for R_t 15 min. Eventually, 1765 matchups for F8, 7495 matchups for F10, and 6129 matchups for F11, were selected.

b. The F11 matchups collected by high latitude ocean weather ships (OWS) LIMA (430 matchups) and MIKE (639 matchups) were provided to us by D. Kilham of Bristol University. After quality control and applying a 15 km x 15 min collocation filter, 547 (243 MIKE + 304 LIMA) matchups were selected.

c. For F13, we have created a new matchup database containing 1036 F13/buoy wind speed matchups with a spatial collocation uncertainty R_s 25 km, and a temporal collocation uncertainty R_t 0.5 hour. Because the buoy data in this case have been preprocessed with a roundoff error of 0.5 m/sec, an additional random error of approximately 0.3 m/sec rms has been introduced. Because we did not have access to telemetry in this case, only limited filtering was applied to those BTs. As a result, these matchups have higher noise than the matchups for F8, F10, and F11 which were extracted from the NRL database. The F13 matchup data also cover a limited time interval from 11/95 to 4/96. Thus, we only use F13 for a relative comparison of the different algorithms.

For all data, wind speeds have been adjusted to a height of 20 m. Some characteristics of the data are shown in Table 1. Clear and cloudy conditions are defined below and correspond to the retrieval flags given by Stogryn et al. (1994):

T37V - T 37H > 50 K for clear case

and

T37V - T 37H 50 K (3)

T19V < T37V

T19H 185 K

T37H 210 K for cloudy case

Table 1. Statistics for data used for algorithm development and validation (SD_w denotes standard deviation).

Number of matchups Mean W
m/s
_SDw
m/s
Max W
m/s
Max W (Clear+Cloudy)
m/s
Max W (Clear)
m/s

Total Clear cond. Cloudy cond.

F08/Buoy 1765 1437 200 7.4 3.3 26.0 21.5 18.6

F10/Buoy 7495 5953 926 7.3 3.2 25.0 21.6 20.5

F11/Buoy 6633 5274 855 7.5 3.5 26.4 25.0 20.1

F13/Buoy 1071 864 172 10.3 4.7 27.5 27.5 24.7

F11/LIMA 304 253 51 10.4 4.9 26.4 26.4 23.9

F11/MIKE 243 215 27 9.8 4.9 24.2 24.2 21.1

As mentioned above, since F13 data are not extensive, contain additional noise, and cover only several months, we have not used them for algorithm development but only for comparisons with the different algorithms. As seen in Table 1, most of the high wind speed coincide with higher levels of moisture and cloudiness. Matchup data for F8 and F10 do not have buoy wind speeds higher than 21.6 m/s even under clear + cloudy conditions. Several high wind speed events in these data contain levels of liquid water which are so high that no retrievals are possible. Only the F11 data contain high wind speed events under clear + cloudy conditions (up to 25 m/s). Thus, the F11 data provide the only choice for algorithm development. To further improve the coverage for high wind speeds, F11/buoy data have been supplemented with F11/LIMA and F11/MIKE data. These data have wind speeds up to 26.4 m/s and correspond to high latitudes (LIMA was located at 57N and MIKE at 65N). The resulting blended F11 matchup database has subsequently been separated into two statistically equivalent sets: one for training and one for testing.

4. TRAINING

As shown by Stogryn et al. (1994) and Krasnopolsky et al. (1994, 1995a), NN algorithms can successfully retrieve wind speeds under clear + cloudy conditions. Therefore, for training we used all available matchups which correspond to clear + cloudy conditions, according to Stogryn's retrieval flag (3). Statistics for clear conditions were then calculated by applying the trained NN to the clear portion of the matchup data. Because higher wind speed events were given extra weight, noise in this portion of the data could reduce the effectiveness of the training process. To minimize this possibility, we additionally removed a number of outliers at higher wind speeds, but no outliers were removed for the test data, or for any other data which were used for further validation.

Five SSM/I BTs {T19V, T19H, T22V, T37V, T37H} are used as the NN inputs. The output vector is composed of wind speed and SST taken from the buoy portion of the matchup, columnar water vapor (V) produced by the cal/val algorithm derived by Alishouse et al. (1990), and columnar liquid water (L) produced by the WG algorithm from SSM/I BTs. Standard backpropagation was used to train the NN. After training, the algorithm was applied to the F11 test data.

Table 2 shows wind speed statistics for clear conditions and Table 3 for clear + cloudy conditions for both training and test sets. Under both clear and clear + cloudy conditions, OMBNN3 algorithm gives a small bias, an acceptable standard deviation (SD), and high correlation (CC). It also accurately reproduces not only the mean buoy wind speed but also its SD, SD_w. As for the maximum wind speed, OMBNN3 underestimates high wind speeds by about 10 - 15%, which we consider acceptable for wind speeds > 22 m/s, where the noise level is highest (see discussions in the introduction here and in Krasnopolsky et al. (1996a)). The differences between the statistics for the training and the test data are mainly due to outliers which have not been removed from the test set. The difference between clear and clear + cloudy case is small but significant. The cloudy case and statistics for other NN outputs (V, L, and SST) are discussed in following sections.

Table 2. Training and test statistics for OMBNN3 algorithm under clear conditions. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 8 for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Data set Max W Mean W _SDw Bias SD CC

Training Buoy 22.8 7.13 3.27 N/A N/A N/A

OMBNN3 19.5 7.14 2.97 -0.01 1.36 0.91

Test Buoy 23.9 7.14 3.31 N/A N/A N/A

OMBNN3 20.2 7.21 3.08 -0.08 1.49 0.89

Table 3. Training and test statistics for OMBNN3 algorithm under clear+cloudy conditions. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 8 for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Data set Max W Mean W _SDw Bias SD CC

Training Buoy 26.4 7.48 3.49 N/A N/A N/A

OMBNN3 22.8 7.49 3.20 -0.004 1.41 0.91

Test Buoy 26.4 7.44 3.31 N/A N/A N/A

OMBNN3 22.8 7.66 3.34 -0.21 1.77 0.87

5. VALIDATION AND COMPARISONS

Previous empirical wind speed algorithms have, in most cases, been developed and validated, using the F8 matchup database created by GSW. Here we use a newly-created database described in Section 3 for validation for all SSM/I instruments (F8, F10, F11, and F13) and for comparison of the various wind speed algorithms. For comparison with the new OMBNN3 algorithm we have used the current operational algorithm (GSW), our original NN algorithm OMBNN1 (or SER NN), and our OMBNN2 improved for high wind speeds. Because the bias correction for OMBNN2 is instrument and/or satellite dependent (Krasnopolsky et al., 1996a), we do not include it here but use only the NN part of OMBNN2 algorithm.

5.1 Wind Speed

In this section we present statistics for the primary output of the OMBNN3 algorithm - wind speed. By including additional outputs in OMBNN3, the performance of OMBNN3 is significantly improved, especially at higher wind speeds. Statistics for the other outputs are presented in following sections.

5.1.1 Total (for all wind speeds) statistics.

Table 4 shows total statistics for clear case, Table 5 for clear + cloudy conditions and Table 6 for cloudy conditions. Tables 4 and 5 contain statistics for four satellites and four selected algorithms. For cloudy case, F8 and F13 cloudy subsets are small and for these satellites strongly overlap with the high wind speed subsets (Table 7), thus only statistics for F10 and F11 are shown for cloudy conditions in Table 6. These tables also contain buoy wind speed statistics for each data set: maximum wind speed, mean wind speed, and the SD, SD_w.

We now summarize the information contained in Tables 4 - 6:

For all weather conditions considered, and for all SSM/I instruments, the NN-based algorithms outperform the GSW algorithm based on the standard deviation (SD) as a criterion. Based on the biases, the new OMBNN3 also outperforms the GSW algorithm for most cases; otherwise it produces similar biases. Wind speeds generated by OMBNN3 have mean values and SDs which are close to those of the observed buoy wind speeds; therefore, the OMBNN3-generated wind speed distributions are properly centered and have proper width (also see Fig. 2).

Fig. 2. Wind Speed Distributions: observed buoy (solid line), GSW (dot-dashed line), OMBNN2 (dashed line), and OMBNN3 (dotted line) for F08, F10, F11 and F13 SSM/I instruments.

Table 4. Total statistics for GSW, OMBNN1, OMBNN2 and OMBNN3 algorithms for CLEAR conditions and for four different SSM/I instruments. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 8
for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Max W Mean W _SDw Bias SD CC

F08
1437
m-ups
Buoy 19.2 7.06 3.01 N/A N/A N/A

GSW 21.4 7.08 3.18 -0.02 1.77 0.84

OMBNN1 15.1 6.13 2.38 0.93 1.49 0.87

OMBNN2 16.8 6.56 2.68 0.50 1.48 0.88

OMBNN3 20.1 7.07 3.01 -0.01 1.43 0.88

F10
5953
m-ups
Buoy 20.5 6.98 2.95 N/A N/A N/A

GSW 20.8 7.20 3.22 -0.22 1.86 0.82

OMBNN1 14.7 6.23 2.46 0.75 1.63 0.84

OMBNN2 17.1 6.13 2.61 0.84 1.60 0.84

OMBNN3 20.2 7.21 2.97 -0.23 1.68 0.84

F11
5274
m-ups
Buoy+OWS 23.9 7.13 3.29 N/A N/A N/A

GSW 20.9 7.34 3.36 -0.21 1.72 0.87

OMBNN1 16.9 6.47 2.55 0.66 1.55 0.89

OMBNN2 17.9 6.32 2.72 0.81 1.56 0.88

OMBNN3 20.2 7.17 3.03 -0.04 1.43 0.90

F13
864
m-ups
Buoy 24.0 9.46 4.16 N/A N/A N/A

GSW 23.6 10.49 3.84 -1.02 2.13 0.86

OMBNN1 18.5 9.01 3.39 0.45 2.02 0.88

OMBNN2 21.1 9.35 3.51 0.11 1.96 0.88

OMBNN3 22.0 10.1 3.70 -0.61 1.87 0.89

Table 5. Total statistics for GSW, OMBNN1, OMBNN2 and OMBNN3 algorithms for CLEAR plus CLOUDY conditions and for four different SSM/I instruments. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 8 for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Max W Mean W _SDw Bias SD CC

F08
1637
m-ups
Buoy 21.5 7.31 3.17 N/A N/A N/A

GSW 25.9 7.65 3.54 -0.34 2.13 0.80

OMBNN1 17.1 6.32 2.45 0.99 1.62 0.86

OMBNN2 18.4 6.80 2.92 0.51 1.60 0.87

OMBNN3 20.6 7.41 3.09 -0.10 1.59 0.87

F10
6879
m-ups
Buoy 21.6 7.26 3.18 N/A N/A N/A

GSW 26.0 7.81 3.59 -0.55 2.15 0.80

OMBNN1 16.4 6.42 2.53 0.85 1.74 0.84

OMBNN2 19.5 6.32 2.77 0.95 1.72 0.84

OMBNN3 22.5 7.57 3.18 -0.31 1.81 0.84

F11
6129
m-ups
Buoy+OWS 26.4 7.47 3.51 N/A N/A N/A

GSW 30.3 7.99 3.77 -0.53 2.09 0.84

OMBNN1 19.4 6.70 2.65 0.76 1.70 0.88

OMBNN2 20.7 6.56 2.90 0.91 1.70 0.88

OMBNN3 22.8 7.57 3.27 -0.11 1.61 0.89

F13
1036
m-ups
Buoy 27.5 10.21 4.58 N/A N/A N/A

GSW 29.0 11.43 4.36 -1.22 2.59 0.83

OMBNN1 18.5 9.65 3.61 0.55 2.41 0.85

OMBNN2 20.5 9.55 3.49 0.66 2.40 0.86

OMBNN3 23.1 10.84 4.04 -0.63 2.26 0.87

Table 6. Total statistics for GSW, OMBNN1, OMBNN2 and OMBNN3 algorithms for CLOUDY conditions and for two different SSM/I instruments. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 8 for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Algorithm Max W Mean W _SDw Bias SD CC

F10
1068
m-ups
Buoy 21.6 8.90 3.77 N/A N/A N/A

GSW 26.0 11.91 3.48 -3.01 3.19 0.61

OMBNN1 16.4 7.61 2.58 1.28 2.47 0.76

OMBNN2 19.5 7.49 3.38 1.41 2.50 0.76

OMBNN3 22.5 9.97 3.52 -1.08 2.76 0.72

F11
895
m-ups
Buoy+OWS 25.0 8.79 3.63 N/A N/A N/A

GSW 30.3 11.97 3.42 -3.18 3.07 0.62

OMBNN1 15.8 7.79 2.49 0.99 2.39 0.76

OMBNN2 20.7 7.65 3.26 1.13 2.40 0.76

OMBNN3 22.8 9.78 3.39 -0.99 2.59 0.73

Under cloudy conditions, the biases and SDs are unacceptably high for GSW algorithm, whereas OMBNN3 algorithm yields a bias and SD which are acceptable for operational use. Wind speeds are higher on average under cloudy conditions (see Table 6) and with an rms error of less than 3 m/s yielding a relative error of 15 - 25 % of the wind speed, considered acceptable, taking into account the higher level of noise under cloudy conditions. Thus, the OMBNN3 algorithm extends the retrieval domain from clear, to clear plus cloudy, conditions yielding an increase in areal coverage of 15%. This result is particularly significant for obtaining more complete coverage of synoptic-scale weather systems such as extratropical cyclones which are typically characterized by higher levels of moisture and higher wind speeds. Since the BT retrieval flags which we use are essentially statistical, they are not highly sensitive to local conditions. In some cases this may lead to corrupted retrievals; therefore, any additional information about local conditions (e.g., such as rain/norain) may help to further improve the accuracy of retrievals under cloudy conditions.

SDs for OMBNN3 are comparable with SDs for OMBNN1 and OMBNN2 (sometimes even smaller), which indicates that our NN approach, including the previous weighting of higher wind speeds, is robust enough to prevent decreasing the accuracy of lower wind speeds because of high levels of noise at higher wind speeds. Additionally, there is a consistent improvement (from OMBNN1 to OMBNN3) in the ability of these NN algorithms to generate higher wind speeds in each case. In comparing F8, F10, and F11, the variations in SD and bias are relatively small for all algorithms (we do not include F13 here). The largest differences for all algorithms occur for F10 which may be due to the orbit ellipticity for this satellite (G. Poe, personal communication).

Fig. 3 shows scatter plots of retrieved vs. observed wind speeds for all four instruments and for GSW, OMBNN2 and OMBNN3 algorithms. OMBNN3 yields the lowest scatter both at low and high wind speeds.

Fig. 3. Scatter Plots for GSW (black diamonds), OMBNN2 (black stars) and OMBNN3 (gray crosses) algorithms for F08, F10, F11 and F13 SSM/I instruments.

5.1.2 High wind speeds statistics.

Table 7 shows statistics calculated separately for wind speeds > 15 m/s, only.

Although the sample sizes are small in each case, some conclusions can be drawn from the table. At high wind speeds, the NN-based algorithms perform significantly better than GSW based on the SD. OMBNN1 and OMBNN2 have large positive biases because they significantly underestimate the speed at high wind speeds; however, OMBNN3 demonstrates a smaller bias at high wind speeds.

5.1.3 Binned wind speed statistics

Fig. 4 shows binned bias, SD, and rms error for the difference between buoy wind speeds and algorithm-generated wind speeds vs. observed wind speed for GSW, OMBNN2 and OMBNN3 algorithms, where the bin size is 1 m/s. Fig. 4 shows that OMBNN3 is uniformly better than the other two algorithms in terms of SD and rms error (except occasionally at high wind speeds for rms error) for all instruments and all wind speeds.

Fig. 4. Binned statistics (bias, SD, and rms errors) for GSW (dashed line with diamonds), OMBNN2 (dotted line with stars) and OMBNN3 (solid with crosses) algorithms for F08, F10, F11 and F13 SSM/I instruments.

Table 7. High winds (W > 15 m/s) statistics for algorithms presented in Table 4, for CLEAR+CLOUDY conditions and for four different SSM/I instruments. Columns 3 - 5 show statistics for the wind speeds per se (SD_w denotes standard deviation), and columns 6 - 7 for the difference between buoy and algorithm-generated wind speeds. SD denotes standard deviation.

Satellite Max W Mean W _SDw Bias SD

F08
33
m-ups
Buoy 21.5 16.8 1.55 N/A N/A

GSW 21.4 16.9 2.97 -0.10 1.52

OMBNN1 15.1 12.6 1.21 4.15 1.39

OMBNN2 17.4 13.6 1.40 3.21 1.47

OMBNN3 20.6 16.4 1.76 0.42 1.40

F10
155
m-ups
Buoy 21.6 16.8 1.51 N/A N/A

GSW 26.0 17.1 2.95 -0.3 2.61

OMBNN1 15.7 12.5 1.63 4.30 1.64

OMBNN2 19.5 13.9 1.78 2.90 1.93

OMBNN3 22.5 16.4 2.62 0.40 2.16

F11
212
m-ups
Buoy+OWS 26.4 17.5 2.34 N/A N/A

GSW 30.3 17.0 2.98 0.46 2.68

OMBNN1 19.4 13.7 1.93 4.33 1.90

OMBNN2 20.7 14.0 2.23 3.53 2.25

OMBNN3 22.8 16.3 2.50 1.17 2.25

F13
154
m-ups
Buoy 27.5 18.1 2.51 N/A N/A

GSW 29.0 17.5 2.68 0.57 2.48

OMBNN1 18.5 14.6 1.68 3.45 2.28

OMBNN2 20.5 14.6 1.91 3.52 2.18

OMBNN3 23.1 16.8 2.26 1.23 2.17

Fig. 5 shows binned bias and rms error for the difference between buoy wind speed and algorithm-generated wind speeds for GSW, OMBNN2 and OMBNN3 algorithms vs. amount of columnar liquid water L, where the bin size is 0.05 mm. For all algorithms, biases and rms errors increase with L; however, OMBNN3 demonstrates better performance for all values of L.
These dependencies provide additional information regarding the accuracy of wind speed retrievals under cloudy conditions and can be used to improve the retrieval flags.

Fig. 5. Bias and RMS error vs. Columnar Liquid Water for GSW (dashed line with diamonds), OMBNN2 (dotted line with stars) and OMBNN3 (solid line with crosses) algorithms for F10 and F11 SSM/I instruments. Fig. 6 shows binned bias and rms error for the difference between buoy wind speeds and algorithm generated wind speeds for GSW, OMBNN2 and OMBNN3 algorithms vs. amount of columnar water vapor V, where the bin size is 5 mm. Bias and rms error increase sharply at V > 40 mm for GSW. This agrees with our previous experience which shows that GSW performs poorly in tropical areas. For OMBNN3, the bias is small and almost independent of V; however, rms error increases slowly at V > 50 mm. Fig. 6. Bias and RMS error vs. Columnar Water Vapor for GSW (dashed line with diamonds), OMBNN2 (dotted line with stars) and OMBNN3 (solid line with crosses) algorithms for F10 and F11 SSM/I instruments.

Fig. 7 shows binned bias and rms error for the difference between buoy wind speeds and algorithm-generated wind speeds for GSW, OMBNN2 and OMBNN3 algorithms vs. SST, where the bin size is 5C. Bias and rms error for GSW increases sharply for SST > 20C, which is related to GSW's poor performance in tropical areas. For OMBNN3, the bias does not show a significant dependence on SST.

Fig. 7. Bias and RMS error vs. SST for GSW (dashed line with diamonds), OMBNN2 (dotted line with stars) and OMBNN3 (solid line with crosses) algorithms for F10 and F11 SSM/I instruments.

Fig. 8 shows binned bias and rms error for the difference between buoy wind speeds and algorithm-generated wind speeds for GSW, OMBNN2 and OMBNN3 algorithms vs. latitude, where the bin size is 5. OMBNN1 and OMBNN2 have been developed, using F8 matchup data where high latitudes were poorly represented. As a result, these algorithms may be expected to demonstrate large (up to 1 - 2 m/s) biases at high latitudes. For OMBNN3, the bias and rms error are much smaller at high latitudes which is due to the new matchup data which include matchups at high latitudes where the moisture/wind speed relationships are expected to be different. For GSW algorithm, the latitude dependence is not smooth and there are regions where bias and/or rms error are unacceptably high.

Fig. 8. Bias and RMS error vs. Latitude for GSW (dashed line with diamonds), OMBNN2 (dotted line with stars) and OMBNN3 (solid line with crosses) algorithms for F10 and F11 SSM/I instruments.

5.2 Columnar Water Vapor.

OMBNN3 has been trained to retrieve the amount of columnar water vapor V, using SSM/I BTs. Values of V generated by the cal/val algorithm developed by Alishouse et al. (1990) were used as ground truth during the training. Therefore, OMBNN3 simulates V-retrievals produced by the cal/val algorithm. Table 8 shows retrieval statistics for columnar water vapor (max V, mean V, and standard deviation SD_V) for the cal/val and OMBNN3 algorithms. It also shows bias, SD for the difference between the cal/val and OMBNN3 and the correlation coefficient (CC) between cal/val and OMBNN3 retrievals. OMBNN3 reproduces the cal/val retrievals with an rms difference of about 1 mm and a bias of 0.3 mm.

5.3 Columnar Liquid Water.

OMBNN3 has also been trained to retrieve the amount of columnar liquid water L, using SSM/I BTs. Values of L generated by the WG algorithm developed by Weng and Grody (1994) were used as ground truth during the training. Table 9 shows retrieval statistics for columnar liquid water (max L, mean L, and standard deviationSD_L) for the WG and OMBNN3 algorithms. It also shows bias, SD for the difference between WG and the OMBNN3, and CC between WG and OMBNN3 retrievals. OMBNN3 reproduces WG retrievals with an rms difference of about 0.015 mm and a bias of 0.05 mm.

Table 8. Total statistics for columnar water vapor V (in mm) retrieved by cal/val and OMBNN3 algorithms for CLEAR + CLOUDY conditions and for F10 and F11 SSM/I instruments. Columns 3 - 5 show statistics for the columnar water vapor per se (SD_V denotes standard deviation), and columns 6 - 8 for the difference between cal/val and OMBNN3 algorithm-generated columnar water vapor. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Algorithm Max V Mean V _SDv Bias SD CC

F10
6947
m-ups
Alishouse 60.8 31.0 14.7 N/A N/A N/A

OMBNN3 59.2 30.9 15.4 0.1 1.1 1.0

F11
5673
m-ups
Alishouse 64.4 31.6 15.2 N/A N/A N/A

OMBNN3 60.1 31.4 15.7 0.3 0.9 1.0

Table 9. Total statistics for columnar liquid water L (in mm) retrieved by WG and OMBNN3 algorithms for CLEAR + CLOUDY conditions and for F10 and F11 SSM/I instruments. Columns 3 - 5 show statistics for the columnar liquid water per se (SD_L denotes standard deviation), and columns 6 - 8 for the difference between WG and OMBNN3 algorithm-generated wind speeds. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Algorithm Max L Mean L _SDL Bias SD CC

F10
6847
m-ups
WG 0.44 0.034 0.058 N/A N/A N/A

OMBNN3 0.38 0.039 0.058 0.005 0.016 0.96

F11
5673
m-ups
WG 0.38 0.034 0.058 N/A N/A N/A

OMBNN3 0.36 0.036 0.057 0.00 0.015 0.97

5.4 Sea Surface Temperature.

OMBNN3 has been trained to retrieve SSTs from SSM/I BTs, using buoy SST measurements. Table 10 shows retrieval statistics for SST (max SST, mean SST, and standard deviation _SST) based on the OMBNN3 algorithm. It also shows bias, SD and CC for OMBNN3 vs. the buoy observations. OMBNN3 reproduces buoy SSTs with an rms error of < 5 C, and bias < 0.7C. Although these retrievals have relatively low resolution (of order of SSM/I footprint size), as mentioned above, incorporation of SST as an additional output for the NN improves the overall accuracy of the training process.

Table 10. Total statistics for SST (C) retrieved by OMBNN3 vs. buoy for CLEAR + CLOUDY conditions for F10 and F11 SSM/I instruments. Columns 3 - 5 show statistics for the SST (SD_SST denotes standard deviation), and columns 6 - 8 for the difference between buoy and OMBNN3-generated SST. SD denotes standard deviation, and CC denotes correlation coefficient.

Satellite Max SST Mean SST _SDSST Bias SD CC

F10
6847
m-ups
Buoy 31.0 20.7 8.57 N/A N/A N/A

OMBNN3 31.0 20.16 8.29 0.58 4.87 0.83

F11
5673
m-ups
Buoy 31.3 20.0 8.86 N/A N/A N/A

OMBNN3 30.7 20.7 7.91 -0.68 4.52 0.86

6. CONCLUSIONS

We have presented a new NN-based OMBNN3 transfer function (i.e., retrieval algorithm) for SSM/I retrievals (including wind speed, columnar water vapor, columnar liquid water, and SST ) which demonstrates high retrieval accuracy overall, together with the ability to generate high wind speeds with acceptable accuracy. The results demonstrate that OMBNN3 systematically outperforms all algorithms considered for all SSM/I instruments, for all weather conditions where retrievals are possible, and for all wind speeds.

Previous NN-based algorithms have not performed well at high wind speeds. This problem may be due to several factors including increased buoy wind speed errors at high wind speeds, nonuniformity of the wind speed distribution itself, collocation errors in the matchups, and systematic and random errors which occur at high wind speeds due to increasing complexity of the ocean surface as an emitter of microwave radiation (e.g., whitecaps and foam) (Krasnopolsky et al., 1996a). Thus, a practical upper limit for making SSM/I wind speed retrievals low as 30 m/s in some cases (for some ocean surface states). In developing the OMBNN3 SSM/I transfer function, a new NN training strategy which includes preferential weighting at high wind speeds was introduced to compensate for the nonuniformity in the distribution of observed wind speeds. Also, the OMBNN3 algorithm was developed and tested, using a new matchup database. We created this database from F11 SSMI/buoy matchups and high latitude SSMI/OWS matchups which contained a significant number of high wind speed events. As a result, OMBNN3 demonstrates significantly better performance at higher wind speeds and at higher latitudes than previous NN-based algorithms. It generates wind speeds up to 23 m/s for the available test data, and has a theoretical upper limit of about 32 m/s (Krasnopolsky et al., 1996a). It was also validated for the F8, F10, and F13 sensors and showed significant improvement in the accuracy of the retrievals for these instruments at higher wind speeds.

The retrieval accuracy for OMBNN3 does not depend significantly on the satellite and/or instrument. The largest bias and rms error occur for F10 (not taking into consideration the noisy data from F13) which may be due to the increased orbit ellipticity for this satellite.

The NN-based algorithms demonstrate on average satisfactory retrieval capabilities under cloudy conditions. Under clear plus cloudy conditions, the biases and SDs are unacceptably high for GSW algorithm, whereas the OMBNN3 algorithm yields a bias and SD which are acceptable for operational use. Therefore, the NN-based algorithms have also expanded the retrieval domain from clear, to clear plus cloudy, conditions yielding an increase in retrieval coverage of 15%. This result is particularly significant for obtaining more complete coverage of synoptic-scale weather systems such as extratropical cyclones which are typically characterized by higher levels of moisture and higher wind speeds. In this study we have defined cloudy conditions, according to the BT retrieval flags given by Stogryn et al. (1994). These retrieval flags are based only on BTs and are statistical by definition; therefore, they do not preclude contamination from rain in all cases. If information about local conditions is available, it can be used to improve the accuracy of retrievals under cloudy conditions significantly. Because OMBNN3 generates columnar liquid water, columnar water vapor and SST simultaneously with wind speed, it offers additional opportunities for specifying local conditions and improving retrieval flags.

Regarding columnar liquid water L and columnar water vapor V, OMBNN3 was trained to simulate cal/val retrievals for V, and WG retrievals for L. As shown in Sections 5.2 and 5.3, it reproduces the cal/val and WG results with high accuracy. Although, we did not have ground truth data to validate or improve these retrieval estimates, if such data become available (e.g., radiosonde measurements), they could be used in the future during the process of training to improve the algorithm's retrieval capabilities.

Acknowledgments

We take this opportunity to thank D.B. Rao for a thorough review of this manuscript. We also thank Marie Colton of the Fleet Numerical Meteorology and Oceanography Center and Gene Poe of the Naval Research Laboratory for providing us with the new NRL database containing the raw matchups, David Kilham of Bristol University for providing us with additional matchup data for high latitudes, and Michael McPhaden and Linda Magnum for providing us with additional information concerning the TOGA-TAO buoys.

REFERENCES

Alishouse, J.C., et al., Determination of oceanic total precipitable water from the SSM/I. IEEE Trans. Geosci. Remote Sens., GE 23, 811-816, 1990

Goodberlet, M.A. , C.T. Swift, and J.C. Wilkerson, Remote sensing of ocean surface winds with the Special Sensor Microwave/Imager, J. Geophys. Res., 94, 14,547-14, 555, 1989.

Krasnopolsky, V., L.C. Breaker, and W.H. Gemmill, Development of a single "all-weather" neural network algorithm for estimating ocean surface wind from the Special Sensor Microwave Imager, Technical Note, OPC contribution No. 94, National Meteorological Center, Washington D.C., 1994.

Krasnopolsky, V., L.C. Breaker, and W.H. Gemmill, A neural network as a nonlinear transfer function model for retrieving surface wind speeds from the special sensor microwave imager, J. Geophys. Res, 100, 11,033-11,045, 1995a.

Krasnopolsky, V., W.H. Gemmill, and L.C. Breaker. Improved SSM/I wind speed retrievals at high wind speeds. Technical Note, OMB contribution No. 111, Environmental Modeling Center, Washington D.C., 1995b.

Krasnopolsky, V., W.H. Gemmill, L.C. Breaker, and V.Yu. Raizer. Improved SSM/I wind speed retrievals at high wind speeds. Submitted to Remote Sensing of Environment, September 1996a.

Krasnopolsky, V., W.H. Gemmill, and L.C. Breaker. NN SSM/I transfer function OMBNN3. To be published, 1996b

Stogryn, A.P., C.T. Butler, and T.J. Bartolac, Ocean surface wind retrievals from special sensor microwave imager data with neural networks, J. of Geophys. Res., 90, 981-984, 1994.

Weng, F., and N.G. Grody, Retrieval of cloud liquid water using the special sensor microwave imager (SSM/I). J. Geophys. Res., 99, 25,535-25,551, 1994

1. The corresponding FORTRAN file is available upon request from Vladimir Krasnopolsky, e-mail address: Vladimir.Krasnopolsky@noaa.gov, tel. 301-763-8133.

	Number of matchups			Mean W m/s	_SDw m/s	Max W m/s	Max W (Clear+Cloudy) m/s	Max W (Clear) m/s
	Total	Clear cond.	Cloudy cond.	Mean W m/s	_SDw m/s	Max W m/s	Max W (Clear+Cloudy) m/s	Max W (Clear) m/s
F08/Buoy	1765	1437	200	7.4	3.3	26.0	21.5	18.6
F10/Buoy	7495	5953	926	7.3	3.2	25.0	21.6	20.5
F11/Buoy	6633	5274	855	7.5	3.5	26.4	25.0	20.1
F13/Buoy	1071	864	172	10.3	4.7	27.5	27.5	24.7
F11/LIMA	304	253	51	10.4	4.9	26.4	26.4	23.9
F11/MIKE	243	215	27	9.8	4.9	24.2	24.2	21.1

Data set		Max W	Mean W	_SDw	Bias	SD	CC
Training	Buoy	22.8	7.13	3.27	N/A	N/A	N/A
Training	OMBNN3	19.5	7.14	2.97	-0.01	1.36	0.91
Test	Buoy	23.9	7.14	3.31	N/A	N/A	N/A
Test	OMBNN3	20.2	7.21	3.08	-0.08	1.49	0.89

Satellite		Max W	Mean W	_SDw	Bias	SD	CC
F08 1437 m-ups	Buoy	19.2	7.06	3.01	N/A	N/A	N/A
	GSW	21.4	7.08	3.18	-0.02	1.77	0.84
	OMBNN1	15.1	6.13	2.38	0.93	1.49	0.87
	OMBNN2	16.8	6.56	2.68	0.50	1.48	0.88
	OMBNN3	20.1	7.07	3.01	-0.01	1.43	0.88
F10 5953 m-ups	Buoy	20.5	6.98	2.95	N/A	N/A	N/A
	GSW	20.8	7.20	3.22	-0.22	1.86	0.82
	OMBNN1	14.7	6.23	2.46	0.75	1.63	0.84
	OMBNN2	17.1	6.13	2.61	0.84	1.60	0.84
	OMBNN3	20.2	7.21	2.97	-0.23	1.68	0.84
F11 5274 m-ups	Buoy+OWS	23.9	7.13	3.29	N/A	N/A	N/A
	GSW	20.9	7.34	3.36	-0.21	1.72	0.87
	OMBNN1	16.9	6.47	2.55	0.66	1.55	0.89
	OMBNN2	17.9	6.32	2.72	0.81	1.56	0.88
	OMBNN3	20.2	7.17	3.03	-0.04	1.43	0.90
F13 864 m-ups	Buoy	24.0	9.46	4.16	N/A	N/A	N/A
	GSW	23.6	10.49	3.84	-1.02	2.13	0.86
	OMBNN1	18.5	9.01	3.39	0.45	2.02	0.88
	OMBNN2	21.1	9.35	3.51	0.11	1.96	0.88
	OMBNN3	22.0	10.1	3.70	-0.61	1.87	0.89

Satellite	Algorithm	Max W	Mean W	_SDw	Bias	SD	CC
F10 1068 m-ups	Buoy	21.6	8.90	3.77	N/A	N/A	N/A
	GSW	26.0	11.91	3.48	-3.01	3.19	0.61
	OMBNN1	16.4	7.61	2.58	1.28	2.47	0.76
	OMBNN2	19.5	7.49	3.38	1.41	2.50	0.76
	OMBNN3	22.5	9.97	3.52	-1.08	2.76	0.72
F11 895 m-ups	Buoy+OWS	25.0	8.79	3.63	N/A	N/A	N/A
	GSW	30.3	11.97	3.42	-3.18	3.07	0.62
	OMBNN1	15.8	7.79	2.49	0.99	2.39	0.76
	OMBNN2	20.7	7.65	3.26	1.13	2.40	0.76
	OMBNN3	22.8	9.78	3.39	-0.99	2.59	0.73

Satellite	Algorithm	Max V	Mean V	_SDv	Bias	SD	CC
F10 6947 m-ups	Alishouse	60.8	31.0	14.7	N/A	N/A	N/A
F10 6947 m-ups	OMBNN3	59.2	30.9	15.4	0.1	1.1	1.0
F11 5673 m-ups	Alishouse	64.4	31.6	15.2	N/A	N/A	N/A
F11 5673 m-ups	OMBNN3	60.1	31.4	15.7	0.3	0.9	1.0

Satellite	Algorithm	Max L	Mean L	_SDL	Bias	SD	CC
F10 6847 m-ups	WG	0.44	0.034	0.058	N/A	N/A	N/A
F10 6847 m-ups	OMBNN3	0.38	0.039	0.058	0.005	0.016	0.96
F11 5673 m-ups	WG	0.38	0.034	0.058	N/A	N/A	N/A
F11 5673 m-ups	OMBNN3	0.36	0.036	0.057	0.00	0.015	0.97

Satellite		Max SST	Mean SST	_SDSST	Bias	SD	CC
F10 6847 m-ups	Buoy	31.0	20.7	8.57	N/A	N/A	N/A
F10 6847 m-ups	OMBNN3	31.0	20.16	8.29	0.58	4.87	0.83
F11 5673 m-ups	Buoy	31.3	20.0	8.86	N/A	N/A	N/A
F11 5673 m-ups	OMBNN3	30.7	20.7	7.91	-0.68	4.52	0.86