Please cite our work if you found it useful!
Li, Z., Wen, Y., Schreier, M., Behrangi, A., Hong, Y., & Lambrigtsen, B. (2020). Advancing satellite precipitation retrievals with data driven approaches: is black box model explainable?. Earth and Space Science, 7, e2020EA001423. https://doi.org/10.1029/2020EA001423
In this study, we take two steps towards passive microwave (AMSU) precipitation retrival: first, segment satellite imagery into rain and no-rain classes (binary); second, apply second-round ML with rainy pixels.
Fig.1 Schematic overview of the pipeline processing AMSU data.
We crop AMSU satellite swath which is approximatly 45km at nadir into (64,64) sub-imageries randomly.
AMSU-A channels 1,2, 15 and AMSU-B channels 1, 2, 3, 4, 5 are selected as inputs because the low frequency channels (AMSU-A) are targeting water vapor in liquid phase, and higher frequencies (AMSU-B) are targeting mix-phase water.
As for the target, we mapped NSSL MRMS (multi-radar multi-sensor) ground based radar QPE to match the same spatiotemporary feature as AMSU flight.
In the imagery segmentation, we performed LinkNet with pretrained model that trained by imagenet.
Model description | inputs | learning type | epoches | loss | dice | threshold | name |
---|---|---|---|---|---|---|---|
LinkNet+ResNet18 | amsu-a(1,2,3,4)+amsu-b(5 channels) | unfreeze | 100 | 0.68 | 0.95 | -4.464768/7 | segmentation-class1 |
LinkNet+ResNet18 | amsu-b (4channels) | unfreeze | 100 | 0.60 | 0.82 | 0.75 | Segmentation-4channels |
Fig.2 LinkNet Architecture
Fig.3 Loss evolution with epoches
Fig.4 Dice evolution with epoches
Results
Fig.5 LinkNet-1class-8channels-benchmark results
Fig.6 Classification report for LinkNet
Fig.7 Classification report for Benchmark
Fig.8 PR-AUC curve to determine the best threshold
Fig.9 objective surface plot.
We used the same structure for precipitation type segmentation. However, the results are not satisfactory especially for convective and snow case.
Fig. 10 precipitation type segmentation results
Attempt to use Random forest Regressor to quantify rain rate with grid search. The validation is based on KFolds, specifically 5 folds to validate data. It is running in 48 cores server, and it costs 60 hours to complete.
# Grid search for hyperparameter tuning
rf= RandomForestRegressor()
hyperparam_grid= {
'n_estimators': np.arange(10,500,20),
'max_depth': np.arange(10,50,5),
'warm_start':[True, False]
}
gridsearch= GridSearchCV(rf, hyperparam_grid, scoring='neg_mean_squared_error', verbose=2, n_jobs=-1)
Regressor | Parameters | median RMSE (benchmark) | R^2 (train/test) | model name |
---|---|---|---|---|
Random Forest | depth-10,estimators-800 | 1.05(12.09) | 0.21/0.13 | model-1 |
Random Forest | depth-9, estimators-600 | - | 0.19/0.36 | modeol-2 |
Adaboost model-2 | depth-9, estimators=600 | - | - | model-3 |
Adaboost model-1 | depth-10, estimators=800 | - | - | model-4 |
Fig.11 Spatial rainfall map for benchmark and model-1
Fig.12 RMSE results for benchmark and model-1
Fig.13 Residual plots of training and test results for model-2
feature Importance
Model | 23.8 GHz | 31.4 GHz | 89.0 GHz | 89+0.9 GHz | 150+0.9 GHz | 183.31+0.1 GHz | 183.31+0.3 GHz | 183.31+7.0 GHz |
---|---|---|---|---|---|---|---|---|
model-1 | 0.1033 | 0.0654 | 0.0774 | 0.2109 | 0.3645 | 0.0319 | 0.0843 | 0.0622 |
model-2 | 0.0473 | 0.0313 | 0.0542 | 0.1983 | 0.5371 | 0.0173 | 0.0499 | 0.0644 |
As in Fig.13 and Fig.11, they both show our trained forest underestimates the true rainfall value. In this chapter, we will investigate the reason.
Fig. 14 variance of brightless temperature as rain increases
Because of the imbalanced data, meaning light rain occupies large portion of the rainy cases, thus we cluster the rain rates data by 100 instances as follows:
Fig. 15 variance of brightless temperature as rain increases
As rain rate increases, the variance gets increases until it meets ard 70 mm/hour and then decreases.
Because the mechanism of precipitation formation, in generary, straitiform rainfall has mild rain rates and also mild emissivity from surface. However, convective rainfall normally associates with large rain rates, and more reduction in brightness temperature. On the other hand, snowfall rate is way smaller than rainfall rate. And the emissivity of snow is smaller as well. It is thus significant to understand the hydrometeor phase before prediction.
experiment | retrieval algorithm | segmentation | input features | rain type classification |
---|---|---|---|---|
Benchmark | GPROPH(?) | no | 89GHz+150GHz | no |
simulation 1 | RF | no | AMSU-a 3channels+ AMSU-b 5channels | no |
simulation 2 | RF | yes | AMSU-a 3channels+ AMSU-b 5channels | no |
simulation 3 | RF | yes | local features+non local features +geometric | no |
simulation 4 | RF | yes | local features+non local features +geometric | yes |