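Sixteen linear regression precipitation prediction models were built from precipitation records at 160 stations in China, the ECMWF ERA-Interim reanalysis, and other data. They comprise "station" direct/indirect precipitation prediction models (an indirect model first predicts the East Asian monsoon index and then estimates station precipitation from it) and "region-station" direct/indirect precipitation prediction models, which first predict the precipitation of a region and then distribute it to the stations. An ensemble precipitation prediction model combining all of the models was also built. The predictors are four factors: two ENSO indices, the North Atlantic Oscillation (NAO) index, and snow depth over the Tibetan Plateau. The models are further divided by preprocessing into 3-factor and 4-factor (including the snow factor) groups, and into groups that do or do not take the logarithm of precipitation. Hindcast experiments for 2005-2016 show that the "station" models outperform the "region-station" models, and that models using the logarithm of precipitation outperform those that do not. In addition, within the "station" group the indirect precipitation prediction models outperform the direct ones, whereas the opposite holds for the "region-station" group. The bias introduced by the uncertainty of the ERA-Interim snow depth data exceeds the contribution of that factor to precipitation prediction. The highest mean PS score is obtained by the 3-factor direct station model with log-transformed precipitation (MDS-3Ln), which averages 71 points and exceeds the score of the ensemble model (MEM). These results indicate that the design concept of a linear regression precipitation model may not agree with its actual predictive performance, because factor selection or mathematical processing can introduce new uncertainty or bias; the "cost-benefit" relationship of each design option must therefore be evaluated comprehensively.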
Sixteen linear regression prediction models are developed from precipitation data at 160 stations in China and the ECMWF reanalysis data (ERA-Interim). The models include a station direct/indirect precipitation prediction model group (in the indirect models, the East Asian monsoon index is predicted first and station precipitation is then predicted from it) and a region-station direct/indirect precipitation prediction model group (regional precipitation is predicted first and then distributed to the stations). In addition, an ensemble precipitation prediction model is composed with those 16 models as members. The regression equations use four factors in total: two types of ENSO index, the North Atlantic Oscillation (NAO) index, and the mean snow depth on the Tibetan Plateau. According to the number of factors, the models are grouped into 3-factor models (the ENSO indices and the NAO index) and 4-factor models (all four factors). The models also differ in their predictand, which is either precipitation or its logarithm. Hindcasts for 2005-2016 show that the average PS score of the station prediction model group is higher than that of the region-station prediction model group. Models that take logarithmic precipitation as the predictand perform better than those that use precipitation itself. In the station prediction model group, the indirect precipitation prediction models are usually superior to the direct ones, while the opposite holds for the region-station model group. For precipitation prediction, the deviation caused by the uncertainty in the snow depth of the ERA-Interim reanalysis data is greater than the contribution of the snow depth factor. Among all the models, the direct station prediction model with 3 factors for logarithmic precipitation (MDS-3Ln) obtains the highest average PS score, reaching 71 points, which is higher than that of the ensemble model (MEM).
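The difference between a direct model on logarithmic precipitation and an indirect model that goes through the monsoon index can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic, the function names (`fit_linear`, `direct_log_model`, `indirect_model`) are hypothetical, and ordinary least squares via NumPy stands in for whatever fitting procedure the study actually used. Back-transforming with `exp` is also a simplification.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns [b0, b1, ...]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_linear(coef, X):
    """Evaluate b0 + sum(b_i * x_i) for each row of X."""
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coef

def direct_log_model(factors, precip, new_factors):
    """Direct station model on log precipitation: ln(P) regressed on the factors."""
    coef = fit_linear(factors, np.log(precip))
    return np.exp(predict_linear(coef, new_factors))

def indirect_model(factors, monsoon_index, precip, new_factors):
    """Indirect station model: factors -> monsoon index -> station precipitation."""
    coef_index = fit_linear(factors, monsoon_index)
    coef_precip = fit_linear(monsoon_index.reshape(-1, 1), precip)
    index_hat = predict_linear(coef_index, new_factors)
    return predict_linear(coef_precip, index_hat.reshape(-1, 1))

# Synthetic training data (assumption): 30 years, 3 factors standing in for
# two ENSO indices and the NAO index, for a single hypothetical station.
rng = np.random.default_rng(0)
n_years = 30
factors = rng.normal(size=(n_years, 3))
monsoon_index = factors @ np.array([0.5, -0.3, 0.2]) + rng.normal(scale=0.1, size=n_years)
precip = np.exp(5.0 + 0.3 * monsoon_index + rng.normal(scale=0.1, size=n_years))

# Factor values for two hypothetical forecast years.
new_factors = rng.normal(size=(2, 3))
direct_pred = direct_log_model(factors, precip, new_factors)
indirect_pred = indirect_model(factors, monsoon_index, precip, new_factors)
```

One property of the logarithmic variant is visible in the sketch: because the prediction is exponentiated, it can never be negative, whereas the untransformed linear model has no such constraint.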
It is indicated that the real performance of a linear regression precipi