近地表气温是城市热环境的重要表征,是改变和影响城区气候的重要因素。为获得空间上连续的近地表气温,本文以北京市为研究区,利用Landsat5/TM数据计算分别得到地表温度、归一化植被指数、改进的归一化差异水体指数、地表反照率、不透水面盖度,并结合气象站点气温和高程作为输入参数建立随机森林模型反演近地表气温。结果表明,随机森林反演的近地表气温平均绝对误差(MAE)为0.80℃,均方根误差(RMSE)为1.06℃,与传统多元线性气温回归方法相比,平均绝对误差(MAE)和均方根误差(RMSE)分别提高0.06℃和0.09℃。研究表明,利用随机森林模型反演近地表气温是可行的,并且具有一定的优越性。此外,对随机森林模型的输入参数进行重要性分析,地表温度对气温反演模型的影响最大,其次为高程。
Near-surface air temperature is an important symbol of urban thermal environment, which is also an important factor affecting and changing the climate of the city. The data of near-surface air temperature is often in absence because the number of meteorological stations is few. In order to obtain spatial continuous near surface air temperature data, this study takes Beijing city as the research area, using Landsat5/TM data to retrieve land surface temperature, normalized difference vegetation index, modified normalized difference water index,albedo and impervious surface cover. These are combined with the meteorological station temperature and DEM as the input parameters into random forest regression model to retrieve near surface air temperature. In this study, land surface temperature was retrieved by single-channel algorithm which was proposed by JiménezMuoz in 2003. The imperious surface cover was calculated by the linear spectral unmixing method and Vegetation-Impervious surface-Soil(VIS) model. The random forest is one of the most effective methods of classification and it runs by constructing multiple decision tree while training and outputting the class. This study uses the R language which is a free software environment for statistical computing and graphics to achieve random forest.The results show that the random forest method has good applicability in the near surface temperature retrieval.The mean absolute error(MAE) and root mean square error(RMSE) of the random forest method are 0.80 and1.07, respectively. Compared with the ordinary regression model, the MAE and(RMSE) accuracy increased by0.06 and 0.09. Using R language to analyze the importance of variables, land surface temperature has the greatest influence on the results. The increase in Mean Square Error of land surface temperature is 14% and the increase in node purity of land surface temperature is 241.36%.