TY - JOUR
T1 - Data mining techniques for improved WSR-88D rainfall estimation
AU - Trafalis, T. B.
AU - Richman, M. B.
AU - White, A.
AU - Santosa, B.
N1 - Funding Information:
This research was supported by DOC/NOAA Grant NA67RJ0150. We extend thanks to Tim O'Bannon for assistance in interpreting the radar tapes and providing the program to interpret Level 2 radar data and to Tim Kwiatkowski for programming assistance. Additional gratitude is extended to Suat Kasap for data processing assistance and Jean Shingledecker for administrative assistance. This research was performed on The University of Oklahoma ECAS Cray-J90.
PY - 2002/9
Y1 - 2002/9
N2 - The main objective of this paper is to utilize data mining and an intelligent system, Artificial Neural Networks (ANNs), to facilitate rainfall estimation. Ground truth rainfall data are necessary to apply intelligent systems techniques. A unique source of such data is the Oklahoma Mesonet. Recently, with the advent of a national network of advanced radars (i.e. WSR-88D), massive archived data sets have been created generating terabytes of data. Data mining can draw attention to meaningful structures in the archives of such radar data, particularly if guided by knowledge of how the atmosphere operates in rain producing systems. The WSR-88D records digital database contains three native variables: velocity, reflectivity, and spectrum width. However, current rainfall detection algorithms make use of only the reflectivity variable, leaving the other two to be exploited. The primary focus of the research is to capitalize on these additional radar variables at multiple elevation angles and multiple bins in the horizontal for precipitation prediction. Linear regression models and feedforward ANNs are used for precipitation prediction. Rainfall totals from the Oklahoma Mesonet are utilized for the training and verification data. Results for the linear modeling suggest that, taken separately, reflectivity and spectrum width models are highly significant. However, when the two are combined in one linear model, they are not significantly more accurate than reflectivity alone. All linear models are prone to underprediction when heavy rainfall occurred. The ANN results of reflectivity and spectrum width inputs show that a 250-5-1 architecture is least prone to underprediction of heavy rainfall amounts. When a three-part ANN was applied to reflectivity based on light, moderate to heavy rainfall, in addition to spectrum width, it estimated rainfall amounts most accurately of all methods examined.
AB - The main objective of this paper is to utilize data mining and an intelligent system, Artificial Neural Networks (ANNs), to facilitate rainfall estimation. Ground truth rainfall data are necessary to apply intelligent systems techniques. A unique source of such data is the Oklahoma Mesonet. Recently, with the advent of a national network of advanced radars (i.e. WSR-88D), massive archived data sets have been created generating terabytes of data. Data mining can draw attention to meaningful structures in the archives of such radar data, particularly if guided by knowledge of how the atmosphere operates in rain producing systems. The WSR-88D records digital database contains three native variables: velocity, reflectivity, and spectrum width. However, current rainfall detection algorithms make use of only the reflectivity variable, leaving the other two to be exploited. The primary focus of the research is to capitalize on these additional radar variables at multiple elevation angles and multiple bins in the horizontal for precipitation prediction. Linear regression models and feedforward ANNs are used for precipitation prediction. Rainfall totals from the Oklahoma Mesonet are utilized for the training and verification data. Results for the linear modeling suggest that, taken separately, reflectivity and spectrum width models are highly significant. However, when the two are combined in one linear model, they are not significantly more accurate than reflectivity alone. All linear models are prone to underprediction when heavy rainfall occurred. The ANN results of reflectivity and spectrum width inputs show that a 250-5-1 architecture is least prone to underprediction of heavy rainfall amounts. When a three-part ANN was applied to reflectivity based on light, moderate to heavy rainfall, in addition to spectrum width, it estimated rainfall amounts most accurately of all methods examined.
KW - Backpropagation
KW - Clustering
KW - Data mining applications
KW - Dimensionality reduction
KW - Exploratory data analysis
KW - Feedforward neural networks
KW - Mean-square error
KW - Neural network architectures
KW - Pattern recognition
KW - Principal component analysis
KW - Rainfall estimation
UR - http://www.scopus.com/inward/record.url?scp=0036713592&partnerID=8YFLogxK
U2 - 10.1016/S0360-8352(02)00139-0
DO - 10.1016/S0360-8352(02)00139-0
M3 - Article
AN - SCOPUS:0036713592
SN - 0360-8352
VL - 43
SP - 775
EP - 786
JO - Computers and Industrial Engineering
JF - Computers and Industrial Engineering
IS - 4
ER -