KLI

Data reformation - A novel data processing technique enhancing machine learning applicability for predicting streamflow extremes

Metadata Downloads
Abstract
Hydrologists have been actively exploring the utility of machine learning (ML) models for predicting streamflow. While ML methods have proven to be as accurate as conventional modeling techniques for streamflows well represented in the training set, they continue to lack satisfactory skills for extreme events. In this study, a novel ‘data reformation’ technique is proposed based on the Relative Strength Index (RSI) – a measure of speed and direction of changes in the time series. RSI homogenizes all observations to a constrained 0–100 range, and all ‘out-of-sample’ data in the testing set fall within the space of the training set. Long Short-Term Memory network with an attention mechanism is used to train three ML models using 55,055 events from the CAMELS dataset (670 basins, 1980–2014). Predictions are made for 12,424 events, of which 3,810 are significantly higher than streamflows in the training set. The ML model based on RSI-reformed data exhibits superior performance, as compared to other advanced ML models without data reformation. Peaks up to 15 times larger than those in the training events are accurately predicted, leading to an outperforming model skill for 433 out of 670 catchments. These findings indicate that incorporating a new data reformation technique into the data pre-processing step in ML modeling can enhance the utility of ML models for extreme events. This research encourages further exploration to identify better data reformation methods to enable confident ML predictions.
Author(s)
Vinh Ngoc TranValeriy Y. IvanovJongho Kim
Issued Date
2023
Type
Article
Keyword
Physical sciencesWater-supply
DOI
10.1016/j.advwatres.2023.104569
URI
https://oak.ulsan.ac.kr/handle/2021.oak/16938
Publisher
ADVANCES IN WATER RESOURCES
Language
영어
ISSN
0309-1708
Citation Volume
182
Citation Number
1
Citation Start Page
104569
Appears in Collections:
Engineering > Civil and Environmental Engineering
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.