A NOVEL SPLIT SELECTION OF A LOGISTIC REGRESSION TREE FOR THE CLASSIFICATION OF DATA WITH HETEROGENEOUS SUBGROUPS
- Abstract
- A logistic regression tree (LRT) is a hybrid machine learning method that combines a decision tree model and logistic regression models. An LRT recursively partitions the input data space through splitting and learns multiple logistic regression models optimized for each subpopulation. The split selection is a critical procedure for improving the predictive performance of the LRT. In this paper, we present a novel separability-based split selection method for the construction of an LRT. The separability measure, defined on the feature space of logistic regression models, evaluates the performance of potential child models without fitting, and the optimal split is selected based on the results. Heterogeneous subgroups that have different class-separating patterns can be identified in the split process when they exist in the data. In addition, we compare the performance of our proposed method with the benchmark algorithms through experiments on both synthetic and real-world datasets. The experimental results indicate the effectiveness and generality of our proposed method.
- Author(s)
- A NOVEL SPLIT SELECTION OF A LOGISTIC REGRESSION TREE FOR THE CLASSIFICATION OF DATA WITH HETEROGENEOUS SUBGROUPS
- Issued Date
- 2023
Sudong Lee
Chi-Hyuck Jun
- Type
- Article
- Keyword
- model tree; logistic regression tree; subgroup identification; class separability
- DOI
- 10.23055/ijietap.2023.30.2.8743
- URI
- https://oak.ulsan.ac.kr/handle/2021.oak/17578
- Publisher
- INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE
- Language
- 영어
- ISSN
- 1072-4761
- Citation Volume
- 30
- Citation Number
- 2
- Citation Start Page
- 298
- Citation End Page
- 311
-
Appears in Collections:
- Engineering > Industrial Management Engineering
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.