Forecasting business failure: The use of nearest-neighbour support vectors and correcting imbalanced samples - Evidence from the Chinese hotel industry |
| |
Authors: | Hui Li Jie Sun |
| |
Affiliation: | a School of Economics and Management, Zhejiang Normal University, P.O. Box 62, 688 YingBinDaDao, Jinhua, Zhejiang 321004, PR China b College of Engineering, The Ohio State University, 470 Hitchcock Hall, 2070 Neil Avenue, Columbus, OH 43210, USA |
| |
Abstract: | Previous studies on firm failure prediction (FFP) have chiefly addressed predictions based on balanced datasets without considering that the real-world target population consists of imbalanced data. The current study investigates tourism FFP based on the imbalanced data of Chinese listed companies in the hotel industry. The imbalanced dataset was collected and represented in terms of significant financial ratios, and a new up-sampling approach and forecasting method were proposed to correct imbalanced samples. To balance the imbalanced dataset, the up-sampling method generates new minority samples according to random percentage distances from each minority sample to its nearest neighbour (NN). The NNs of unlabelled samples are retrieved from the balanced dataset to produce a knowledge base of nearest-neighbour support vectors, from which base support vector machines (SVMs) are generated and assembled. Empirical results indicate that the proposed sampling approach helped models produce more accurate performance on minority samples, with accuracy rates in excess of 90 per cent. This method of using nearest-neighbour support vectors and correcting imbalanced samples is useful in controlling risk in tourism management. |
| |
Keywords: | Firm failure prediction Tourism imbalanced dataset Nearest-neighbour support vector machine Bagging ensemble Tourism risk forecasting |
本文献已被 ScienceDirect 等数据库收录! |
|