Data mining in road crash analysis: the context of developing countries |
| |
Authors: | Md Asif Raihan Tanweer Hasan |
| |
Institution: | 1. Department of Civil and Environmental Engineering, Florida International University, Miami, FL, USA;2. Department of Civil Engineering, King Abdulaziz University, Jeddah, Saudi Arabia |
| |
Abstract: | The recent advancements in the field of data mining have made vast progress in extracting new information and hidden patterns from large datasets which are often overlooked by the traditional statistical approaches. These methods focus on searching for new and interesting hypothesis which were previously unobserved. Road safety researchers working with the crash data from developed world have seen encouraging success in obtaining new insight into crash mechanism through data mining. An attempt was made in this study to apply these advance methods and evaluate their performance in manifesting crash causes for Bangladesh. The study applies hierarchical clustering to identify hazardous clusters, random forest to find important variables explaining each of these clusters, and classification and regression trees to unveil their respective crash mechanisms for the road crash data of Bangladesh. The results identified several new interesting relationships and acknowledged issues related to quality of data. |
| |
Keywords: | Data mining crash data hierarchical clustering random forest classification and regression trees |
|
|