Freeway Incident Frequency Analysis Based on CART Method

  • Xuecai Xu Huazhong University of Science and Technology & University of Hong Kong
  • Željko Šarić University of Zagreb
  • Ahmad Kouhpanejade University of Nevada, Las Vegas
Keywords: data mining, classification and regression tree, incident frequency, binary tree


Classification and Regression Tree (CART), one of the most widely applied data mining techniques, builds classification and regression models in the form of a binary tree. Using the CART method, this paper establishes the relationship between freeway incident frequency and roadway characteristics, traffic variables and environmental factors. The CART results indicate that the influence of the contributing factors (weather, weekday/weekend, traffic flow and roadway characteristics) on incident frequency is not consistent across incident types or time periods. A comparison with the Negative Binomial Regression model shows that CART is a good alternative for analyzing incident frequency. The relationship between incident frequency and the influencing factors is then discussed, and directions for future research are pointed out.
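As a rough illustration of the binary-tree idea behind a regression tree, the sketch below searches for the single split on one variable that minimises the summed squared error of the two child nodes. It is a minimal sketch, not the paper's implementation: the function names `sse` and `best_split`, the choice of traffic flow as the splitting variable, and all data values are assumptions invented for this example.

```python
# Toy data (hypothetical, NOT from the paper): hourly traffic flow on a
# freeway segment and the number of incidents observed on that segment.
flow = [500, 800, 1200, 1500, 2000, 2400]
incidents = [1, 2, 1, 6, 7, 8]


def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_split(xs, ys):
    """Return (threshold, total_sse) for the binary split x <= threshold
    that minimises the summed squared error of the two child nodes."""
    best = None
    unique = sorted(set(xs))
    # Candidate thresholds are midpoints between consecutive distinct values.
    for lo, hi in zip(unique, unique[1:]):
        threshold = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        total = sse(left) + sse(right)
        if best is None or total < best[1]:
            best = (threshold, total)
    return best


threshold, error = best_split(flow, incidents)
print(f"split at flow <= {threshold}, summed squared error = {error:.3f}")
```

A full tree would apply the same search recursively to each child node and over every candidate variable, then prune the result; this sketch shows only one split on one variable.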



How to Cite
Xu X, Šarić Željko, Kouhpanejade A. Freeway Incident Frequency Analysis Based on CART Method. Promet [Internet]. 2014 May 26 [cited 2024 Mar 1];26(3):191-9. Available from: