Skip to main content
Erschienen in: Journal of Big Data 1/2018

Open Access 01.12.2018 | Case Study

ANN based short-term traffic flow forecasting in undivided two lane highway

verfasst von: Bharti Sharma, Sachin Kumar, Prayag Tiwari, Pranay Yadav, Marina I. Nezhurina

Erschienen in: Journal of Big Data | Ausgabe 1/2018

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Short term traffic forecasting is one of the important fields of study in the transportation domain. Short term traffic forecasting is very useful to develop a more advanced transportation system to control traffic signals and avoid congestions. Several studies have made efforts for short term traffic flow forecasting for divided and undivided highways across the world. However, all these studies relied on the dataset which are greatly varied between countries due to the technology used for transportation data collection. India is a developing country in which efforts are being done to improve the transportation system to avoid congestion and travel time. Two-lane undivided highways with mixed traffic constitute a large portion of Indian road network. This study is an attempt to develop a short term traffic forecasting model using back propagation artificial neural network for two lane undivided highway with mixed traffic conditions in India. The results were compared with random forest, support vector machine, k-nearest neighbor classifier, regression tree and multiple regression models. It was found that back-propagation neural network performs better than other approaches and achieved an R2 value 0.9962, which is a good score.
Abkürzungen
BPANN
back propagation neural network
ITS
Intelligent Transportation Systems
MSE
mean square error
MAE
mean absolute error
NMSE
normalized MSE
MAPE
mean absolute percentage error
CFE
cumulative forecast error
VAPE
variance of absolute percentage error
ATIS
Advanced Traveller Information System

Introduction

India is the second most dense and populated country in the world and one of the fastest growing economies. It is experiencing extreme congestion problems on road specifically on undivided two lane highways with mixed traffic. Facilitating infrastructure, imposing proper taxes to restrict personal vehicle growth and enhancing public transport facilities are long term solutions to this problem. These permanent solutions need government’s involvement. The Indian government has spent a huge amount in the urban infrastructure sector. Many public transports like Bus Rapid Transit, Metro are being built in several new places to encourage the use of public transport. However, still there is a rapid growth of private vehicles [1]. The country’s growing population is also one of the reasons for increased transportation needs. Meeting such needs with infrastructure growth is seemingly less viable because of space and cost constraints.
In general, short term traffic forecasting deals with analyzing previous traffic data and predict the traffic flow for next 5–30 min. The duration of 5–30 min is very important because traffic can grow drastically sometimes in next 5 min and can be very congested in next few minutes. Therefore, if it is possible to predict the traffic situation in next 5 min then suitable actions can be taken by traffic engineers and police to divert the traffic on other routes in case of the possibility of high congestion in next few minutes. Hence, short term traffic forecasting is very important area for investigation specifically for two-lane undivided highways with mixed traffic in India. Researchers in India have considered 4 lane divided highways for short term traffic prediction but two-lane undivided highways with mixed traffic is still requires extensive investigation and research. This is the motivation for the present study. Therefore, this study identified a two-lane undivided highway stretch with mixed traffic conditions and developed a short term traffic forecasting model using back-propagation neural network. The rest of the paper is organized as follows: “Literature review” section provides s literature review of relevant studies. In “Materials and methods” section, materials and methods is discussed that consists of information related to data collection, preparation methods and the back propagation neural network approach used to develop short term traffic forecasting model. “Results and discussion” section presents the results and discussion. Finally, the study is concluded in “Conclusion” section.

Literature review

Intelligent management of traffic flow and with providing travelers more accurate information about traffic and road status can reduce the negative impact of congestion [2]. Developing intelligent transport system (ITS) in India requires extensive quality research and development efforts. In design, planning and operations of highways; the traffic flow forecasting is very important. Short term traffic [3] mentioned that short term traffic flow forecasting is an important aspect of ITS. The importance of traffic flow forecasting for ITS has long been seen in many applications including the development of traffic control strategies in advanced traffic management systems [4] and Advanced Traveller Information Systems [5]. Furthermore, traffic forecasting can be very useful for drivers in saving time and also it helps in reducing traffic congestion and air pollution. In order to predict traffic parameters, the modeling process requires past records.
There are several studies that considered short term traffic flow prediction and developed various methodologies. Kalman filtering [6], local linear regression [7], neural network [8] and fuzzy logic based models [9] are some of the methods used for the short term traffic flow prediction. Due to stochastic and highly non-linear behavior of traffic stream,  machine learning techniques [10] have received a great attention and hence are taken as an alternative for traffic flow prediction. Dougherty and Cobbett [11] used back propagation neural network to develop a model to predict traffic flow, speed and traffic occupancy in the Utrecht/Rotterdam/Hague region of The Netherlands. They mentioned that elasticity test can be a good option to interpret the developed neural network model. A comparative study between neural network and statistical models for short term traffic flow forecasting on motorway traffic data in France was done by Kirby et al. [12]. Dia [13] proposed an object oriented neural network approach for the prediction of short term traffic conditions on a highway stretch between Brisbane and the Gold Coast in Queensland, Australia. Wang and Shi [14] used support vector machines (SVM) model for short term traffic prediction. They suggested that appropriate selection of kernel parameters for SVM is a big challenge. They introduced a new approach to construct a new kernel function with the use of wavelet theory to capture the non-stationary properties of short term traffic speed data. Further, they tested this approach in a real world traffic speed data.
Theja and Vanajakshi [15] considered a mixed and less-lane disciplined traffic data with homogeneous traffic flow on Indian road. They used SVM and back propagation ANN to develop a traffic prediction model. They mentioned that SVM method was found more accurate in their study. In a different study, Centiner et al. [16] considered the homogeneous traffic flow and used ANN model to develop a short term traffic forecasting model on the traffic data collected in Istanbul. They mentioned that day of week, hour and minute had played an important role in traffic volume prediction. The stability and efficiency of neural network for short term prediction of traffic volume with mixed Indian traffic flow conditions on 4-lane undivided highways were studied by Kumar et al. [17]. Kumar et al. [17] considered ANN model for traffic flow forecasting and used traffic volume, speed, traffic density, time and day of week as input parameters. They mentioned that performance of ANN was consistent even if they changed the prediction time interval from 5 min to 15 min. [18] used adaptive Kalman filter approach for short term traffic flow rate prediction and uncertainty quantification. They developed a short term traffic prediction model for the real world traffic data collected from four different highway systems from United Kingdom, Minnesota, Washington and Maryland from USA. They suggested that adaptive Kalman filter is highly effective in case of highly volatile traffic.
Habtemichael and Cetin [19] proposed a non-parametric prediction model using enhanced k-nearest neighbors approach for short term traffic flow rate prediction. They applied and tested this model on 36 datasets (12 datasets from United Kingdom and 24 datasets from USA) collected from different regions. They found that their model outperformed the other advanced parametric models used in the study. Further, Ma et al. [20] pointed out that accuracy is very important in short term traffic flow prediction. They proposed a 2-dimensional prediction method using Kalman filtering for historic traffic data. They mentioned that their proposed approach provided better accuracy than standard Kalman filtering approach. Guo et al. [21] suggested that interval prediction is more important and challenging than point prediction for traffic managers considering the future scenario of ITS. They used fuzzy information granulation method along with ANN, SVM and KNN methods to develop a forecasting model for both point and interval prediction on a real world traffic data collected from American field transportation systems. Their results showed that with an increase in time interval, stability of prediction systems increased.
Such studies are possible with high quality data available with high-tech technology used in the ITS in European countries and United States where people are disciplined towards traffic rules. Considering India, a different scenario can be seen where all roads are not well constructed and people are not very friendly and disciplined towards traffic rules. A two lane undivided road is a global feature for any state and national highways in India. In India, very few studies [22, 23] have taken two lane undivided highways into consideration but there motivation was other than short term traffic flow forecasting. Therefore, it is very important to consider two lane undivided highway stretch with mixed traffic flow into consideration because a large portion of Indian road network are two lane undivided roads with mixed traffic.
The key objective of this study is to develop a short term traffic flow forecasting model using back propagation neural network for non-urban undivided two-lane roads with mixed traffic flow conditions in India.

Materials and methods

Data collection and preprocessing

The data set used in the study was collected from 2-lane undivided highway stretch between Roorkee and Hardwar on National Highway-58 (NH-58). In the NH-58, Delhi to Muzaffarnagar road is constructed as four-lane divided national highway and remaining road is an undivided two-lane. In the present study, undivided two lane highway stretch on NH-58, from Roorkee to Hardwar is selected. The three locations, L1 (near hotel Prakash, Roorkee), L2 (near Rehmadpur) and L3 (near Badheri) shown in Fig. 1 were considered for data collection.
Data were collected during 900–1200 and 1500–1800 h from 1/4/2017 to 31/8/2017. High quality digital cameras were used to effectively capture the traffic on entire selected stretch. Traffic flow was assumed to be simple having no change of direction. All the data were captured with a timer effect. These recordings were played in the computer and features were extracted with the help of a computer program written in python language. Further, all the vehicles were classified into ten categories (Table 1). The number of vehicles was counted, passing through a trap length manually to obtain the traffic volume data. The speed of each vehicle in all categories was calculated by dividing the trap length (20 m) by entry and exit time difference crossing the trap length.
Table 1
Summary of speed and traffic volume measurement for 5 min interval
S. no
Vehicle category
Delhi–Haridwar and Delhi–Haridwar
Average speed (km/h)
Traffic volume
Min
Max
Mean
SD
Min
Max
Mean
SD
1
Car
14.123
43.373
18.140
4.699
38
60
46.254
4.387
2
Bus
9.56
31.4
16.000
5.716
4
24
10.550
3.325
3
Truck
11.1
39.29
16.034
6.352
1
17
5.847
2.838
4
LCV
13.09
29.95
19.549
4.914
3
23
8.847
2.707
5
3-Wheeler
9.96
31.23
18.86
5.643
1
20
4.337
2.993
6
2-Wheelers
9.54
34.13
18.968
5.789
98
155
114.735
9.093
7
Cycle
9.11
22.29
12.861
3.843
5
25
15.0509
3.696
8
Rickshaw
8.59
19.63
11.256
2.899
3
21
13.2777
4.563
9
Tractor
9
20.68
14.342
2.888
1
12
3.643
2.068
10
ADV
0
1.42
0.2335
0.4199
0
2
0.4444
0.7573
Min, minimum; Max, maximum; SD, standard deviation
The measured data has been used in the same fashion mentioned by de Luca et al. [24] . Data extraction was done in the intervals of 5 min for both directions. Statistical characteristics of extracted data are mentioned in Table 1. Since data were collected at the same day same location at two peak times (morning and late afternoon) for the period of three hours for all days, approximately 100 data samples were obtained on each day by considering both sides of the road compositely. Therefore, a total of 15,069 data samples were obtained for the entire duration.
In this study, 22 parameters were taken into consideration to create database for ANN modeling of traffic volume that includes the frequencies of all category of vehicles. For pre-processing of the dataset, the whole exemplars (dataset) were first randomized and then divided into three data sets. First dataset was taken as the training set, second dataset for cross validation and third dataset was used for testing purpose. Randomization is used to stop bias in the dataset and create different samples as a representative of the entire population. By dividing the dataset, 10% of the samples were used for cross-validation, 10% for testing and 80% were used for training purposes. The criterion in separating the data was to assign sufficient samples for the ANN training and some for cross validation and testing.

Model development

ANN has the potential to perceive the non-linear relationship between input and output features and can provide generalize solutions to forecast traffic volume. Multi-Layer Perceptron (MLP) is one of the popular network structure of ANN with an additional layer called hidden layer. MLP can be used to solve different problems because of non-linear characteristic of activation function between its layers of processing elements. The selection of activation function plays a critical role in the performance of a neural network. The error is calculated at each epoch by comparing computed output of each input with expected output. Back propagation is widely used technique to propagate the error. Each processing unit is initially assigned a random weight. The main objective of neural network optimization process is to minimize the mean square error in training, cross validation and testing phase.
There is no limitation on selection of number of input variables in ANN modeling. The selection of number of input and output variables depends on the type of problem. In literature, there is no general approach for creation of perfect neural network architecture. Trial and error simply means that initially we have to decide the weight parameters for each neuron in the hidden layer at random. Further, these weights are modified by propagating the prediction error backwards. Certain parameters like number of input variables, number of hidden layers, activation or transfer function and learning rate plays an important role in designing neural network architecture. The general architecture of ANN is illustrated in Fig. 2.
Previously, ANN model was used by Kumar et al. [17] for the short term traffic flow predictions for 4 lane highway. Here, we are trying to use ANN model to investigate its performance for short term traffic prediction on 2 lane undivided highway with mixed traffic conditions.
An ANN with one hidden layer can be defined as a function
$$y : Z^{A} \to Z^{B}$$
(1)
where, A and B are the length of input and output vector f(x), respectively.
In matrix form, it can be defined as:
$$y\left( x \right) = \varphi \left( {b^{\left( 2 \right)} + W^{\left( 2 \right)} h\left( x \right)} \right)$$
(2)
where,
$$h\left( x \right) = \delta \left( {b^{\left( 1 \right)} + W^{\left( 1 \right)} x} \right)$$
(3)
b(1), b(2) are bias vectors, W(1), W(2) are weight matrices and \(\varphi\) and \(\delta\) are activation functions.
Some of the popular activation functions are sigmoid (x) and tanh (x) used in this study. Equations 4 and 5 provide the mathematical notation and Fig. 3a, b provide graphical illustration for tanh (x) and sigmoid (x) respectively.
$$y\left( {x_{i} } \right) = \tanh \left( {x_{i} } \right)$$
(4)
$$y\left( {x_{i} } \right) = \frac{1}{{\left( {1 + e^{{ - x_{i} }} } \right)}}$$
(5)
The above two activations functions are both sigmoid except that first one is hyperbolic tangent in the range [− 1, + 1] and second one is a logistic function within range [0, 1].
Initially, random weights are assigned to the hidden layer because it is very difficult to identify the accurate weight parameters. Therefore, a loss function is required to adjust the weight parameters accurately. This loss function (Eq. 6) calculates the error between predicted output and exact output and then propagates the error backwards.
$$L_{f} = \frac{1}{2}\sum (y - y^{\prime})^{2}$$
(6)
\(y^{\prime}\) is the predicted output and y is the actual output. The goal is to minimize the loss function by changing the weight matrix. The weight matrix can be changed using gradient decent method. It tries to find the rate of change of error for a specific weight in the error. Weight matrix can be updated using weight update equation as given in Eq. 7.
$$W_{ab} = W_{ab} - \Delta W_{ab}$$
(7)
where, \(W_{ab}\) represents the connection weight from a neuron in layer a to layer b and \(\Delta W\) is given by,
$$\Delta W = - \theta \frac{{\partial L_{f} }}{{\partial W_{ab} }}$$
(8)
\(\theta\) is the rate of learning.

Development of ANN models

In this study, multilayer perceptron network has been used for the prediction of traffic flow for 5 min in future using past 55 min data. For development of ANN model, 216 data samples have been taken, each of which contained 22 features i.e. location, time of day, 10 vehicles categories and respective average speed of each vehicle category. Single class speed flow model is not sufficient for explaining traffic conditions in India because we do not have single road just for one type of vehicle. Moreover, all kind of vehicles (motorized and non-motorized) share the same road and there is a variation in their speed. Therefore, a single class speed flow model is not applicable in this study. Therefore average speed of different class of vehicles is considered to predict the multiclass traffic flow of undivided two lane highway. The development and implementation of the ANN model was done Anaconda Spyder 3.6 version using Scikit Learn package. The best performing neural network structure is obtained by getting the best values of network parameters for the training and the testing. Due to failure of getting appropriate values of network parameters by using other approaches available in literature [2527] [S, T, U], the trial and error approach has been used. The stopping criterion during training was the least mean square error (MSE).
Twelve different ANN models have been developed to train on the dataset. The specification of 12 models with different structures has been presented in Table 2. From the Table 2 it is clear that neural network with 7 hidden neurons gives the best prediction result. Thus, architecture of ANN model in the present study has 22 inputs, 7 neurons in hidden layer and single output. The performance of the ANN models were determined using cross validation and testing data sets. Coefficient of correlation (r), mean absolute error (MAE), mean square error (MSE) and Normalized mean square error (NMSE) were used to evaluate the performance of predicted results.
Table 2
Different ANN networks architectures for traffic volume prediction
Model
Algorithm
Hidden layer
hidden neurons
Transfer Function
Epochs
Learning
Step size/Mo
Training
Cross validation
Testing
Min MSE (*104)
Final MSE (*104)
Min MSE (*104)
Final MSE (*104)
MSE
NMSE (*104)
MAE
R (%)
M1
MLP
1
4
Tanh
100
LM
5.9
5.9
12.5
10.37
4.66
47.99
1.93
98.8
M2
MLP
1
4
Tanh
200
LM
5.3
5.3
15
7.13
6.00
61.81
2.14
98.4
M3
MLP
1
4
Tanh
300
LM
4
4
21.1
20.94
3.25
33.49
1.26
98.7
M4
MLP
1
5
Tanh
100
LM
3.6
3.8
20.24
8.371
4.79
49.34
1.84
99.2
M5
MLP
1
5
Tanh
200
LM
2.8
2.8
16.05
16.866
19.8
204.4
3.73
93.7
M6
MLP
1
5
Tanh
100
MO
1/0.7
29.9
29.9
23.19
2.506
4.12
42.43
1.64
98.3
M7
MLP
1
5
Tanh
200
MO
1/0.7
27.3
27.3
25.8
3.3
7.10
73.12
2.06
96.6
M8
MLP
1
3
SigmoidAxon
100
LM
2.6
2.6
3.09
0.832
7.980
82.1
2.32
97.9
M9
MLP
1
3
SigmoidAxon
200
LM
1.6
1.8
2.63
0.263
17.66
181.8
3.13
92.7
M10
MLP
1
5
SigmoidAxon
100
LM
0.81
0.8141
3.73
1.883
15.012
154.55
3.44
95.6
M11
MLP
1
5
SigmoidAxon
200
LM
0.03
0.0358
4.92
13.479
151.9
1564.4
11.18
25.5
M12
MLP
1
7
SigmoidAxon
150
LM
0.36
0.366
4.7
2.89
0.816
8.4
0.77
99.8

Sensitivity analysis of traffic volume parameter to input

Sensitivity analysis measures the variation in the performance of model with a change in input value [28, 29]. Irrelevant inputs can be eliminated by implementing sensitivity analysis on a trained network. Eliminating irrelevant inputs may result in reduced data collection cost and improved network’s performance. Moreover, sensitivity investigation gives understanding of the fundamental relations between input variables and output. In this investigation, ANN model (Model 12) was applied for sensitivity investigation. Sensitivity investigation was achieved about the mean on the pre-trained MLP network. This batch starts by changing first input between its mean ± 5 while all other inputs were stable at their respective means. The network output was calculated for hundred steps above and below the mean. This procedure was then done repeatedly for each input. Figure 4 shows the deviation of output with respect to deviation of each input. According to sensitivity investigation, thirteen most significant inputs factors are Time, CR/JP/VN, BUS, TRUCK, MB/TT, 3 W, ST/MT, BCYCLE, PRICSHAW, TT, BCART/HCART, SPRICSHAW, STT (as depicted in Fig. 4). In next stage, neural network was trained and verified with same ANN structure as the best selected ANN model (Model 12) considering only the 13 most important input parameters under the sensitivity investigation. Output of training, cross validation and testing stage of the new sensitivity model are described in Table 3. Table 3 illustrated that the sensitivity model does not perform well compared to the best performing proposed ANN model i.e. model 12 even after suppressing number of input variables from 22 to 13.
Table 3
Performance analysis between proposed ANN Model and sensitivity model
Model/parameter
Minimum MSE (T)
Final MSE (T)
Minimum MSE (X)
Final MSE (X)
MSE (Y)
NMSE (Y)
MAE (Y)
r (Y) (%)
Proposed model
3.67E−05
3.67E−05
0.00047
0.00289
0.81655
0.00840
0.7711
99.8
Sensitivity based model
9.46E−05
9.46E−05
0.00040
0.00095
0.91439
0.00941
0.6876
99.5

Results and discussion

Several ANN models have been developed and trained on the data. All these models were trained at different epochs to adjust the weight parameters in the network. Figure 5 illustrates the performance of best 6 models trained on different number of epochs. It can be seen in Fig. 5 that an increase in the number of epochs reduced the MSE and improved the performance of the model both in training and validation phase. It can be seen in Fig. 5 that prediction model M12 achieved the minimum MSE in comparison to other models. Figure 6 shows the regression plot between observed and simulated traffic volume. It can be seen that model M12 achieved the highest prediction accuracy among other models and its R2 value is 0.9919 (higher than other models). R2 can be defined as a coefficient of determination that illustrates how nicely the regression line fitting the data. It is considered that more closed the value of R2 to 1, the better the prediction model would be. Further, the performance of each developed model is evaluated using mean absolute error (MAE), mean absolute percentage error (MAPE), Theils U Statistic (U1, U2), cumulative forecast error (CFE) and variance of absolute percentage error (VAPE). MAE, MAPE, Theils U Statistic (Eqs. 9 and 10) measure the accuracy of prediction and VAPE is used to measure the prediction stability and CFE is used for bias estimation.
Table 4 provides the value for all these parameters for different ANN models. Table 4 illustrated that the Model M12 achieved the best score for all parameters in comparison to other developed models.
$$U_{1} = \frac{{\left( {\mathop \sum \nolimits_{i = 1} \left( {P_{i} - A_{i} } \right)^{2} } \right)^{1/2} }}{{\left( {\mathop \sum \nolimits_{i = 1} A_{i}^{2} } \right)^{1/2} }}$$
(9)
$$U_{2} = \frac{{\left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} \left( {A_{i} - P_{i} } \right)^{2} } \right)^{1/2} }}{{\left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} A_{i}^{2} )^{1/2} + \left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} P_{i}^{2} } \right)} \right)^{1/2} }}$$
(10)
Table 4
Statistical indices of different models
Model
Error
CFE
Theil’s U statistic
MAE
MAPE (%)
VAPE (%)
MPE
U1
U2
M1
1.93
0.9010
0.0221
− 0.6872
− 32.068
0.0031
0.0450
M2
2.14
0.9917
0.0265
− 0.8014
− 38.081
0.0056
1.3689
M3
1.26
0.5889
0.0287
− 0.4358
− 20.459
0.0041
0.9605
M4
1.84
0.0386
0.0029
− 0.8498
− 40.538
0.0050
0.0441
M5
3.73
1.7325
0.2774
− 1.1564
− 52.574
0.0102
0.9964
M6
1.64
0.7689
0.0268
− 0.3665
− 16.559
0.0046
0.0425
M7
2.06
0.9546
12.690
− 0.1373
− 5.1569
1.7368
0.0563
M8
2.32
1.0963
0.0369
− 0.9114
− 42.496
3.08E−05
0.0588
M9
3.13
1.4442
0.0607
− 0.5223
− 25.028
0.0096
0.0811
M10
11.18
5.2393
0.1217
− 3.7443
− 169.104
0.0279
0.2455
M11
11.41
5.3393
0.1241
− 3.6870
− 165.905
0.0285
0.2539
M12
0.77
0.3569
0.0102
− 0.3158
− 14.937
0.0020
0.0180
In the above equations, A and P denote the changes in actual values and predicted change in values. U1 and U2 is a measure of prediction accuracy and quality respectively. Equations 1115 defines the MAE, MPE, MAPE, VAPE and CFE respectively.
$$MAE = \frac{1}{N}\sum \left| {P_{i} - A_{i} } \right|$$
(11)
$$MPE = \frac{1}{N}\sum \frac{{\left( {A_{i} - P_{i} } \right)^{2} }}{{A_{i} }} \times 100$$
(12)
$$MAPE = \frac{1}{N}\sum \frac{{\left| {(A_{i} - P_{i} )^{2} } \right|}}{{A_{i} }} \times 100$$
(13)
$$VAPE = Var\left( {\frac{{\left| {P_{i} - A_{i} } \right|}}{{A_{i} }}} \right) \times 100$$
(14)
$$CFE = \sum \left( {A_{i} - P_{i} } \right)$$
(15)
Performance comparison of proposed traffic volume prediction model has been done using the approaches used in earlier studies (Table 5). Random forest and regression trees, [30], SVM regression, [31], K-nearest neighbors, [32], Multiple linear regression, [33] have been developed on the dataset. It is found that the BPANN (M12) model can predict traffic volume more accurately than other approaches (as shown in Table 5). It is obvious from the statistical indices that ANN based model is robust and stable, and can be applied successfully for short term traffic flow prediction in Indian traffic conditions. ANN modeling and integrated sensitivity analysis is used in the present study. It is one of the most systematic and accurate way to predict the performance of traffic volume.
Table 5
Performance comparison between proposed ANN based model and traditional models
Models
MSE
RMSE
MAE
RSE
RRSE
RAE
R2
Random forest
217.718
14.7553
10.9593
0.2284
0.4779
0.4400
0.7716
Regression tree
388.380
19.7074
14.4908
0.4074
0.6383
0.5817
0.5926
SVM regression
898.723
29.9787
23.3781
0.9428
0.9710
0.9385
0.0572
K nearest neighbors Regression (KNN)
14.1294
3.7589
1.4239
0.0148
0.1217
0.0572
0.9852
Multiple linear regression
305.536
17.4796
13.3072
0.3205
0.5661
0.5342
0.6795
BP neural network
4.2848
2.06999
0.77114
0.09951
0.31546
0.16204
0.9962
In this study, 7 nodes have been taken into consideration for hidden layer which provides good results. However, this number is not an optimal selection for any model and can be selected as per the requirements and the size of the data. The back propagation algorithm can be utilized in Advanced Traveller Information Systems (ATIS) because of its efficiency in reducing prediction error. ATIS is a system which makes prediction based on the information stored in the database [34]. More the predictions are accurate, better the suggestions of ATIS would be. Therefore, the thirst for developing more robust and accurate short term prediction model is in demand. Our model achieved the better R2 score (0.9962) among other models for short term traffic flow prediction on two lane undivided highways with mixed traffic conditions. Moreover, if accuracy can be sacrificed a little and processing time is more important, then methods like SVM and conventional regression-b estimators can be a good options rather than going with ANN.

Conclusion

This study presented ANN based short term traffic volume prediction model for undivided two lane highways with mixed traffic in India. The data samples were collected on NH-58 highways stretch from Roorkee to Hardwar. The study used back propagation neural network approach to develop a short term forecasting model for two lane undivided highways with mixed traffic conditions. The major advantage of back-propagation neural network is that it calculates the prediction error and propagates back to the previous layers in order to modify the weights; resulting better prediction accuracy with more training. The training can be stopped after certain number epochs with no further improvements in prediction accuracy. The results of best back propagation model (M12) was quite promising as it achieved a good R2 value of 0.9962. This study can be useful to be used in Advanced Traveller Information Systems for short term traffic predictions.
The study is certainly capable to provide a good solution for short term traffic prediction for two lane undivided highways with mixed traffic conditions in India. But the dataset used for this study is restricted to a limited portion of highway stretch and days of week. The study can be enhanced by using more informative data about traffic flow during weekdays and weekends, for peak hours and normal hours, for different months or seasons. Although this data collection procedure requires a lot of human and technology efforts but it will certainly help in more informative and large dataset. Moreover, this large data set needs to be analyzed with more suitable algorithms i.e. deep neural networks which are known to handle large dataset efficiently. The future scope of this study would be to work on the above mentioned limitations of the study and provide a better solution.

Authors’ contributions

BS and PY have collected the data from the sites and perform conceptualization. BS, SK and PT performed the analysis and experiments. BS and SK wrote the manuscript. The whole work was supervised by MN. All authors read and approved the final manuscript.

Acknowledgements

The authors gratefully acknowledge the financial support of the Ministry of Education and Science of the Russian Federation in the framework of Increase Competitiveness Program of NUST « MISiS » (No. К4-2017-052).

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The authors are not authorized to share the data. However, full support regarding the study will be provided for interested readers.

Funding

The research receive no external funding. The publication costs will be covered by personal funds of the authors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://​creativecommons.​org/​licenses/​by/​4.​0/​), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Literatur
1.
Zurück zum Zitat UTIC: Urban transportation in Indian Cities, In: Compendium of good practices. New Delhi: National Institute of Urban Affairs, 2015. UTIC: Urban transportation in Indian Cities, In: Compendium of good practices. New Delhi: National Institute of Urban Affairs, 2015.
3.
Zurück zum Zitat Sun S, Zhang C, Yu G. A Bayesian network approach to traffic flow forecasting. Transp Syst IEEE Trans. 2006;7(1):124–32.CrossRef Sun S, Zhang C, Yu G. A Bayesian network approach to traffic flow forecasting. Transp Syst IEEE Trans. 2006;7(1):124–32.CrossRef
4.
Zurück zum Zitat Zeng D, Xu J, Gu J, Liu L, Xu G. Short term traffic flow prediction using hybrid ARIMA and ANN models. In: Workshop on power electronics and intelligent transportation system. 2008. p. 621–5. Zeng D, Xu J, Gu J, Liu L, Xu G. Short term traffic flow prediction using hybrid ARIMA and ANN models. In: Workshop on power electronics and intelligent transportation system. 2008. p. 621–5.
5.
Zurück zum Zitat Zheng W, Lee DH, Shi Q. Short-term freeway traffic flow prediction: Bayesian combined neural network approach. J Transp Eng. 2006;132:114–21.CrossRef Zheng W, Lee DH, Shi Q. Short-term freeway traffic flow prediction: Bayesian combined neural network approach. J Transp Eng. 2006;132:114–21.CrossRef
6.
Zurück zum Zitat Xie Y, Zhang Y, Ye Z. Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Comput Aided Civil Infrastruct Eng. 2007;22(5):326–34.CrossRef Xie Y, Zhang Y, Ye Z. Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Comput Aided Civil Infrastruct Eng. 2007;22(5):326–34.CrossRef
7.
Zurück zum Zitat Sun H, Liu HX, Xiao H, He RR, Ran B. Use of local linear regression model for short-term traffic forecasting. Transp Res Rec. 2003;1836:143–50.CrossRef Sun H, Liu HX, Xiao H, He RR, Ran B. Use of local linear regression model for short-term traffic forecasting. Transp Res Rec. 2003;1836:143–50.CrossRef
8.
Zurück zum Zitat Chen H, Grant-Muller S. Use of sequential learning for short-term traffic flow forecasting. Trans Res C. 2001;9(5):319–36.CrossRef Chen H, Grant-Muller S. Use of sequential learning for short-term traffic flow forecasting. Trans Res C. 2001;9(5):319–36.CrossRef
9.
Zurück zum Zitat Zhang Y, Ye Z. Short-term traffic flow forecasting using fuzzy logic system methods. J Intell Transp Syst. 2008;12(3):102–12.MATHCrossRef Zhang Y, Ye Z. Short-term traffic flow forecasting using fuzzy logic system methods. J Intell Transp Syst. 2008;12(3):102–12.MATHCrossRef
10.
Zurück zum Zitat Kumar S, Toshniwal D. Severity analysis of powered two wheeler traffic accidents in Uttarakhand, India. Eur Transp Res Rev. 2017;9(2):24.CrossRef Kumar S, Toshniwal D. Severity analysis of powered two wheeler traffic accidents in Uttarakhand, India. Eur Transp Res Rev. 2017;9(2):24.CrossRef
11.
Zurück zum Zitat Dougherty MS, Cobbett MR. Short-term inter-urban traffic forecasts using neural networks. Int J Forecast. 1997;13(1):21–31.CrossRef Dougherty MS, Cobbett MR. Short-term inter-urban traffic forecasts using neural networks. Int J Forecast. 1997;13(1):21–31.CrossRef
12.
Zurück zum Zitat Kirby HR, Watson SM, Dougherty MS. Should we use neural networks or statistical models for short-term motorway traffic forecasting? Int J Forecast. 1997;13(1):43–50.CrossRef Kirby HR, Watson SM, Dougherty MS. Should we use neural networks or statistical models for short-term motorway traffic forecasting? Int J Forecast. 1997;13(1):43–50.CrossRef
13.
Zurück zum Zitat Dia H. An object-oriented neural network approach to short-term traffic forecasting. Eur J Oper Res. 2001;131(2):253–61.MATHCrossRef Dia H. An object-oriented neural network approach to short-term traffic forecasting. Eur J Oper Res. 2001;131(2):253–61.MATHCrossRef
14.
Zurück zum Zitat Wang J, Shi Q. Short-term traffic speed forecasting hybrid model based on Chaos-wavelet analysis-support vector machine theory. Transp Res C. 2013;27(1):219–32.CrossRef Wang J, Shi Q. Short-term traffic speed forecasting hybrid model based on Chaos-wavelet analysis-support vector machine theory. Transp Res C. 2013;27(1):219–32.CrossRef
15.
Zurück zum Zitat Theja PV, Vanajakshi L. Short term prediction of traffic parameters using support vector machines technique. In: Proceedings of the third international conference on emerging trends in engineering and technology. 19–21 November 2010, Goa, India. p. 70–5. Theja PV, Vanajakshi L. Short term prediction of traffic parameters using support vector machines technique. In: Proceedings of the third international conference on emerging trends in engineering and technology. 19–21 November 2010, Goa, India. p. 70–5.
16.
Zurück zum Zitat Centiner BG, Sari M, Borat O. A neural network based traffic-flow prediction model. Math Comput Appl. 2010;15(2):269–78. Centiner BG, Sari M, Borat O. A neural network based traffic-flow prediction model. Math Comput Appl. 2010;15(2):269–78.
17.
Zurück zum Zitat Kumar K, Parida M, Katiyar VK. Short term traffic flow prediction in heterogeneous condition using Artificial Neural Network. Transport. 2015;30(4):397–405.CrossRef Kumar K, Parida M, Katiyar VK. Short term traffic flow prediction in heterogeneous condition using Artificial Neural Network. Transport. 2015;30(4):397–405.CrossRef
18.
Zurück zum Zitat Guo J, Huang W, Williams BM. Adaptive Kalman filter approach for stochastic short term traffic flow rate prediction and uncertainty quantification. Trans Res C. 2014;43(1):50–64.CrossRef Guo J, Huang W, Williams BM. Adaptive Kalman filter approach for stochastic short term traffic flow rate prediction and uncertainty quantification. Trans Res C. 2014;43(1):50–64.CrossRef
19.
Zurück zum Zitat Habtemichael FG, Cetin M. Short term traffic flow rate forecasting based on identifying similar traffic patterns. In: Transportation research Part C: emerging technologies. 2016. p. 61–78. Habtemichael FG, Cetin M. Short term traffic flow rate forecasting based on identifying similar traffic patterns. In: Transportation research Part C: emerging technologies. 2016. p. 61–78.
20.
Zurück zum Zitat Ma M, Liang S, Guo H, Yang J. Short term traffic flow prediction using a self-adaptive two dimensional forecasting method. Adv Mech Eng. 2017;9(8):1. Ma M, Liang S, Guo H, Yang J. Short term traffic flow prediction using a self-adaptive two dimensional forecasting method. Adv Mech Eng. 2017;9(8):1.
21.
Zurück zum Zitat Guo J, Liu Z, Huang W, Wei Y, Cao J. Short–term traffic flow prediction using fuzzy information granulation approach under different time intervals. IET Intel Transport Syst. 2018;12(2):143–50.CrossRef Guo J, Liu Z, Huang W, Wei Y, Cao J. Short–term traffic flow prediction using fuzzy information granulation approach under different time intervals. IET Intel Transport Syst. 2018;12(2):143–50.CrossRef
22.
Zurück zum Zitat Sharma N, Arkatkar SS, Sarkar AK. Study on heterogeneous traffic flow characteristics of a two-lane road. Transport. 2011;26(2):185–96.CrossRef Sharma N, Arkatkar SS, Sarkar AK. Study on heterogeneous traffic flow characteristics of a two-lane road. Transport. 2011;26(2):185–96.CrossRef
23.
Zurück zum Zitat Gupta AK, Redhu P. Analysis of driver’s anticipation effect in sensing relative flux in a new lattice model for two lane traffic system. Phys A. 2013;392:5622.CrossRef Gupta AK, Redhu P. Analysis of driver’s anticipation effect in sensing relative flux in a new lattice model for two lane traffic system. Phys A. 2013;392:5622.CrossRef
24.
Zurück zum Zitat de Luca M, Dell’Acqua G, Lamberty R. Road safety analysis using operating speeds: case studies in southern Italy. Proc Soc Behav Sci. 2012;53:702–10.CrossRef de Luca M, Dell’Acqua G, Lamberty R. Road safety analysis using operating speeds: case studies in southern Italy. Proc Soc Behav Sci. 2012;53:702–10.CrossRef
25.
Zurück zum Zitat Berry MJA, Linoff G. Data mining techniques. New York: Wiley; 1997. Berry MJA, Linoff G. Data mining techniques. New York: Wiley; 1997.
26.
Zurück zum Zitat Blum A. Neural networks in C++. New York: Wiley; 1992. Blum A. Neural networks in C++. New York: Wiley; 1992.
27.
Zurück zum Zitat Boger Z, Guterman H. Knowledge extraction from artificial neural network models. IEEE systems, man, and cybernetics conference. Orlando, FL; 1997. Boger Z, Guterman H. Knowledge extraction from artificial neural network models. IEEE systems, man, and cybernetics conference. Orlando, FL; 1997.
28.
Zurück zum Zitat Saltelli A, Ratto M, Andres T. “Global sensitivity analysis”, the Primer. New York: Wiley; 2008.MATH Saltelli A, Ratto M, Andres T. “Global sensitivity analysis”, the Primer. New York: Wiley; 2008.MATH
29.
Zurück zum Zitat Yeung DS, Cloete I, Shi D, Ng WWY. Sensitivity analysis for neural networks, natural computing series. Berlin: Springer; 2010.MATHCrossRef Yeung DS, Cloete I, Shi D, Ng WWY. Sensitivity analysis for neural networks, natural computing series. Berlin: Springer; 2010.MATHCrossRef
31.
Zurück zum Zitat Wei D, Liu H. An adaptive-margin support vector regression for short-term traffic flow forecast. J Intell Transp Syst. 2013;7(4):317–27.CrossRef Wei D, Liu H. An adaptive-margin support vector regression for short-term traffic flow forecast. J Intell Transp Syst. 2013;7(4):317–27.CrossRef
32.
Zurück zum Zitat Zhong JT, Ling S. Key factors of K-nearest neighbors nonparametric regression in short-time traffic flow forecasting. In: 21st international conference on industrial engineering and engineering management 2014 (IEEM 2014). Zhong JT, Ling S. Key factors of K-nearest neighbors nonparametric regression in short-time traffic flow forecasting. In: 21st international conference on industrial engineering and engineering management 2014 (IEEM 2014).
33.
Zurück zum Zitat Cinsdikici M, Memis K. Traffic flow evaluation at a short section using multiple regression approach. In: 7th European congress and exhibition on ITS (Intelligent Transport Systems and Services). 2008. Cinsdikici M, Memis K. Traffic flow evaluation at a short section using multiple regression approach. In: 7th European congress and exhibition on ITS (Intelligent Transport Systems and Services). 2008.
Metadaten
Titel
ANN based short-term traffic flow forecasting in undivided two lane highway
verfasst von
Bharti Sharma
Sachin Kumar
Prayag Tiwari
Pranay Yadav
Marina I. Nezhurina
Publikationsdatum
01.12.2018
Verlag
Springer International Publishing
Erschienen in
Journal of Big Data / Ausgabe 1/2018
Elektronische ISSN: 2196-1115
DOI
https://doi.org/10.1186/s40537-018-0157-0

Weitere Artikel der Ausgabe 1/2018

Journal of Big Data 1/2018 Zur Ausgabe

Premium Partner