nach oben

Journal of Big Data

Erschienen in:

Open Access 01.12.2018 | Case Study

ANN based short-term traffic flow forecasting in undivided two lane highway

verfasst von: Bharti Sharma, Sachin Kumar, Prayag Tiwari, Pranay Yadav, Marina I. Nezhurina

Erschienen in: Journal of Big Data | Ausgabe 1/2018

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Patentsuche

Aus

Abstract

Short term traffic forecasting is one of the important fields of study in the transportation domain. Short term traffic forecasting is very useful to develop a more advanced transportation system to control traffic signals and avoid congestions. Several studies have made efforts for short term traffic flow forecasting for divided and undivided highways across the world. However, all these studies relied on the dataset which are greatly varied between countries due to the technology used for transportation data collection. India is a developing country in which efforts are being done to improve the transportation system to avoid congestion and travel time. Two-lane undivided highways with mixed traffic constitute a large portion of Indian road network. This study is an attempt to develop a short term traffic forecasting model using back propagation artificial neural network for two lane undivided highway with mixed traffic conditions in India. The results were compared with random forest, support vector machine, k-nearest neighbor classifier, regression tree and multiple regression models. It was found that back-propagation neural network performs better than other approaches and achieved an R² value 0.9962, which is a good score.

BPANN

back propagation neural network

ITS

Intelligent Transportation Systems

MSE

mean square error

MAE

mean absolute error

NMSE

normalized MSE

MAPE

mean absolute percentage error

CFE

cumulative forecast error

VAPE

variance of absolute percentage error

ATIS

Advanced Traveller Information System

Introduction

India is the second most dense and populated country in the world and one of the fastest growing economies. It is experiencing extreme congestion problems on road specifically on undivided two lane highways with mixed traffic. Facilitating infrastructure, imposing proper taxes to restrict personal vehicle growth and enhancing public transport facilities are long term solutions to this problem. These permanent solutions need government’s involvement. The Indian government has spent a huge amount in the urban infrastructure sector. Many public transports like Bus Rapid Transit, Metro are being built in several new places to encourage the use of public transport. However, still there is a rapid growth of private vehicles [1]. The country’s growing population is also one of the reasons for increased transportation needs. Meeting such needs with infrastructure growth is seemingly less viable because of space and cost constraints.

In general, short term traffic forecasting deals with analyzing previous traffic data and predict the traffic flow for next 5–30 min. The duration of 5–30 min is very important because traffic can grow drastically sometimes in next 5 min and can be very congested in next few minutes. Therefore, if it is possible to predict the traffic situation in next 5 min then suitable actions can be taken by traffic engineers and police to divert the traffic on other routes in case of the possibility of high congestion in next few minutes. Hence, short term traffic forecasting is very important area for investigation specifically for two-lane undivided highways with mixed traffic in India. Researchers in India have considered 4 lane divided highways for short term traffic prediction but two-lane undivided highways with mixed traffic is still requires extensive investigation and research. This is the motivation for the present study. Therefore, this study identified a two-lane undivided highway stretch with mixed traffic conditions and developed a short term traffic forecasting model using back-propagation neural network. The rest of the paper is organized as follows: “Literature review” section provides s literature review of relevant studies. In “Materials and methods” section, materials and methods is discussed that consists of information related to data collection, preparation methods and the back propagation neural network approach used to develop short term traffic forecasting model. “Results and discussion” section presents the results and discussion. Finally, the study is concluded in “Conclusion” section.

Literature review

Intelligent management of traffic flow and with providing travelers more accurate information about traffic and road status can reduce the negative impact of congestion [2]. Developing intelligent transport system (ITS) in India requires extensive quality research and development efforts. In design, planning and operations of highways; the traffic flow forecasting is very important. Short term traffic [3] mentioned that short term traffic flow forecasting is an important aspect of ITS. The importance of traffic flow forecasting for ITS has long been seen in many applications including the development of traffic control strategies in advanced traffic management systems [4] and Advanced Traveller Information Systems [5]. Furthermore, traffic forecasting can be very useful for drivers in saving time and also it helps in reducing traffic congestion and air pollution. In order to predict traffic parameters, the modeling process requires past records.

There are several studies that considered short term traffic flow prediction and developed various methodologies. Kalman filtering [6], local linear regression [7], neural network [8] and fuzzy logic based models [9] are some of the methods used for the short term traffic flow prediction. Due to stochastic and highly non-linear behavior of traffic stream, machine learning techniques [10] have received a great attention and hence are taken as an alternative for traffic flow prediction. Dougherty and Cobbett [11] used back propagation neural network to develop a model to predict traffic flow, speed and traffic occupancy in the Utrecht/Rotterdam/Hague region of The Netherlands. They mentioned that elasticity test can be a good option to interpret the developed neural network model. A comparative study between neural network and statistical models for short term traffic flow forecasting on motorway traffic data in France was done by Kirby et al. [12]. Dia [13] proposed an object oriented neural network approach for the prediction of short term traffic conditions on a highway stretch between Brisbane and the Gold Coast in Queensland, Australia. Wang and Shi [14] used support vector machines (SVM) model for short term traffic prediction. They suggested that appropriate selection of kernel parameters for SVM is a big challenge. They introduced a new approach to construct a new kernel function with the use of wavelet theory to capture the non-stationary properties of short term traffic speed data. Further, they tested this approach in a real world traffic speed data.

Theja and Vanajakshi [15] considered a mixed and less-lane disciplined traffic data with homogeneous traffic flow on Indian road. They used SVM and back propagation ANN to develop a traffic prediction model. They mentioned that SVM method was found more accurate in their study. In a different study, Centiner et al. [16] considered the homogeneous traffic flow and used ANN model to develop a short term traffic forecasting model on the traffic data collected in Istanbul. They mentioned that day of week, hour and minute had played an important role in traffic volume prediction. The stability and efficiency of neural network for short term prediction of traffic volume with mixed Indian traffic flow conditions on 4-lane undivided highways were studied by Kumar et al. [17]. Kumar et al. [17] considered ANN model for traffic flow forecasting and used traffic volume, speed, traffic density, time and day of week as input parameters. They mentioned that performance of ANN was consistent even if they changed the prediction time interval from 5 min to 15 min. [18] used adaptive Kalman filter approach for short term traffic flow rate prediction and uncertainty quantification. They developed a short term traffic prediction model for the real world traffic data collected from four different highway systems from United Kingdom, Minnesota, Washington and Maryland from USA. They suggested that adaptive Kalman filter is highly effective in case of highly volatile traffic.

Habtemichael and Cetin [19] proposed a non-parametric prediction model using enhanced k-nearest neighbors approach for short term traffic flow rate prediction. They applied and tested this model on 36 datasets (12 datasets from United Kingdom and 24 datasets from USA) collected from different regions. They found that their model outperformed the other advanced parametric models used in the study. Further, Ma et al. [20] pointed out that accuracy is very important in short term traffic flow prediction. They proposed a 2-dimensional prediction method using Kalman filtering for historic traffic data. They mentioned that their proposed approach provided better accuracy than standard Kalman filtering approach. Guo et al. [21] suggested that interval prediction is more important and challenging than point prediction for traffic managers considering the future scenario of ITS. They used fuzzy information granulation method along with ANN, SVM and KNN methods to develop a forecasting model for both point and interval prediction on a real world traffic data collected from American field transportation systems. Their results showed that with an increase in time interval, stability of prediction systems increased.

Such studies are possible with high quality data available with high-tech technology used in the ITS in European countries and United States where people are disciplined towards traffic rules. Considering India, a different scenario can be seen where all roads are not well constructed and people are not very friendly and disciplined towards traffic rules. A two lane undivided road is a global feature for any state and national highways in India. In India, very few studies [22, 23] have taken two lane undivided highways into consideration but there motivation was other than short term traffic flow forecasting. Therefore, it is very important to consider two lane undivided highway stretch with mixed traffic flow into consideration because a large portion of Indian road network are two lane undivided roads with mixed traffic.

The key objective of this study is to develop a short term traffic flow forecasting model using back propagation neural network for non-urban undivided two-lane roads with mixed traffic flow conditions in India.

Materials and methods

Data collection and preprocessing

The data set used in the study was collected from 2-lane undivided highway stretch between Roorkee and Hardwar on National Highway-58 (NH-58). In the NH-58, Delhi to Muzaffarnagar road is constructed as four-lane divided national highway and remaining road is an undivided two-lane. In the present study, undivided two lane highway stretch on NH-58, from Roorkee to Hardwar is selected. The three locations, L1 (near hotel Prakash, Roorkee), L2 (near Rehmadpur) and L3 (near Badheri) shown in Fig. 1 were considered for data collection.

Data were collected during 900–1200 and 1500–1800 h from 1/4/2017 to 31/8/2017. High quality digital cameras were used to effectively capture the traffic on entire selected stretch. Traffic flow was assumed to be simple having no change of direction. All the data were captured with a timer effect. These recordings were played in the computer and features were extracted with the help of a computer program written in python language. Further, all the vehicles were classified into ten categories (Table 1). The number of vehicles was counted, passing through a trap length manually to obtain the traffic volume data. The speed of each vehicle in all categories was calculated by dividing the trap length (20 m) by entry and exit time difference crossing the trap length.

Table 1

Summary of speed and traffic volume measurement for 5 min interval

S. no	Vehicle category	Delhi–Haridwar and Delhi–Haridwar
		Average speed (km/h)				Traffic volume
		Min	Max	Mean	SD	Min	Max	Mean	SD
1	Car	14.123	43.373	18.140	4.699	38	60	46.254	4.387
2	Bus	9.56	31.4	16.000	5.716	4	24	10.550	3.325
3	Truck	11.1	39.29	16.034	6.352	1	17	5.847	2.838
4	LCV	13.09	29.95	19.549	4.914	3	23	8.847	2.707
5	3-Wheeler	9.96	31.23	18.86	5.643	1	20	4.337	2.993
6	2-Wheelers	9.54	34.13	18.968	5.789	98	155	114.735	9.093
7	Cycle	9.11	22.29	12.861	3.843	5	25	15.0509	3.696
8	Rickshaw	8.59	19.63	11.256	2.899	3	21	13.2777	4.563
9	Tractor	9	20.68	14.342	2.888	1	12	3.643	2.068
10	ADV	0	1.42	0.2335	0.4199	0	2	0.4444	0.7573

Min, minimum; Max, maximum; SD, standard deviation

The measured data has been used in the same fashion mentioned by de Luca et al. [24] . Data extraction was done in the intervals of 5 min for both directions. Statistical characteristics of extracted data are mentioned in Table 1. Since data were collected at the same day same location at two peak times (morning and late afternoon) for the period of three hours for all days, approximately 100 data samples were obtained on each day by considering both sides of the road compositely. Therefore, a total of 15,069 data samples were obtained for the entire duration.

In this study, 22 parameters were taken into consideration to create database for ANN modeling of traffic volume that includes the frequencies of all category of vehicles. For pre-processing of the dataset, the whole exemplars (dataset) were first randomized and then divided into three data sets. First dataset was taken as the training set, second dataset for cross validation and third dataset was used for testing purpose. Randomization is used to stop bias in the dataset and create different samples as a representative of the entire population. By dividing the dataset, 10% of the samples were used for cross-validation, 10% for testing and 80% were used for training purposes. The criterion in separating the data was to assign sufficient samples for the ANN training and some for cross validation and testing.

Model development

ANN has the potential to perceive the non-linear relationship between input and output features and can provide generalize solutions to forecast traffic volume. Multi-Layer Perceptron (MLP) is one of the popular network structure of ANN with an additional layer called hidden layer. MLP can be used to solve different problems because of non-linear characteristic of activation function between its layers of processing elements. The selection of activation function plays a critical role in the performance of a neural network. The error is calculated at each epoch by comparing computed output of each input with expected output. Back propagation is widely used technique to propagate the error. Each processing unit is initially assigned a random weight. The main objective of neural network optimization process is to minimize the mean square error in training, cross validation and testing phase.

There is no limitation on selection of number of input variables in ANN modeling. The selection of number of input and output variables depends on the type of problem. In literature, there is no general approach for creation of perfect neural network architecture. Trial and error simply means that initially we have to decide the weight parameters for each neuron in the hidden layer at random. Further, these weights are modified by propagating the prediction error backwards. Certain parameters like number of input variables, number of hidden layers, activation or transfer function and learning rate plays an important role in designing neural network architecture. The general architecture of ANN is illustrated in Fig. 2.

Previously, ANN model was used by Kumar et al. [17] for the short term traffic flow predictions for 4 lane highway. Here, we are trying to use ANN model to investigate its performance for short term traffic prediction on 2 lane undivided highway with mixed traffic conditions.

An ANN with one hidden layer can be defined as a function

$$y : Z^{A} \to Z^{B}$$

(1)

where, A and B are the length of input and output vector f(x), respectively.

In matrix form, it can be defined as:

$$y\left( x \right) = \varphi \left( {b^{\left( 2 \right)} + W^{\left( 2 \right)} h\left( x \right)} \right)$$

(2)

where,

$$h\left( x \right) = \delta \left( {b^{\left( 1 \right)} + W^{\left( 1 \right)} x} \right)$$

(3)

b⁽¹⁾, b⁽²⁾ are bias vectors, W⁽¹⁾, W⁽²⁾ are weight matrices and $\varphi$ and $\delta$ are activation functions.

Some of the popular activation functions are sigmoid (x) and tanh (x) used in this study. Equations 4 and 5 provide the mathematical notation and Fig. 3a, b provide graphical illustration for tanh (x) and sigmoid (x) respectively.

$$y\left( {x_{i} } \right) = \tanh \left( {x_{i} } \right)$$

(4)

$$y\left( {x_{i} } \right) = \frac{1}{{\left( {1 + e^{{ - x_{i} }} } \right)}}$$

(5)

The above two activations functions are both sigmoid except that first one is hyperbolic tangent in the range [− 1, + 1] and second one is a logistic function within range [0, 1].

Initially, random weights are assigned to the hidden layer because it is very difficult to identify the accurate weight parameters. Therefore, a loss function is required to adjust the weight parameters accurately. This loss function (Eq. 6) calculates the error between predicted output and exact output and then propagates the error backwards.

$$L_{f} = \frac{1}{2}\sum (y - y^{\prime})^{2}$$

(6)

$y^{\prime}$ is the predicted output and y is the actual output. The goal is to minimize the loss function by changing the weight matrix. The weight matrix can be changed using gradient decent method. It tries to find the rate of change of error for a specific weight in the error. Weight matrix can be updated using weight update equation as given in Eq. 7.

$$W_{ab} = W_{ab} - \Delta W_{ab}$$

(7)

where, $W_{ab}$ represents the connection weight from a neuron in layer a to layer b and $\Delta W$ is given by,

$$\Delta W = - \theta \frac{{\partial L_{f} }}{{\partial W_{ab} }}$$

(8)

$\theta$ is the rate of learning.

Development of ANN models

In this study, multilayer perceptron network has been used for the prediction of traffic flow for 5 min in future using past 55 min data. For development of ANN model, 216 data samples have been taken, each of which contained 22 features i.e. location, time of day, 10 vehicles categories and respective average speed of each vehicle category. Single class speed flow model is not sufficient for explaining traffic conditions in India because we do not have single road just for one type of vehicle. Moreover, all kind of vehicles (motorized and non-motorized) share the same road and there is a variation in their speed. Therefore, a single class speed flow model is not applicable in this study. Therefore average speed of different class of vehicles is considered to predict the multiclass traffic flow of undivided two lane highway. The development and implementation of the ANN model was done Anaconda Spyder 3.6 version using Scikit Learn package. The best performing neural network structure is obtained by getting the best values of network parameters for the training and the testing. Due to failure of getting appropriate values of network parameters by using other approaches available in literature [25‐27] [S, T, U], the trial and error approach has been used. The stopping criterion during training was the least mean square error (MSE).

Twelve different ANN models have been developed to train on the dataset. The specification of 12 models with different structures has been presented in Table 2. From the Table 2 it is clear that neural network with 7 hidden neurons gives the best prediction result. Thus, architecture of ANN model in the present study has 22 inputs, 7 neurons in hidden layer and single output. The performance of the ANN models were determined using cross validation and testing data sets. Coefficient of correlation (r), mean absolute error (MAE), mean square error (MSE) and Normalized mean square error (NMSE) were used to evaluate the performance of predicted results.

Table 2

Different ANN networks architectures for traffic volume prediction

Model	Algorithm	Hidden layer	hidden neurons	Transfer Function	Epochs	Learning	Step size/Mo	Training		Cross validation		Testing
Model	Algorithm	Hidden layer	hidden neurons	Transfer Function	Epochs	Learning	Step size/Mo	Min MSE (*10⁴)	Final MSE (*10⁴)	Min MSE (*10⁴)	Final MSE (*10⁴)	MSE	NMSE (*10⁴)	MAE	R (%)
M1	MLP	1	4	Tanh	100	LM	–	5.9	5.9	12.5	10.37	4.66	47.99	1.93	98.8
M2	MLP	1	4	Tanh	200	LM	–	5.3	5.3	15	7.13	6.00	61.81	2.14	98.4
M3	MLP	1	4	Tanh	300	LM	–	4	4	21.1	20.94	3.25	33.49	1.26	98.7
M4	MLP	1	5	Tanh	100	LM	–	3.6	3.8	20.24	8.371	4.79	49.34	1.84	99.2
M5	MLP	1	5	Tanh	200	LM	–	2.8	2.8	16.05	16.866	19.8	204.4	3.73	93.7
M6	MLP	1	5	Tanh	100	MO	1/0.7	29.9	29.9	23.19	2.506	4.12	42.43	1.64	98.3
M7	MLP	1	5	Tanh	200	MO	1/0.7	27.3	27.3	25.8	3.3	7.10	73.12	2.06	96.6
M8	MLP	1	3	SigmoidAxon	100	LM	–	2.6	2.6	3.09	0.832	7.980	82.1	2.32	97.9
M9	MLP	1	3	SigmoidAxon	200	LM	–	1.6	1.8	2.63	0.263	17.66	181.8	3.13	92.7
M10	MLP	1	5	SigmoidAxon	100	LM	–	0.81	0.8141	3.73	1.883	15.012	154.55	3.44	95.6
M11	MLP	1	5	SigmoidAxon	200	LM	–	0.03	0.0358	4.92	13.479	151.9	1564.4	11.18	25.5
M12	MLP	1	7	SigmoidAxon	150	LM	–	0.36	0.366	4.7	2.89	0.816	8.4	0.77	99.8

Sensitivity analysis of traffic volume parameter to input

Sensitivity analysis measures the variation in the performance of model with a change in input value [28, 29]. Irrelevant inputs can be eliminated by implementing sensitivity analysis on a trained network. Eliminating irrelevant inputs may result in reduced data collection cost and improved network’s performance. Moreover, sensitivity investigation gives understanding of the fundamental relations between input variables and output. In this investigation, ANN model (Model 12) was applied for sensitivity investigation. Sensitivity investigation was achieved about the mean on the pre-trained MLP network. This batch starts by changing first input between its mean ± 5 while all other inputs were stable at their respective means. The network output was calculated for hundred steps above and below the mean. This procedure was then done repeatedly for each input. Figure 4 shows the deviation of output with respect to deviation of each input. According to sensitivity investigation, thirteen most significant inputs factors are Time, CR/JP/VN, BUS, TRUCK, MB/TT, 3 W, ST/MT, BCYCLE, PRICSHAW, TT, BCART/HCART, SPRICSHAW, STT (as depicted in Fig. 4). In next stage, neural network was trained and verified with same ANN structure as the best selected ANN model (Model 12) considering only the 13 most important input parameters under the sensitivity investigation. Output of training, cross validation and testing stage of the new sensitivity model are described in Table 3. Table 3 illustrated that the sensitivity model does not perform well compared to the best performing proposed ANN model i.e. model 12 even after suppressing number of input variables from 22 to 13.

Table 3

Performance analysis between proposed ANN Model and sensitivity model

Model/parameter	Minimum MSE (T)	Final MSE (T)	Minimum MSE (X)	Final MSE (X)	MSE (Y)	NMSE (Y)	MAE (Y)	r (Y) (%)
Proposed model	3.67E−05	3.67E−05	0.00047	0.00289	0.81655	0.00840	0.7711	99.8
Sensitivity based model	9.46E−05	9.46E−05	0.00040	0.00095	0.91439	0.00941	0.6876	99.5

Results and discussion

Several ANN models have been developed and trained on the data. All these models were trained at different epochs to adjust the weight parameters in the network. Figure 5 illustrates the performance of best 6 models trained on different number of epochs. It can be seen in Fig. 5 that an increase in the number of epochs reduced the MSE and improved the performance of the model both in training and validation phase. It can be seen in Fig. 5 that prediction model M12 achieved the minimum MSE in comparison to other models. Figure 6 shows the regression plot between observed and simulated traffic volume. It can be seen that model M12 achieved the highest prediction accuracy among other models and its R² value is 0.9919 (higher than other models). R² can be defined as a coefficient of determination that illustrates how nicely the regression line fitting the data. It is considered that more closed the value of R² to 1, the better the prediction model would be. Further, the performance of each developed model is evaluated using mean absolute error (MAE), mean absolute percentage error (MAPE), Theils U Statistic (U1, U2), cumulative forecast error (CFE) and variance of absolute percentage error (VAPE). MAE, MAPE, Theils U Statistic (Eqs. 9 and 10) measure the accuracy of prediction and VAPE is used to measure the prediction stability and CFE is used for bias estimation.

Table 4 provides the value for all these parameters for different ANN models. Table 4 illustrated that the Model M12 achieved the best score for all parameters in comparison to other developed models.

$$U_{1} = \frac{{\left( {\mathop \sum \nolimits_{i = 1} \left( {P_{i} - A_{i} } \right)^{2} } \right)^{1/2} }}{{\left( {\mathop \sum \nolimits_{i = 1} A_{i}^{2} } \right)^{1/2} }}$$

(9)

$$U_{2} = \frac{{\left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} \left( {A_{i} - P_{i} } \right)^{2} } \right)^{1/2} }}{{\left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} A_{i}^{2} )^{1/2} + \left( {\frac{1}{n}\mathop \sum \nolimits_{i = 1} P_{i}^{2} } \right)} \right)^{1/2} }}$$

(10)

Table 4

Statistical indices of different models

Model	Error				CFE	Theil’s U statistic
Model	MAE	MAPE (%)	VAPE (%)	MPE	CFE	U1	U2
M1	1.93	0.9010	0.0221	− 0.6872	− 32.068	0.0031	0.0450
M2	2.14	0.9917	0.0265	− 0.8014	− 38.081	0.0056	1.3689
M3	1.26	0.5889	0.0287	− 0.4358	− 20.459	0.0041	0.9605
M4	1.84	0.0386	0.0029	− 0.8498	− 40.538	0.0050	0.0441
M5	3.73	1.7325	0.2774	− 1.1564	− 52.574	0.0102	0.9964
M6	1.64	0.7689	0.0268	− 0.3665	− 16.559	0.0046	0.0425
M7	2.06	0.9546	12.690	− 0.1373	− 5.1569	1.7368	0.0563
M8	2.32	1.0963	0.0369	− 0.9114	− 42.496	3.08E−05	0.0588
M9	3.13	1.4442	0.0607	− 0.5223	− 25.028	0.0096	0.0811
M10	11.18	5.2393	0.1217	− 3.7443	− 169.104	0.0279	0.2455
M11	11.41	5.3393	0.1241	− 3.6870	− 165.905	0.0285	0.2539
M12	0.77	0.3569	0.0102	− 0.3158	− 14.937	0.0020	0.0180

In the above equations, A and P denote the changes in actual values and predicted change in values. U₁ and U₂ is a measure of prediction accuracy and quality respectively. Equations 11–15 defines the MAE, MPE, MAPE, VAPE and CFE respectively.

$$MAE = \frac{1}{N}\sum \left| {P_{i} - A_{i} } \right|$$

(11)

$$MPE = \frac{1}{N}\sum \frac{{\left( {A_{i} - P_{i} } \right)^{2} }}{{A_{i} }} \times 100$$

(12)

$$MAPE = \frac{1}{N}\sum \frac{{\left| {(A_{i} - P_{i} )^{2} } \right|}}{{A_{i} }} \times 100$$

(13)

$$VAPE = Var\left( {\frac{{\left| {P_{i} - A_{i} } \right|}}{{A_{i} }}} \right) \times 100$$

(14)

$$CFE = \sum \left( {A_{i} - P_{i} } \right)$$

(15)

Performance comparison of proposed traffic volume prediction model has been done using the approaches used in earlier studies (Table 5). Random forest and regression trees, [30], SVM regression, [31], K-nearest neighbors, [32], Multiple linear regression, [33] have been developed on the dataset. It is found that the BPANN (M12) model can predict traffic volume more accurately than other approaches (as shown in Table 5). It is obvious from the statistical indices that ANN based model is robust and stable, and can be applied successfully for short term traffic flow prediction in Indian traffic conditions. ANN modeling and integrated sensitivity analysis is used in the present study. It is one of the most systematic and accurate way to predict the performance of traffic volume.

Table 5

Performance comparison between proposed ANN based model and traditional models

Models	MSE	RMSE	MAE	RSE	RRSE	RAE	R²
Random forest	217.718	14.7553	10.9593	0.2284	0.4779	0.4400	0.7716
Regression tree	388.380	19.7074	14.4908	0.4074	0.6383	0.5817	0.5926
SVM regression	898.723	29.9787	23.3781	0.9428	0.9710	0.9385	0.0572
K nearest neighbors Regression (KNN)	14.1294	3.7589	1.4239	0.0148	0.1217	0.0572	0.9852
Multiple linear regression	305.536	17.4796	13.3072	0.3205	0.5661	0.5342	0.6795
BP neural network	4.2848	2.06999	0.77114	0.09951	0.31546	0.16204	0.9962

In this study, 7 nodes have been taken into consideration for hidden layer which provides good results. However, this number is not an optimal selection for any model and can be selected as per the requirements and the size of the data. The back propagation algorithm can be utilized in Advanced Traveller Information Systems (ATIS) because of its efficiency in reducing prediction error. ATIS is a system which makes prediction based on the information stored in the database [34]. More the predictions are accurate, better the suggestions of ATIS would be. Therefore, the thirst for developing more robust and accurate short term prediction model is in demand. Our model achieved the better R² score (0.9962) among other models for short term traffic flow prediction on two lane undivided highways with mixed traffic conditions. Moreover, if accuracy can be sacrificed a little and processing time is more important, then methods like SVM and conventional regression-b estimators can be a good options rather than going with ANN.

Conclusion

This study presented ANN based short term traffic volume prediction model for undivided two lane highways with mixed traffic in India. The data samples were collected on NH-58 highways stretch from Roorkee to Hardwar. The study used back propagation neural network approach to develop a short term forecasting model for two lane undivided highways with mixed traffic conditions. The major advantage of back-propagation neural network is that it calculates the prediction error and propagates back to the previous layers in order to modify the weights; resulting better prediction accuracy with more training. The training can be stopped after certain number epochs with no further improvements in prediction accuracy. The results of best back propagation model (M12) was quite promising as it achieved a good R² value of 0.9962. This study can be useful to be used in Advanced Traveller Information Systems for short term traffic predictions.

The study is certainly capable to provide a good solution for short term traffic prediction for two lane undivided highways with mixed traffic conditions in India. But the dataset used for this study is restricted to a limited portion of highway stretch and days of week. The study can be enhanced by using more informative data about traffic flow during weekdays and weekends, for peak hours and normal hours, for different months or seasons. Although this data collection procedure requires a lot of human and technology efforts but it will certainly help in more informative and large dataset. Moreover, this large data set needs to be analyzed with more suitable algorithms i.e. deep neural networks which are known to handle large dataset efficiently. The future scope of this study would be to work on the above mentioned limitations of the study and provide a better solution.

Authors’ contributions

BS and PY have collected the data from the sites and perform conceptualization. BS, SK and PT performed the analysis and experiments. BS and SK wrote the manuscript. The whole work was supervised by MN. All authors read and approved the final manuscript.

Acknowledgements

The authors gratefully acknowledge the financial support of the Ministry of Education and Science of the Russian Federation in the framework of Increase Competitiveness Program of NUST « MISiS » (No. К4-2017-052).

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The authors are not authorized to share the data. However, full support regarding the study will be provided for interested readers.

Funding

The research receive no external funding. The publication costs will be covered by personal funds of the authors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Vorheriger Artikel A method of trend forecasting for financial and geopolitical data: inferring the effects of unknown exogenous variables

Nächster Artikel Building efficient fuzzy regression trees for large scale and high dimensional problems

UTIC: Urban transportation in Indian Cities, In: Compendium of good practices. New Delhi: National Institute of Urban Affairs, 2015.

Bhaskar G, Raghu BA. WRTS: wireless sensor based real time traffic information system. Int J Comput Appl Technol Res. 2013;2(4):481–6. http://ijcat.com/archives/volume2/issue4/ijcatr02041016.pdf

Sun S, Zhang C, Yu G. A Bayesian network approach to traffic flow forecasting. Transp Syst IEEE Trans. 2006;7(1):124–32.CrossRef

Zeng D, Xu J, Gu J, Liu L, Xu G. Short term traffic flow prediction using hybrid ARIMA and ANN models. In: Workshop on power electronics and intelligent transportation system. 2008. p. 621–5.

Zheng W, Lee DH, Shi Q. Short-term freeway traffic flow prediction: Bayesian combined neural network approach. J Transp Eng. 2006;132:114–21.CrossRef

Xie Y, Zhang Y, Ye Z. Short-term traffic volume forecasting using Kalman filter with discrete wavelet decomposition. Comput Aided Civil Infrastruct Eng. 2007;22(5):326–34.CrossRef

Sun H, Liu HX, Xiao H, He RR, Ran B. Use of local linear regression model for short-term traffic forecasting. Transp Res Rec. 2003;1836:143–50.CrossRef

Chen H, Grant-Muller S. Use of sequential learning for short-term traffic flow forecasting. Trans Res C. 2001;9(5):319–36.CrossRef

Zhang Y, Ye Z. Short-term traffic flow forecasting using fuzzy logic system methods. J Intell Transp Syst. 2008;12(3):102–12.MATHCrossRef

10.

Kumar S, Toshniwal D. Severity analysis of powered two wheeler traffic accidents in Uttarakhand, India. Eur Transp Res Rev. 2017;9(2):24.CrossRef

11.

Dougherty MS, Cobbett MR. Short-term inter-urban traffic forecasts using neural networks. Int J Forecast. 1997;13(1):21–31.CrossRef

12.

Kirby HR, Watson SM, Dougherty MS. Should we use neural networks or statistical models for short-term motorway traffic forecasting? Int J Forecast. 1997;13(1):43–50.CrossRef

13.

Dia H. An object-oriented neural network approach to short-term traffic forecasting. Eur J Oper Res. 2001;131(2):253–61.MATHCrossRef

14.

Wang J, Shi Q. Short-term traffic speed forecasting hybrid model based on Chaos-wavelet analysis-support vector machine theory. Transp Res C. 2013;27(1):219–32.CrossRef

15.

Theja PV, Vanajakshi L. Short term prediction of traffic parameters using support vector machines technique. In: Proceedings of the third international conference on emerging trends in engineering and technology. 19–21 November 2010, Goa, India. p. 70–5.

16.

Centiner BG, Sari M, Borat O. A neural network based traffic-flow prediction model. Math Comput Appl. 2010;15(2):269–78.

17.

Kumar K, Parida M, Katiyar VK. Short term traffic flow prediction in heterogeneous condition using Artificial Neural Network. Transport. 2015;30(4):397–405.CrossRef

18.

Guo J, Huang W, Williams BM. Adaptive Kalman filter approach for stochastic short term traffic flow rate prediction and uncertainty quantification. Trans Res C. 2014;43(1):50–64.CrossRef

19.

Habtemichael FG, Cetin M. Short term traffic flow rate forecasting based on identifying similar traffic patterns. In: Transportation research Part C: emerging technologies. 2016. p. 61–78.

20.

Ma M, Liang S, Guo H, Yang J. Short term traffic flow prediction using a self-adaptive two dimensional forecasting method. Adv Mech Eng. 2017;9(8):1.

21.

Guo J, Liu Z, Huang W, Wei Y, Cao J. Short–term traffic flow prediction using fuzzy information granulation approach under different time intervals. IET Intel Transport Syst. 2018;12(2):143–50.CrossRef

22.

Sharma N, Arkatkar SS, Sarkar AK. Study on heterogeneous traffic flow characteristics of a two-lane road. Transport. 2011;26(2):185–96.CrossRef

23.

Gupta AK, Redhu P. Analysis of driver’s anticipation effect in sensing relative flux in a new lattice model for two lane traffic system. Phys A. 2013;392:5622.CrossRef

24.

de Luca M, Dell’Acqua G, Lamberty R. Road safety analysis using operating speeds: case studies in southern Italy. Proc Soc Behav Sci. 2012;53:702–10.CrossRef

25.

Berry MJA, Linoff G. Data mining techniques. New York: Wiley; 1997.

26.

Blum A. Neural networks in C++. New York: Wiley; 1992.

27.

Boger Z, Guterman H. Knowledge extraction from artificial neural network models. IEEE systems, man, and cybernetics conference. Orlando, FL; 1997.

28.

Saltelli A, Ratto M, Andres T. “Global sensitivity analysis”, the Primer. New York: Wiley; 2008.MATH

29.

Yeung DS, Cloete I, Shi D, Ng WWY. Sensitivity analysis for neural networks, natural computing series. Berlin: Springer; 2010.MATHCrossRef

30.

Hou Y, Edara P, Sun C. Traffic flow forecasting for urban work zones. In: IEEE transactions on intelligent transportation systems, 16(4): 1761–70. http://citynet-ap.org/wp-content/uploads/2015/05/GP-IN1-UT.pdf

31.

Wei D, Liu H. An adaptive-margin support vector regression for short-term traffic flow forecast. J Intell Transp Syst. 2013;7(4):317–27.CrossRef

32.

Zhong JT, Ling S. Key factors of K-nearest neighbors nonparametric regression in short-time traffic flow forecasting. In: 21st international conference on industrial engineering and engineering management 2014 (IEEM 2014).

33.

Cinsdikici M, Memis K. Traffic flow evaluation at a short section using multiple regression approach. In: 7th European congress and exhibition on ITS (Intelligent Transport Systems and Services). 2008.

34.

Lingras PJ, Sharma SC. Short-term traffic volume forecasts: existing and future research. In: Proc., Canadian Society of Civil Engineers Annual Conference. Vol. IV, Regina, Saskatchewan, Canada, 1999, p. 429–38. http://cs.smu.ca/~pawan/research/csce99.rtf

Titel: ANN based short-term traffic flow forecasting in undivided two lane highway
verfasst von: Bharti Sharma
Sachin Kumar
Prayag Tiwari
Pranay Yadav
Marina I. Nezhurina
Publikationsdatum: 01.12.2018
Verlag: Springer International Publishing
Erschienen in: Journal of Big Data / Ausgabe 1/2018
Elektronische ISSN: 2196-1115
DOI: https://doi.org/10.1186/s40537-018-0157-0

Springer Professional

ANN based short-term traffic flow forecasting in undivided two lane highway

Abstract

Introduction

Literature review

Materials and methods

Data collection and preprocessing

Model development

Development of ANN models

Sensitivity analysis of traffic volume parameter to input

Results and discussion

Conclusion

Authors’ contributions

Acknowledgements

Competing interests

Availability of data and materials

Funding

Publisher’s Note

Premium Partner

Springer Professional

Abstract

Introduction

Literature review

Materials and methods

Data collection and preprocessing

Model development

Development of ANN models

Sensitivity analysis of traffic volume parameter to input

Results and discussion

Conclusion

Authors’ contributions

Acknowledgements

Competing interests

Availability of data and materials

Funding

Publisher’s Note

Weitere Artikel der Ausgabe 1/2018

Bayesian count regression analysis for determinants of antenatal care service visits among pregnant women in Amhara regional state, Ethiopia

Cross-domain similarity assessment for workflow improvement to handle Big Data challenge in workflow management

Evaluation of high-level query languages based on MapReduce in Big Data

A new approach to the space–time analysis of big data: application to subway traffic data in Seoul

Building efficient fuzzy regression trees for large scale and high dimensional problems

The MapReduce-based approach to improve the shortest path computation in large-scale road networks: the case of A* algorithm

Premium Partner