Real time classification and tracking of multiple vehicles in highways

doi:10.1016/j.patrec.2005.01.010

Pattern Recognition Letters

Volume 26, Issue 10, 15 July 2005, Pages 1597-1607

https://doi.org/10.1016/j.patrec.2005.01.010 Get rights and content

Abstract

Real time road traffic monitoring is one of the challenging problems in machine vision, especially when one is using commercially available PCs as the main processor. In this paper, we describe a real-time method for extracting a few traffic parameters in highways such as, lane change detection, vehicle classification and vehicle counting. In addition, we will explain a real time method for multiple vehicles tracking that has the capability of occlusion detection. Our tracing algorithm uses Kalman filter and background differencing techniques. We used morphological operations for vehicle contour extraction and its recognition. Our algorithm has three phases, detection of pixels on moving objects, detection of a “Shape of Interest” in frame sequences and finally determination of relation among objects also in frame sequences. Our system is implemented on a PC with Pentium II 800 MHZ CPU. Its processing speed was measured to be 11 frames per second. The accuracy of measurement was 96%.

Introduction

High speed processing of image frame sequences is highly important for many real time computer vision algorithms. Image sequence analysis provides intermediate results for conceptual description of events in a scene. In vehicle tracking applications that uses image sequences, many methods are suggested. At first we can name model-based tracking methods in which a 3D model of vehicle is extracted (Coifman et al., 1998, Malik and Russell, 1997). One of the advantages of these methods is their high accuracy in determining the vehicle type and their detail geometric model. In fact, model-based tracking methods because of their high calculation cost, can be used only for free-flowing traffic with small number of vehicles.

In some other methods that are called feature-based (Roberts, 1994), a few features such as distinguishable lines or corners are extracted for each vehicle. Some of these features are grouped together to label a vehicle (Shi and Tomasi, 1994). One of the most important advantages of this type of methods is that, even in presence of partial occlusion, some of these features remain to be visible. On the other hand, they face problems in detecting features for individual vehicles that run close to each other. In region-based methods (Coifman et al., 1998, Setchell, 1997), vehicles are presented as blobs. In these methods, at first, connected components are extracted and then regions are merged or split ted if needed. The most serious weakness of these approaches is that merging and splitting regions can cause some inaccuracy in vehicle detection.

In addition, there are some methods in which the contour of vehicles is extracted. Although contours can be detected by simple edge detection methods, but these simple methods sometimes detect false edges of the background too. However, if more complex algorithms of edge detection such as active contours or snakes (Paragios and Deriche, 2002) are used, one should find a way to optimize the coding to make them usable for real time applications with commercially available processors. In practice there are many applications such as our system, that often one does not need to know the exact detail of vehicle type, but a general type category would be sufficient. In our system the surveillance CCD camera is installed in a relatively far distance from a highway and the vehicles are visualized as small objects with minimum detail on their geometrical model.

In this paper we introduce a novel real time machine vision system for classification and tracking of multiple vehicles and also determining some traffic parameters such as lane change and counting the number of vehicles passing the highway during a desired time interval. For tracking we used Kalman filter (Grewal and Angus, 1993) and background differencing techniques. Our algorithm takes advantage of region based and contour based methods by combining their ideas in order to detect a “Shape of Interest”, that in practice it is a bounding box around the vehicle. By using the bounding box and region boundary, the occlusion and overlapping of two regions are detected by examining the object shape and determining if it was the result of merging two or more vehicles and then deciding upon a proper split point to separate the merged vehicles.

Our system was implemented in Visual C++ using Matrox Meteor II frame grabber on a Pentium II 800 MHZ CPU. The input images were gray scale with eight bits per pixel resolution and of size 320 × 320. Our experimental results showed an accuracy of 96% when it was compared with the measurements done by a human expert. The initial version of this work is given in (Rad and Jamzad, 2003).

The rest of this paper is organized as follows: After a review of related works, our algorithm is described in three main sections, Change detection, Vehicle recognition (where our ideas for occlusion removal and vehicle classification are discussed) and Vehicle tracking. In Section 6, the experimental results is presented and finally the conclusion remarks is given in Section 7.

Section snippets

Related works

Many works have been reported for vehicle tracking from image sequences in machine vision and related topics literature. Vehicle detection is a fundamental component of image-based traffic monitoring system. Here we take a brief look at some of them. One such approach is to use background subtraction or optical flow for detection of moving objects (Javed and Shah, 2002, Gupte et al., 2002) and then tracking them. Methods based on background subtraction followed by object tracking do not suffer

Change detection

Detection of changes between sequences of frames is a major task in many machine vision applications. Methods that are based on sequence frame differencing and moving edge detection, have aperture problem and are sensitive to vehicle speed. In the following we show how to use background differencing method to group pixels in moving and non-moving category.

In this method, first we construct a background reference image of the road that has no moving vehicles in it. In order to avoid the problem

Vehicle recognition

As seen in Fig. 1(a), this binary image has several small noises in it. Our vehicle recognition algorithm assumes to receive as input a binary image that only has two groups of pixels. Pixels belonging to the background, and those belonging to the moving objects. This means that we have to modify Fig. 1(a) in such a way that it becomes completely noise free. For noise removal, we used Closing and Opening morphological operators. It is known that Closing fills little apertures and Opening

Vehicle tracking

For tracking objects in a sequence of frames, the relation between objects in two consecutive frames must be found and recorded. Doing this we developed three modules. The first one is a complete image search, in which the whole image is searched. In the second module, only the area on the road (i.e. excluding the background) is searched. And in the last module, a small area in which the vehicle might be seen in next frames was determined and the trajectory of each vehicle was tested.

These

Experimental results

In order to test our algorithm, we used image sequences of about 400 frames of a video tape captured by a traffic surveillance CCD camera. This camera was installed on a height and far distance from a wide highway in the city of Tehran. The recorded video showed two sides of the highway that has three lanes in each side. The average number of vehicles in a frame was measured to be 27.2. The mean processing speed of our algorithm measured on 400 consecutive frames is summarized in Table 1.

Conclusion

In this paper, we presented an algorithm for real time detection of vehicles, classification of their types, and tracking. Our system was implemented on commercially available PC and a frame grabber. Its processing speed is 10.99 fps. In this application, since video recording is done from a relatively far distance, the field of view of camera is large enough so that running vehicles remain in the field of view in such a period of time that the processing speed of about 11 frames per second

Acknowledgments

We would like to thank the Control Traffic Company of Tehran for their cooperation and providing the traffic video tapes.

References (19)

B. Coifman et al.
A real-time computer vision system for vehicle tracking and traffic surveillance
Transport. Res.: Part C
(1998)
D. Magee
Tracking multiple vehicles using foreground, background and motion models
Image Vis. Comput.
(2004)
N. Paragios et al.
Geodesic active regions: A new paradigm to deal with frame partition problems in computer vision
J. Visual Commun. Image Represent.
(2002)
Bennett, B., Magee, D., Cohn, A.G., Hogg, D.C. Using Spatio-temporal Continuity Constraints to Enhance Visual Tracking...
Cheung, S.-C., Kamath, C. Robust techniques for background subtraction in urban traffic video, Video Communications and...
R.C. Gonzalez et al.
Digital image processing
(2002)
M.S. Grewal et al.
Kalman filtering and practice
(1993)
S. Gupte et al.
Detection and classification of vehicles
IEEE Trans. Intell. Transport. Syst.
(2002)
Javed, O., Shah, M. Tracking and object classification for automated surveillance, Proceedings ECCV,...

There are more references available in the full text version of this article.

Cited by (85)

A longitudinal scanline based vehicle trajectory reconstruction method for high-angle traffic video
2019, Transportation Research Part C: Emerging Technologies
Citation Excerpt :
Compared with tracking a single object, the task for tracking multiple objects imposes more difficulties. A popular technique for vehicle tracking is the Kalman Filter (KF), which is considered an efficient approach for estimating the dynamic state of a system for tracking (Rad and Jamzad, 2005; Hsieh et al., 2006; Kim and Cao, 2010). Many researchers have tried to add some prior knowledge, such as movement constraints, locations, or past tracking histories to better approximate the object locations (Shi and Tomasi, 1994; Coifman et al., 1998; Azevedo et al., 2014).
In this paper, a robust and efficient High-angle Spatial-Temporal Diagram Analysis (HASDA) model is built to reconstruct high-resolution vehicle trajectories from infrastructure traffic surveillance videos. A combined methodology was developed, comprising of scanline-based trajectory extraction and feature-matching coordinate transformation. A scanline-based trajectory extraction technique is introduced to separate vehicle strands from pavement background on the spatial-temporal diagram by considering color features, gradient features, and motion features. Particular cleaning algorithms for removing static object noises, shadows, and occlusions are also established. Feature-matching coordinate transformation converts the pixel coordinates to the real-world coordinates to generate the physical vehicle trajectory. To evaluate the algorithm, generated trajectory results were compared to the reconstructed version of the Next Generation Simulation (NGSIM) dataset. 15-min NGSIM video was divided into a 5-min dataset for the calibration and the remaining 10-min data for evaluation. Model parameters calibrated based on the 5-min video data are then applied to the 10-min testing data. Two levels of performance measurements are considered to evaluate both trajectory-level and point-level results. A reference algorithm based on mainstream motion-based detection and tracking methods are used as a baseline algorithm. Based on the evaluation results, the proposed method shows promising trajectory detection results, that on average more than 90% of vehicle trajectories are constructed by the proposed methods from the NGSIM videos. The HASDA model outperforms the reference algorithm and shows superior transferability in the training-testing experiment. Further work needs to be done to improve the algorithm performance against shadows and occlusions by incorporating more intelligent and advanced techniques.
Multi-vehicle detection algorithm through combining Harr and HOG features
2019, Mathematics and Computers in Simulation
Citation Excerpt :
In fact, vehicle environment perception is based on primarily computer vision technology which can provide warning and auxiliary driving instructions to drivers, such as vehicle safety warnings, anti-collision warnings, autopilot and tracking assistance, through acquiring real-time information on the surrounding environment. Considering the importance of vehicle operations in transportation systems, vehicle environment perception has become a central issue internationally for improving the efficiency and safety of vehicle operations [15,25]. Computer vision technology based moving target detection is one of the basis of vehicle environment perception.
In order to achieve a better performance of detection and tracking of multi-vehicle targets in complex urban environment, we propose a two-step detection algorithm based on combining the features of Harr and Histogram of Oriented Gradients (HOG). This algorithm makes full use of HOG characteristic advantages for target vehicles, i.e., the good descriptive ability of HOG feature, and the prospect region of interest (ROI) can be extracted using Harr features. Moreover, the extracted HOG features from the ROI target area can be selected through applying the cascade structured AdaBoost classifier features and target area classification. Precise target can be further extracted by using support vector machine (SVM). Experimental results using video collected from real world scenarios are provided, showing that the proposed method possesses higher detecting accuracy and time efficiency than the conventional ones, and it can detect and track the multi-vehicle targets successfully in complex urban environment.
Machine learning and computer vision-enabled traffic sensing data analysis and quality enhancement
2018, Data-Driven Solutions to Transportation Problems
Traffic sensor data are essential for informed, scientific decision-making processes in traffic operation, pavement design, and transportation planning. In the current traffic detection infrastructure, inductance loop detectors and surveillance cameras are two commonly deployed sensors. In this study, a machine learning approach is developed to establish an artificial neural network (ANN) to better extract classified vehicle volumes from single-loop measurements. In addition, a set of computer vision-based algorithms are developed to extract background image from a video sequence, detect presence of vehicles, identify and remove shadows, and calculate pixel-based vehicle lengths for classification based on widely available surveillance camera signals. Machine learning and computer vision are two major artificial intelligent and advanced computing techniques, which can substantially revolutionize existing traffic sensing practices and theoretical foundations. The experimental tests indicated their favorable performance under various traffic operation scenarios. This chapter summarizes our continuous efforts in these promising areas, and contributes greatly to data-driven traffic science and application research.
Evaluating the accuracy of vehicle tracking data obtained from Unmanned Aerial Vehicles
2016, International Journal of Transportation Science and Technology
Citation Excerpt :
In their study, they estimate vehicle lengths of within 10% in each instance, deriving coordination mapping functions of a calibrated camera model. Rad and Jamzad (2005) performed a similar work developing an application able to count and classify vehicles and to identify the lane-changing events by tracking them. In their study, the background subtraction method combined with morphological operations has been applied in order to identify moving vehicles in regions.
This paper presents a methodology for tracking moving vehicles that integrates Unmanned Aerial Vehicles with video processing techniques. The authors investigated the usefulness of Unmanned Aerial Vehicles to capture reliable individual vehicle data by using GPS technology as a benchmark. A video processing algorithm for vehicles trajectory acquisition is introduced. The algorithm is based on OpenCV libraries. In order to assess the accuracy of the proposed video processing algorithm an instrumented vehicle was equipped with a high precision GPS. The video capture experiments were performed in two case studies. From the field, about 24,000 positioning data were acquired for the analysis. The results of these experiments highlight the versatility of the Unmanned Aerial Vehicles technology combined with video processing technique in monitoring real traffic data.
Estimation of Algorithms for Extracting Traffic Data Using Uav and Stationary Camera
2023, SSRN
Intelligent Traffic Surveillance through Multi-Label Semantic Segmentation and Filter-Based Tracking
2023, Computers, Materials and Continua

View all citing articles on Scopus

View full text

Real time classification and tracking of multiple vehicles in highways

Abstract

Introduction

Section snippets

Related works

Change detection

Vehicle recognition

Vehicle tracking

Experimental results

Conclusion

Acknowledgments

Transport. Res.: Part C

Image Vis. Comput.

J. Visual Commun. Image Represent.

Digital image processing

Kalman filtering and practice

Detection and classification of vehicles

IEEE Trans. Intell. Transport. Syst.