Semantic Depth mAP Fusion for Moving Vehicle Detection in Aerial Video

Abstract

Wide area motion imagery from an aerial platform offers a compelling advantage in providing a global picture of traffic flows for transportation and urban planning that is complementary to the information from a network of ground-based sensors and instrumented vehicles. We propose an automatic moving vehicle detection system for wide area aerial video based on semantic fusion of motion information with projected building footprint information to significantly reduce the false alarm rate in urban scenes with many tall structures. Motion detections are obtained using the flux tensor and combined with a scene level depth mask to identify tall structures using height information derived from a dense 3D point cloud estimated using multiview stereo from the same source imagery or a prior model. The trace of the flux tensor provides robust spatio-temporal information of moving edges including the motion of tall structures caused by parallax effects. The parallax induced motions are filtered out by incorporating building depth maps obtained from dense urban 3D point clouds. Using a level-set based geodesic active contours framework, the coarse thresholded tall structures depth masks evolved and stopped at the actual building boundaries. Experiments are carried out on a cropped 2k × 2k region of interest for 200 frames from Albuquerque urban aerial imagery. An average precision of 83% and recall of 76% have been reported using an object-level detection performance evaluation method.

Cite

Text

Poostchi et al. "Semantic Depth mAP Fusion for Moving Vehicle Detection in Aerial Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2016. doi:10.1109/CVPRW.2016.196

Markdown

[Poostchi et al. "Semantic Depth mAP Fusion for Moving Vehicle Detection in Aerial Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2016.](https://mlanthology.org/cvprw/2016/poostchi2016cvprw-semantic/) doi:10.1109/CVPRW.2016.196

BibTeX

@inproceedings{poostchi2016cvprw-semantic,
  title     = {{Semantic Depth mAP Fusion for Moving Vehicle Detection in Aerial Video}},
  author    = {Poostchi, Mahdieh and Aliakbarpour, Hadi and Viguier, Raphael and Bunyak, Filiz and Palaniappan, Kannappan and Seetharaman, Guna},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2016},
  pages     = {1575-1583},
  doi       = {10.1109/CVPRW.2016.196},
  url       = {https://mlanthology.org/cvprw/2016/poostchi2016cvprw-semantic/}
}