Crowded-scene surveillance can benefit significantly from combining an egocentric-view camera with a complementary top-view camera. A typical setting pairs an egocentric-view camera, e.g., a wearable camera on the ground that captures rich local details, with a top-view camera, e.g., a drone-mounted one at high altitude that provides a global picture of the scene. To collaboratively analyze such complementary-view videos, an important task is to associate and track multiple people across views and over time. This task is challenging and differs from classical human tracking, since we must not only track multiple subjects in each video but also identify the same subjects across the two complementary views. This paper formulates it as a constrained mixed-integer programming problem, in which a major challenge is how to effectively measure subject similarity over time within each video and across the two views. Although appearance and motion consistency work well for over-time association, they are poorly suited to connecting two highly different complementary views. To this end, we present a spatial-distribution-based approach for reliable cross-view subject association. We also build a dataset to benchmark this new, challenging task. Extensive experiments verify the effectiveness of our method.
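As a rough illustration of the cross-view association idea, the Python sketch below matches subjects between the two views by comparing a simple spatial-distribution descriptor (sorted distances to each subject's nearest neighbors) and solving a Hungarian assignment. This is only a minimal sketch under simplified assumptions, not the paper's constrained mixed-integer program: the function names (`spatial_descriptor`, `associate`), the neighbor-distance descriptor, and the assumption that both views yield comparable 2-D positions are all illustrative.

import numpy as np
from scipy.optimize import linear_sum_assignment

def spatial_descriptor(points, k=3):
    """For each 2-D point, the sorted distances to its k nearest
    neighbors -- a simple translation/rotation-invariant descriptor
    of the spatial distribution of subjects (an assumption, not the
    paper's exact formulation)."""
    d = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    d.sort(axis=1)                      # column 0 is the self-distance (0)
    return d[:, 1:k + 1]                # drop self, keep k neighbors

def associate(top_pts, ego_pts, k=3):
    """Match subjects across views by descriptor similarity.
    Returns (top_idx, ego_idx) index arrays of the matched pairs.
    Hungarian assignment stands in for the constrained MIP solver."""
    cost = np.linalg.norm(
        spatial_descriptor(top_pts, k)[:, None, :]
        - spatial_descriptor(ego_pts, k)[None, :, :], axis=-1)
    return linear_sum_assignment(cost)

# Toy usage: the ego-view layout is the top-view layout, shuffled + noise.
rng = np.random.default_rng(0)
top = rng.uniform(0, 10, size=(6, 2))
perm = rng.permutation(6)
ego = top[perm] + rng.normal(0, 0.05, size=(6, 2))
rows, cols = associate(top, ego)
print(perm[cols] == rows)               # True for every recovered match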
This paper is a substantial extension of a preliminary conference version (AAAI 2020), with a number of major changes.
@inproceedings{han2020cvmht,
  title={Complementary-View Multiple Human Tracking},
  author={Han, Ruize and Feng, Wei and Zhao, Jiewen and Niu, Zicheng and Zhang, Yunjun and Wan, Liang and Wang, Song},
  booktitle={AAAI Conference on Artificial Intelligence},
  year={2020}
}
@article{han2021CVMHAT,
  title={Multiple Human Association and Tracking from Egocentric and Complementary Top Views},
  author={Han, Ruize and Feng, Wei and Zhang, Yunjun and Zhao, Jiewen and Wang, Song},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2021}
}
Dataset link: https://pan.baidu.com/s/1dS9sGqxOcaDsxddl6r2OKA (password: CVHT).
The annotations are publicly available in the 'CVMHAT_dataset-anno' folder.
Code: mainly by Ruize Han (han_ruize@tju.edu.cn) and Jiewen Zhao (zhaojw@tju.edu.cn).