View invariant gait recognition for person re-identification in a multi surveillance camera environment

BUET ILS
BUET Institutional Repository: Home
→
Dissertations/Theses
→
Dissertations/Theses - Institute of Information and Communication Technology
→
View Item

dc.contributor.advisor	Hossen Asiful Mustafa, Dr.
dc.contributor.author	Mahedi Hasan, Md.
dc.date.accessioned	2021-10-19T04:44:26Z
dc.date.available	2021-10-19T04:44:26Z
dc.date.issued	2020-09-15
dc.identifier.uri	http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5880
dc.description.abstract	Person re-identification (ReID) across multiple surveillance cameras with non-overlapping fields of view is one of the most significant problems in real-world intelligent video surveillance systems. Due to the unconstrained nature of the problem, gait-based per- son recognition is the only likely identification method to solve the person ReID in this situation. Furthermore, most of the existing ReID algorithms were designed for closed-world scenarios that consider the same descriptors across the camera network re- gardless of the dramatic change in view angle due to different camera positions, which eventually cause them to perform poorly in real-world scenarios. To address this prob- lem, therefore, in this thesis, we present a simple yet effective algorithm for robust gait recognition for person ReID that addresses the challenges that arise from the real-world multi-camera surveillance environment. In this approach, we first designed a novel low- dimensional spatio-temporal feature vector that was extracted from the pose estimation of raw video frames. In this research, we have developed a 50-dimensional feature descriptor by concatenating four different types of spatio-temporal features. These fea- tures are discriminant, and at the same time robust to the variations of different covari- ate factors. Thereafter, a pose sequence having a timestep of length of 28 frames was formed to feed into an RNN-based classifier network. The RNN network consists of two BGRU layers each of which only has 80 GRU cells. The input layer was followed by a batch normalization layer. The output of the recurrent layers was also batch normalized to standardize the activations and finally fed into an output softmax layer. The major- ity voting scheme was employed to process the output to predict the subject ID. For multi-view gait recognition, we also propose a two-stage network in which we initially identify the walking direction from gait video by employing a view angle identification network. Here, the input of the network was a clip of 16 consecutive frames that were preprocessed and resized to 112x112 to feed into a 3D convolutional network based on C3D. The experimental evaluation conducted on two challenging CASIA A and CA- SIA B gait datasets demonstrates that the proposed method has achieved state-of-the-art performance on both single-view and multi-view gait recognition. The experimental re- sult clearly confirms the effectiveness of our proposed approach when compared to the other state-of-the-art methods.	en_US
dc.language.iso	en	en_US
dc.publisher	Institute of Information and Communication Technology (IICT), BUET	en_US
dc.subject	Pattern recognition systems	en_US
dc.title	View invariant gait recognition for person re-identification in a multi surveillance camera environment	en_US
dc.type	Thesis-MSc	en_US
dc.contributor.id	1014312019	en_US
dc.identifier.accessionNumber	117637
dc.contributor.callno	006.4/MAH/2020	en_US