dc.contributor.advisor |
Hossen Asiful Mustafa, Dr. |
|
dc.contributor.author |
Mahedi Hasan, Md. |
|
dc.date.accessioned |
2021-10-19T04:44:26Z |
|
dc.date.available |
2021-10-19T04:44:26Z |
|
dc.date.issued |
2020-09-15 |
|
dc.identifier.uri |
http://lib.buet.ac.bd:8080/xmlui/handle/123456789/5880 |
|
dc.description.abstract |
Person re-identification (ReID) across multiple surveillance cameras with non-overlapping
fields of view is one of the most significant problems in real-world intelligent video
surveillance systems. Due to the unconstrained nature of the problem, gait-based per-
son recognition is the only likely identification method to solve the person ReID in
this situation. Furthermore, most of the existing ReID algorithms were designed for
closed-world scenarios that consider the same descriptors across the camera network re-
gardless of the dramatic change in view angle due to different camera positions, which
eventually cause them to perform poorly in real-world scenarios. To address this prob-
lem, therefore, in this thesis, we present a simple yet effective algorithm for robust gait
recognition for person ReID that addresses the challenges that arise from the real-world
multi-camera surveillance environment. In this approach, we first designed a novel low-
dimensional spatio-temporal feature vector that was extracted from the pose estimation
of raw video frames. In this research, we have developed a 50-dimensional feature
descriptor by concatenating four different types of spatio-temporal features. These fea-
tures are discriminant, and at the same time robust to the variations of different covari-
ate factors. Thereafter, a pose sequence having a timestep of length of 28 frames was
formed to feed into an RNN-based classifier network. The RNN network consists of two
BGRU layers each of which only has 80 GRU cells. The input layer was followed by a
batch normalization layer. The output of the recurrent layers was also batch normalized
to standardize the activations and finally fed into an output softmax layer. The major-
ity voting scheme was employed to process the output to predict the subject ID. For
multi-view gait recognition, we also propose a two-stage network in which we initially
identify the walking direction from gait video by employing a view angle identification
network. Here, the input of the network was a clip of 16 consecutive frames that were
preprocessed and resized to 112x112 to feed into a 3D convolutional network based on
C3D. The experimental evaluation conducted on two challenging CASIA A and CA-
SIA B gait datasets demonstrates that the proposed method has achieved state-of-the-art
performance on both single-view and multi-view gait recognition. The experimental re-
sult clearly confirms the effectiveness of our proposed approach when compared to the
other state-of-the-art methods. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Institute of Information and Communication Technology (IICT), BUET |
en_US |
dc.subject |
Pattern recognition systems |
en_US |
dc.title |
View invariant gait recognition for person re-identification in a multi surveillance camera environment |
en_US |
dc.type |
Thesis-MSc |
en_US |
dc.contributor.id |
1014312019 |
en_US |
dc.identifier.accessionNumber |
117637 |
|
dc.contributor.callno |
006.4/MAH/2020 |
en_US |