Depression level prediction using deep spatiotemporal features and multilayer bi-ltsm
Depression is a serious psychiatric disorder that restricts an individuals ability to work
properly in both their daily and professional lives. Usually, the diagnosis of depression often
needs a thorough assessment by an expert. Recently, significant consideration has been
given to automatic depression prediction for more reliable and efficient depression
investigation. In this article, we propose a novel framework to estimate the depression level
from video data by employing a two-stream deep spatiotemporal network. Our approach …
properly in both their daily and professional lives. Usually, the diagnosis of depression often
needs a thorough assessment by an expert. Recently, significant consideration has been
given to automatic depression prediction for more reliable and efficient depression
investigation. In this article, we propose a novel framework to estimate the depression level
from video data by employing a two-stream deep spatiotemporal network. Our approach …
Depression is a serious psychiatric disorder that restricts an individuals ability to work properly in both their daily and professional lives. Usually, the diagnosis of depression often needs a thorough assessment by an expert. Recently, significant consideration has been given to automatic depression prediction for more reliable and efficient depression investigation. In this article, we propose a novel framework to estimate the depression level from video data by employing a two-stream deep spatiotemporal network. Our approach extracts spatial information using the Inception-ResNet-v2 network. In contrast, we introduce a volume local directional number (VLDN) based dynamic feature descriptor to capture facial motions. Then, the feature map obtained from the VLDN is fed into a convolutional neural network (CNN) to obtain more discriminative features. Additionally, we designed a multilayer bidirectional long short-term memory (Bi-LSTM) model to obtain temporal information by integrating the temporal median pooling (TMP) approach into the model. The TMP approach is employed on the temporal fragments of spatial and temporal features. Finally, extensive experimental analysis of two challenging datasets, AVEC2013 and AVEC2014, demonstrates that the proposed approach shows promising performance compared to the existing approaches for depression level prediction.
ieeexplore.ieee.org