Multimodal deep autoencoder for human pose recovery

C Hong, J Yu, J Wan, D Tao… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
Video-based human pose recovery is usually conducted by retrieving relevant poses using
image features. In the retrieving process, the mapping between 2D images and 3D poses is …

A region-based image caption generator with refined descriptions

P Kinghorn, L Zhang, L Shao - Neurocomputing, 2018 - Elsevier
Describing the content of an image is a challenging task. To enable detailed description, it
requires the detection and recognition of objects, people, relationships and associated …

Deep view-sensitive pedestrian attribute inference in an end-to-end model

MS Sarfraz, A Schumann, Y Wang… - arXiv preprint arXiv …, 2017 - arxiv.org
Pedestrian attribute inference is a demanding problem in visual surveillance that can
facilitate person retrieval, search and indexing. To exploit semantic relations between …

Online feature selection with group structure analysis

J Wang, M Wang, P Li, L Liu, Z Zhao… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Online selection of dynamic features has attracted intensive interest in recent years.
However, existing online feature selection methods evaluate features individually and …

Fashion apparel detection: the role of deep convolutional neural network and pose-dependent priors

K Hara, V Jagadeesh… - 2016 IEEE Winter …, 2016 - ieeexplore.ieee.org
In this work, we propose and address a new computer vision task, which we call fashion item
detection, where the aim is to detect various fashion items a person in the image is wearing …

Hypergraph regularized autoencoder for image-based 3D human pose recovery

C Hong, X Chen, X Wang, C Tang - Signal processing, 2016 - Elsevier
Image-based human pose recovery is usually conducted by retrieving relevant poses with
image features. However, semantic gap exists for current feature extractors, which limits …

Dense-captionnet: a sentence generation architecture for fine-grained description of image semantics

I Khurram, MM Fraz, M Shahzad, NM Rajpoot - Cognitive Computation, 2021 - Springer
Automatic image captioning, a highly challenging research problem, aims to understand and
describe the contents of the complex scene in human understandable natural language. The …

Watch fashion shows to tell clothing attributes

S Zhang, S Liu, X Cao, Z Song, J Zhou - Neurocomputing, 2018 - Elsevier
In this paper, we propose a novel semi-supervised method to predict clothing attributes with
the assistance of unlabeled data like fashion shows. To this end, a two-stage framework is …

Multinomial latent logistic regression for image understanding

Z Xu, Z Hong, Y Zhang, J Wu, AC Tsoi… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
In this paper, we present multinomial latent logistic regression (MLLR), a new learning
paradigm that introduces latent variables to logistic regression. By inheriting the advantages …

Recognition of ethnic minority costumes based on an improved DenseNet-BC.

杨冰, 徐丹, 张豪远, 罗海妮 - Journal of Zhejiang University …, 2021 - search.ebscohost.com
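With the development of information technology, digital techniques are increasingly applied to the digital preservation of ethnic culture, and the digitization and
classification of ethnic costumes are drawing growing attention. Compared with ordinary clothing, ethnic minority costumes carry more detailed feature information …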