SeqViews2SeqLabels: Learning 3D global features via aggregating sequential views by RNN with attention

doi:10.1109/TIP.2018.2868426

UM > Faculty of Science and Technology > DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE

Residential College	false
Status	已發表Published
	SeqViews2SeqLabels: Learning 3D global features via aggregating sequential views by RNN with attention
	Han Z.4; Shang M.4; Liu Z.2; Vong C.-M.3 ; Liu Y.-S.4; Zwicker M.1; Han J.2; Chen C.L.P.3
	2019-02-01
Source Publication	IEEE Transactions on Image Processing
ISSN	1057-7149
Volume	28 Issue:2 Pages:658-672
Abstract	Learning 3D global features by aggregating multiple views has been introduced as a successful strategy for 3D shape analysis. In recent deep learning models with end-to-end training, pooling is a widely adopted procedure for view aggregation. However, pooling merely retains the max or mean value over all views, which disregards the content information of almost all views and also the spatial information among the views. To resolve these issues, we propose Sequential Views To Sequential Labels (SeqViews2SeqLabels) as a novel deep learning model with an encoder-decoder structure based on recurrent neural networks (RNNs) with attention. SeqViews2SeqLabels consists of two connected parts, an encoder-RNN followed by a decoder-RNN, that aim to learn the global features by aggregating sequential views and then performing shape classification from the learned global features, respectively. Specifically, the encoder-RNN learns the global features by simultaneously encoding the spatial and content information of sequential views, which captures the semantics of the view sequence. With the proposed prediction of sequential labels, the decoder-RNN performs more accurate classification using the learned global features by predicting sequential labels step by step. Learning to predict sequential labels provides more and finer discriminative information among shape classes to learn, which alleviates the overfitting problem inherent in training using a limited number of 3D shapes. Moreover, we introduce an attention mechanism to further improve the discriminative ability of SeqViews2SeqLabels. This mechanism increases the weight of views that are distinctive to each shape class, and it dramatically reduces the effect of selecting the first view position. Shape classification and retrieval results under three large-scale benchmarks verify that SeqViews2SeqLabels learns more discriminative global features by more effectively aggregating sequential views than state-of-the-art methods.
Keyword	3d Feature Learning Attention Rnn Sequential Labels Sequential Views View Aggregation
DOI	10.1109/TIP.2018.2868426
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Computer Science ; Engineering
WOS Subject	Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS ID	WOS:000446255300010
Scopus ID	2-s2.0-85052842548
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding Author	Liu Y.-S.
Affiliation	1.University of Maryland 2.Northwestern Polytechnical University 3.Universidade de Macau 4.Tsinghua University
Recommended Citation GB/T 7714	Han Z.,Shang M.,Liu Z.,et al. SeqViews2SeqLabels: Learning 3D global features via aggregating sequential views by RNN with attention[J]. IEEE Transactions on Image Processing, 2019, 28(2), 658-672.
APA	Han Z.., Shang M.., Liu Z.., Vong C.-M.., Liu Y.-S.., Zwicker M.., Han J.., & Chen C.L.P. (2019). SeqViews2SeqLabels: Learning 3D global features via aggregating sequential views by RNN with attention. IEEE Transactions on Image Processing, 28(2), 658-672.
MLA	Han Z.,et al."SeqViews2SeqLabels: Learning 3D global features via aggregating sequential views by RNN with attention".IEEE Transactions on Image Processing 28.2(2019):658-672.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh