Residential College | false |
Status | 已發表Published |
Exposing the Self-Supervised Space-Time Correspondence Learning via Graph Kernels | |
Qin, Zheyun1; Lu, Xiankai1; Nie, Xiushan2; Yin, Yilong1; Shen, Jianbing3 | |
2023-06-27 | |
Conference Name | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
Source Publication | Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
Volume | 37 |
Pages | 2110-2118 |
Conference Date | 7 February 2023through 14 February 2023 |
Conference Place | Washington |
Publisher | AAAI Press |
Abstract | Self-supervised space-time correspondence learning is emerging as a promising way of leveraging unlabeled video. Currently, most methods adapt contrastive learning with mining negative samples or reconstruction adapted from the image domain, which requires dense affinity across multiple frames or optical flow constraints. Moreover, video correspondence predictive models require mining more inherent properties in videos, such as structural information. In this work, we propose the VideoHiGraph, a space-time correspondence framework based on a learnable graph kernel. Concerning the video as the spatial-temporal graph, the learning objectives of VideoHiGraph are emanated in a self-supervised manner for predicting unobserved hidden graphs via graph kernel manner. We learn a representation of the temporal coherence across frames in which pairwise similarity defines the structured hidden graph, such that a biased random walk graph kernel along the sub-graph can predict long-range correspondence. Then, we learn a refined representation across frames on the node-level via a dense graph kernel. The self-supervision of the model training is formed by the structural and temporal consistency of the graph. VideoHiGraph achieves superior performance and demonstrates its robustness across the benchmark of label propagation tasks involving objects, semantic parts, keypoints, and instances. Our algorithm implementations have been made publicly available at https://github.com/zyqin19/VideoHiGraph. |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85167697849 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Affiliation | 1.School of Software, Shandong University, China 2.School of Computer Science and Technology, Shandong Jianzhu University, China 3.SKL-IOTSC, CIS, University of Macau, Macao |
Recommended Citation GB/T 7714 | Qin, Zheyun,Lu, Xiankai,Nie, Xiushan,et al. Exposing the Self-Supervised Space-Time Correspondence Learning via Graph Kernels[C]:AAAI Press, 2023, 2110-2118. |
APA | Qin, Zheyun., Lu, Xiankai., Nie, Xiushan., Yin, Yilong., & Shen, Jianbing (2023). Exposing the Self-Supervised Space-Time Correspondence Learning via Graph Kernels. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37, 2110-2118. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment