×
验证码:
换一张
Forgotten Password?
Stay signed in
Login With UMPASS
English
|
繁體
Login With UMPASS
Log In
ALL
ORCID
TI
AU
PY
SU
KW
TY
JN
DA
IN
PB
FP
ST
SM
Study Hall
Image search
Paste the image URL
Home
Faculties & Institutes
Scholars
Publications
Subjects
Statistics
News
Search in the results
Faculties & Institutes
THE STATE KEY LA... [5]
Faculty of Scien... [4]
Authors
SHEN JIANBING [5]
PUN CHI MAN [1]
Document Type
Conference paper [8]
Journal article [1]
Date Issued
2024 [5]
2023 [1]
2022 [3]
Language
英語English [9]
Source Publication
Proceedings of t... [5]
2024 Joint Inter... [1]
IEEE Transaction... [1]
Lecture Notes in... [1]
Proceedings of t... [1]
Indexed By
CPCI-S [5]
SCIE [1]
Funding Organization
Funding Project
×
Knowledge Map
UM
Start a Submission
Submissions
Unclaimed
Claimed
Attach Fulltext
Bookmarks
Browse/Search Results:
1-9 of 9
Help
Selected(
0
)
Clear
Items/Page:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
Sort:
Select
Issue Date Ascending
Issue Date Descending
Journal Impact Factor Ascending
Journal Impact Factor Descending
WOS Cited Times Ascending
WOS Cited Times Descending
Submit date Ascending
Submit date Descending
Title Ascending
Title Descending
Author Ascending
Author Descending
SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models
Conference paper
Huang, Yuzhou, Xie, Liangbin, Wang, Xintao, Yuan, Ziyang, Cun, Xiaodong, Ge, Yixiao, Zhou, Jiantao, Dong, Chao, Huang, Rui, Zhang, Ruimao, Shan, Ying. SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models[C]:IEEE Computer Society, 2024, 8362-8371.
Authors:
Huang, Yuzhou
;
Xie, Liangbin
;
Wang, Xintao
;
Yuan, Ziyang
;
Cun, Xiaodong
; et al.
Favorite
|
TC[Scopus]:
1
|
Submit date:2024/11/05
Training
Visualization
Computer Vision
Large Language Models
Diffusion Models
Cognition
Pattern Recognition
Instruction-based Image Editing
Multimodal Large Language Models
COMMA: Co-articulated Multi-Modal Learning
Conference paper
Hu, Lianyu, Gao, Liqing, Liu, Zekang, Pun, Chi Man, Feng, Wei. COMMA: Co-articulated Multi-Modal Learning[C]:Association for the Advancement of Artificial Intelligence, 2024, 2238-2246.
Authors:
Hu, Lianyu
;
Gao, Liqing
;
Liu, Zekang
;
Pun, Chi Man
;
Feng, Wei
Favorite
|
TC[Scopus]:
0
|
Submit date:2024/05/16
Cv: Language And Vision
Cv: Large Vision Models
Cv: Multi-modal Vision
Cv: Video Understanding & Activity Analysis
LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders
Conference paper
Sun, Xingwu, Yang, Zhen, Xie, Ruobing, Lian, Fengzong, Kang, Zhanhui, Xu, Chengzhong. LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders[C]:European Language Resources Association (ELRA), 2024, 10499-10510.
Authors:
Sun, Xingwu
;
Yang, Zhen
;
Xie, Ruobing
;
Lian, Fengzong
;
Kang, Zhanhui
; et al.
Favorite
|
TC[Scopus]:
0
|
Submit date:2024/07/04
Lightweight v&l Pre-training
Mask Autoencoder
Vision-language Pre-training
Relational Network via Cascade CRF for Video Language Grounding
Journal article
Zhang, Tong, Lu, Xiankai, Zhang, Hao, Nie, Xiushan, Yin, Yilong, Shen, Jianbing. Relational Network via Cascade CRF for Video Language Grounding[J]. IEEE Transactions on Multimedia, 2024, 26, 8297-8311.
Authors:
Zhang, Tong
;
Lu, Xiankai
;
Zhang, Hao
;
Nie, Xiushan
;
Yin, Yilong
; et al.
Favorite
|
TC[WOS]:
1
TC[Scopus]:
1
IF:
8.4
/
8.0
|
Submit date:2024/02/23
Vision-language Grounding
Conditional Random Fields
Temporal Relation
Proposal Free
The Neglected Tails in Vision-Language Models
Conference paper
Parashar, Shubham, Lin, Zhiqiu, Liu, Tian, Dong, Xiangjue, Li, Yanan, Ramanan, Deva, Caverlee, James, Kong, Shu. The Neglected Tails in Vision-Language Models[C]:IEEE Computer Society, 2024, 12988-12997.
Authors:
Parashar, Shubham
;
Lin, Zhiqiu
;
Liu, Tian
;
Dong, Xiangjue
;
Li, Yanan
; et al.
Favorite
|
TC[Scopus]:
2
|
Submit date:2024/11/05
Long Tailed Recognition
Vision-language Models
Zero-shot Recognition
Referring Multi-Object Tracking
Conference paper
Wu, Dongming, Han, Wencheng, Wang, Tiancai, Dong, Xingping, Zhang, Xiangyu, Shen, Jianbing. Referring Multi-Object Tracking[C]:IEEE, 2023, 14633-14642.
Authors:
Wu, Dongming
;
Han, Wencheng
;
Wang, Tiancai
;
Dong, Xingping
;
Zhang, Xiangyu
; et al.
Favorite
|
TC[WOS]:
16
TC[Scopus]:
29
|
Submit date:2024/02/23
And Reasoning
Language
Vision
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation
Conference paper
Cheng, Wenhao, Dong, Xingping, Khan, Salman, Shen, Jianbing. Learning Disentanglement with Decoupled Labels for Vision-Language Navigation[C]:SPRINGER-VERLAG BERLIN, HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY, 2022, 309-329.
Authors:
Cheng, Wenhao
;
Dong, Xingping
;
Khan, Salman
;
Shen, Jianbing
Favorite
|
TC[WOS]:
4
TC[Scopus]:
5
|
Submit date:2023/01/30
Disentanglement
Imitation/reinforcement Learning
Lstm And Transformer
Modular Network
Vision-and-language Navigation
Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation
Conference paper
Dongming Wu, Xingping Dong, Ling Shao, Jianbing Shen. Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation[C]:IEEE COMPUTER SOC, 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA, 2022, 4986-4995.
Authors:
Dongming Wu
;
Xingping Dong
;
Ling Shao
;
Jianbing Shen
Favorite
|
TC[WOS]:
18
TC[Scopus]:
33
|
Submit date:2023/01/30
Grouping And Shape Analysis
Segmentation
Vision + Language
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Conference paper
Hanqing Wang, Wei Liang, Jianbing Shen, Luc Van Gool, Wenguan Wang. Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation[C], 2022, 15450-15460.
Authors:
Hanqing Wang
;
Wei Liang
;
Jianbing Shen
;
Luc Van Gool
;
Wenguan Wang
Favorite
|
TC[WOS]:
17
TC[Scopus]:
36
|
Submit date:2023/01/30
Vision + Language