Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks

doi:10.1109/CVPRW63382.2024.00515

UM > Faculty of Science and Technology

Residential College	false
Status	已發表Published
	Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks
	Lai, Zhixin 1; Wu, Jing 2; Chen, Suiyao 3; Zhou, Yucheng 4; Hovakimyan, Naira 2
	2024-09
Conference Name	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Source Publication	IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Pages	5086-5096
Conference Date	17-18 June 2024
Conference Place	Seattle, WA, USA
Country	USA
Publisher	IEEE Computer Society
Abstract	In this study, we uncover the unexpected efficacy of residual-based large language models (LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of language or textual data. The approach diverges from established methodologies by utilizing a frozen transformer block, extracted from pre-trained LLMs, as an innovative encoder layer for the direct processing of visual tokens. This strategy represents a significant departure from the standard multi-modal vision-language frameworks, which typically hinge on language-driven prompts and inputs. We found that these LLMs could boost performance across a spectrum of biomedical imaging applications, including both 2D and 3D visual classification tasks, serving as plug-and-play boosters. More interestingly, as a byproduct, we found that the proposed framework achieved superior performance, setting new state-of-the-art results on extensive, standardized datasets in MedMNIST-2D and 3D. Through this work, we aim to open new avenues for employing LLMs in biomedical imaging and enriching the understanding of their potential in this specialized domain. The code is available at https://github.com/ZhixinLai/LLMBoostMedical
Keyword	Visualization Three-dimensional Displays Large Language Models Conferences Fasteners Transformers Pattern Recognition Llm Biomedical Imaging
DOI	10.1109/CVPRW63382.2024.00515
URL	View the original
Language	英語English
Scopus ID	2-s2.0-85202595241
Fulltext Access	View Full-Text via DOI View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	Faculty of Science and Technology
Affiliation	1.Cornel University, United States 2.University of Illinois, Urbana-Champaign, United States 3.University of South Florida, United States 4.University of Macau, Macao
Recommended Citation GB/T 7714	Lai, Zhixin,Wu, Jing,Chen, Suiyao,et al. Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks[C]:IEEE Computer Society, 2024, 5086-5096.
APA	Lai, Zhixin., Wu, Jing., Chen, Suiyao., Zhou, Yucheng., & Hovakimyan, Naira (2024). Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 5086-5096.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh