Residential College | false |
Status | 即將出版Forthcoming |
VFDV-IM: An Efficient and Securely Vertical Federated Data Valuation | |
Zhou, Xiaokai1; Yan, Xiao2; Li, Xinyan1; Huang, Hao1; Xu, Quanqing3; Zhang, Qinbo1; Jerome, Yen4; Cai, Zhaohui1; Jiang, Jiawei1 | |
2024 | |
Conference Name | 29th International Conference on Database Systems for Advanced Applications, DASFAA 2024 |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume | 14850 LNCS |
Pages | 409-424 |
Conference Date | 2 July 2024through 5 July 2024 |
Conference Place | Gifu |
Publisher | Springer Science and Business Media Deutschland GmbH |
Abstract | Vertical federated learning enables multiple participants to build a joint machine learning model upon distributed features of overlapping samples. The performance of VFL models heavily depends on the quality of participants’ local data. It’s essential to measure the contributions of the participants for various purposes, e.g., participant selection and reward allocation. The Shapley value is widely adopted by previous works for contribution assessment. However, computing the Shapley value in VFL requires repetitive model training from scratch, incurring expensive computation and communication overheads. Inspired by this challenge, in this paper, we ask: can we efficiently and securely perform data valuation for participants via the Shapley value in VFL? We call this problem Vertical Federated Data Valuation, and introduce VFDV-IM, a method utilizing an Inheritance Mechanism to expedite Shapley value calculations by leveraging historical training records. We first propose a simple, yet effective, strategy that directly inherits the model trained over the entire consortium. To further optimize VFDV-IM, we propose a model ensemble approach that measures the similarity of evaluated consortiums, based on which we reweight the historical models. We conduct extensive experiments on various datasets and show that our VFDV-IM can efficiently calculate the Shapley value while maintaining accuracy. |
Keyword | Data Valuation Vertical Federated Learning |
DOI | 10.1007/978-981-97-5552-3_28 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85206386695 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | University of Macau |
Affiliation | 1.Wuhan University, Wuhan, Hubei Province, China 2.Centre for Perceptual and Interactive Intelligence (CPII), Hong Kong 3.OceanBase Ant Group, China 4.The University of Macau, Macao |
Recommended Citation GB/T 7714 | Zhou, Xiaokai,Yan, Xiao,Li, Xinyan,et al. VFDV-IM: An Efficient and Securely Vertical Federated Data Valuation[C]:Springer Science and Business Media Deutschland GmbH, 2024, 409-424. |
APA | Zhou, Xiaokai., Yan, Xiao., Li, Xinyan., Huang, Hao., Xu, Quanqing., Zhang, Qinbo., Jerome, Yen., Cai, Zhaohui., & Jiang, Jiawei (2024). VFDV-IM: An Efficient and Securely Vertical Federated Data Valuation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 14850 LNCS, 409-424. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment