Residential College | false |
Status | 已發表Published |
UP-DPC: Ultra-scalable parallel density peak clustering | |
Ma, Luyao1,2; Yang, Geping1,2; Yang, Yiyang2; Chen, Xiang1; Lu, Juan3; Gong, Zhiguo4; Hao, Zhifeng5 | |
2024-03 | |
Source Publication | Information Sciences |
ISSN | 0020-0255 |
Volume | 660Pages:120114 |
Abstract | Density Peak Clustering (DPC) is a highly effective density-based clustering algorithm, but its scalability is limited by the expensive Density Peak Estimation (DPE) step. To address this challenge, we propose UP-DPC: Ultra-Scalable Parallel Density Peak Clustering, a novel framework that employs approximate Density Peak Estimation and performs DPC on LDP-wise graphs. This approach enables UP-DPC to handle datasets of arbitrary scale without relying on spatial indexing for acceleration. Furthermore, we introduce a five-layer computational architecture and leverage parallel computation techniques to further enhance the speed and efficiency of UP-DPC. To evaluate the scalability and effectiveness of UP-DPC, we conduct extensive experiments on 14 datasets, including the large/web-scale datasets, and compare UP-DPC with 21 algorithms. Notably, on the MNIST8M dataset consisting of 8,000k data objects, UP-DPC achieves an NMI (Normalized Mutual Information) value of 0.6464 in just 35.41 seconds, outperforming the state-of-the-art GPU-based method, which only archives an NMI of 0.045 in 56.96 seconds. These results demonstrate the superior scalability and effectiveness of UP-DPC in handling large/web-scale datasets. The proposed framework offers significant improvements over existing methods and shows promise as a solution for density-based clustering tasks. |
Keyword | Clustering Density Peak Estimation Large-scale Parallel Computation Scalability |
DOI | 10.1016/j.ins.2024.120114 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems |
WOS ID | WOS:001164741500001 |
Publisher | ELSEVIER SCIENCE INC, STE 800, 230 PARK AVE, NEW YORK, NY 10169 |
Scopus ID | 2-s2.0-85182592511 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Yang, Yiyang; Chen, Xiang |
Affiliation | 1.School of Electronics and Information Technology, Sun Yat-Sen University, China 2.Faculty of Computer, Guangdong University of Technology, China 3.Beijing Institute of Petrochemical Technology, China 4.State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science, University of Macau, China 5.College of Engineering, Shantou University, China |
Recommended Citation GB/T 7714 | Ma, Luyao,Yang, Geping,Yang, Yiyang,et al. UP-DPC: Ultra-scalable parallel density peak clustering[J]. Information Sciences, 2024, 660, 120114. |
APA | Ma, Luyao., Yang, Geping., Yang, Yiyang., Chen, Xiang., Lu, Juan., Gong, Zhiguo., & Hao, Zhifeng (2024). UP-DPC: Ultra-scalable parallel density peak clustering. Information Sciences, 660, 120114. |
MLA | Ma, Luyao,et al."UP-DPC: Ultra-scalable parallel density peak clustering".Information Sciences 660(2024):120114. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment