UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
UP-DPC: Ultra-scalable parallel density peak clustering
Ma, Luyao1,2; Yang, Geping1,2; Yang, Yiyang2; Chen, Xiang1; Lu, Juan3; Gong, Zhiguo4; Hao, Zhifeng5
2024-03
Source PublicationInformation Sciences
ISSN0020-0255
Volume660Pages:120114
Abstract

Density Peak Clustering (DPC) is a highly effective density-based clustering algorithm, but its scalability is limited by the expensive Density Peak Estimation (DPE) step. To address this challenge, we propose UP-DPC: Ultra-Scalable Parallel Density Peak Clustering, a novel framework that employs approximate Density Peak Estimation and performs DPC on LDP-wise graphs. This approach enables UP-DPC to handle datasets of arbitrary scale without relying on spatial indexing for acceleration. Furthermore, we introduce a five-layer computational architecture and leverage parallel computation techniques to further enhance the speed and efficiency of UP-DPC. To evaluate the scalability and effectiveness of UP-DPC, we conduct extensive experiments on 14 datasets, including the large/web-scale datasets, and compare UP-DPC with 21 algorithms. Notably, on the MNIST8M dataset consisting of 8,000k data objects, UP-DPC achieves an NMI (Normalized Mutual Information) value of 0.6464 in just 35.41 seconds, outperforming the state-of-the-art GPU-based method, which only archives an NMI of 0.045 in 56.96 seconds. These results demonstrate the superior scalability and effectiveness of UP-DPC in handling large/web-scale datasets. The proposed framework offers significant improvements over existing methods and shows promise as a solution for density-based clustering tasks.

KeywordClustering Density Peak Estimation Large-scale Parallel Computation Scalability
DOI10.1016/j.ins.2024.120114
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Information Systems
WOS IDWOS:001164741500001
PublisherELSEVIER SCIENCE INC, STE 800, 230 PARK AVE, NEW YORK, NY 10169
Scopus ID2-s2.0-85182592511
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorYang, Yiyang; Chen, Xiang
Affiliation1.School of Electronics and Information Technology, Sun Yat-Sen University, China
2.Faculty of Computer, Guangdong University of Technology, China
3.Beijing Institute of Petrochemical Technology, China
4.State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science, University of Macau, China
5.College of Engineering, Shantou University, China
Recommended Citation
GB/T 7714
Ma, Luyao,Yang, Geping,Yang, Yiyang,et al. UP-DPC: Ultra-scalable parallel density peak clustering[J]. Information Sciences, 2024, 660, 120114.
APA Ma, Luyao., Yang, Geping., Yang, Yiyang., Chen, Xiang., Lu, Juan., Gong, Zhiguo., & Hao, Zhifeng (2024). UP-DPC: Ultra-scalable parallel density peak clustering. Information Sciences, 660, 120114.
MLA Ma, Luyao,et al."UP-DPC: Ultra-scalable parallel density peak clustering".Information Sciences 660(2024):120114.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Ma, Luyao]'s Articles
[Yang, Geping]'s Articles
[Yang, Yiyang]'s Articles
Baidu academic
Similar articles in Baidu academic
[Ma, Luyao]'s Articles
[Yang, Geping]'s Articles
[Yang, Yiyang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ma, Luyao]'s Articles
[Yang, Geping]'s Articles
[Yang, Yiyang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.