Residential College | false |
Status | 已發表Published |
Efficient discovery of longest-lasting correlation in sequence databases | |
Yuhong Li1; Leong Hou U1; Man Lung Yiu2; Zhiguo Gong1 | |
2016-06-23 | |
Source Publication | VLDB Journal |
ISSN | 1066-8888 |
Volume | 25Issue:6Pages:767-790 |
Abstract | The search for similar subsequences is a core module for various analytical tasks in sequence databases. Typically, the similarity computations require users to set a length. However, there is no robust means by which to define the proper length for different application needs. In this study, we examine a new query that is capable of returning the longest-lasting highly correlated subsequences in a sequence database, which is particularly helpful to analyses without prior knowledge regarding the query length. A baseline, yet expensive, solution is to calculate the correlations for every possible subsequence length. To boost performance, we study a space-constrained index that provides a tight correlation bound for subsequences of similar lengths and offset by intraobject and interobject grouping techniques. To the best of our knowledge, this is the first index to support a normalized distance metric of arbitrary length subsequences. In addition, we study the use of a smart cache for disk-resident data (e.g., millions of sequence objects) and a graph processing unit-based parallel processing technique for frequently updated data (e.g., nonindexable streaming sequences) to compute the longest-lasting highly correlated subsequences. Extensive experimental evaluation on both real and synthetic sequence datasets verifies the efficiency and effectiveness of our proposed methods. |
Keyword | Longest-lasting Correlated Subsequences Similarity Search Time Series Analysis |
DOI | 10.1007/s00778-016-0432-7 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Hardware & Architecture ; Computer Science, Information Systems |
WOS ID | WOS:000387501000002 |
Scopus ID | 2-s2.0-84976287091 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Department of Computer and Information Science, University of Macau, Macau SAR, China 2.Department of Computing, Hong Kong Polytechnic University, Hong Kong SAR, China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Yuhong Li,Leong Hou U,Man Lung Yiu,et al. Efficient discovery of longest-lasting correlation in sequence databases[J]. VLDB Journal, 2016, 25(6), 767-790. |
APA | Yuhong Li., Leong Hou U., Man Lung Yiu., & Zhiguo Gong (2016). Efficient discovery of longest-lasting correlation in sequence databases. VLDB Journal, 25(6), 767-790. |
MLA | Yuhong Li,et al."Efficient discovery of longest-lasting correlation in sequence databases".VLDB Journal 25.6(2016):767-790. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment