Residential College | false |
Status | 已發表Published |
Durable top-k search in document archives | |
Hou U L.1; Mamoulis N.1; Berberich K.2; Bedathur S.2 | |
2010-07-23 | |
Conference Name | the 2010 ACM SIGMOD International Conference on Management of data |
Source Publication | Proceedings of the ACM SIGMOD International Conference on Management of Data |
Pages | 555-566 |
Conference Date | June 06 - 10, 2010 |
Conference Place | Indianapolis, Indiana, USA |
Abstract | We propose and study a new ranking problem in versioned databases. Consider a database of versioned objects which have different valid instances along a history (e.g., documents in a web archive). Durable top-k search finds the set of objects that are consistently in the top-k results of a query (e.g., a keyword query) throughout a given time interval (e.g., from June 2008 to May 2009). Existing work on temporal top-k queries mainly focuses on finding the most representative top-k elements within a time interval. Such methods are not readily applicable to durable top-k queries. To address this need, we propose two techniques that compute the durable top-k result. The first is adapted from the classic top-k rank aggregation algorithm NRA. The second technique is based on a shared execution paradigm and is more efficient than the first approach. In addition, we propose a special indexing technique for archived data. The index, coupled with a space partitioning technique, improves performance even further. We use data from Wikipedia and the Internet Archive to demonstrate the efficiency and effectiveness of our solutions. © 2010 ACM. |
Keyword | Document Archives Temporal Queries Top-k Search |
DOI | 10.1145/1807167.1807228 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-77954751022 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Affiliation | 1.The University of Hong Kong 2.Max Planck Institut für Informatik |
Recommended Citation GB/T 7714 | Hou U L.,Mamoulis N.,Berberich K.,et al. Durable top-k search in document archives[C], 2010, 555-566. |
APA | Hou U L.., Mamoulis N.., Berberich K.., & Bedathur S. (2010). Durable top-k search in document archives. Proceedings of the ACM SIGMOD International Conference on Management of Data, 555-566. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment