Status: Published
Durable top-k search in document archives
Hou U L.1; Mamoulis N.1; Berberich K.2; Bedathur S.2
2010-07-23
Conference Name: The 2010 ACM SIGMOD International Conference on Management of Data
Source Publication: Proceedings of the ACM SIGMOD International Conference on Management of Data
Pages: 555-566
Conference Date: June 6-10, 2010
Conference Place: Indianapolis, Indiana, USA
Abstract

We propose and study a new ranking problem in versioned databases. Consider a database of versioned objects which have different valid instances along a history (e.g., documents in a web archive). Durable top-k search finds the set of objects that are consistently in the top-k results of a query (e.g., a keyword query) throughout a given time interval (e.g., from June 2008 to May 2009). Existing work on temporal top-k queries mainly focuses on finding the most representative top-k elements within a time interval. Such methods are not readily applicable to durable top-k queries. To address this need, we propose two techniques that compute the durable top-k result. The first is adapted from the classic top-k rank aggregation algorithm NRA. The second technique is based on a shared execution paradigm and is more efficient than the first approach. In addition, we propose a special indexing technique for archived data. The index, coupled with a space partitioning technique, improves performance even further. We use data from Wikipedia and the Internet Archive to demonstrate the efficiency and effectiveness of our solutions. © 2010 ACM.
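To make the query definition above concrete, here is a minimal sketch of a naive baseline that computes the top-k set at every snapshot in the interval and intersects them. It is not the NRA-based or shared-execution technique from the paper, and it uses no index. The discrete integer timestamps, the `history` layout, and the names `top_k` and `durable_top_k` are assumptions made for illustration only.

```python
from typing import Dict, Hashable, Set

# Hypothetical input shape: one {object_id: score} mapping per snapshot time.
Snapshot = Dict[Hashable, float]


def top_k(scores: Snapshot, k: int) -> Set[Hashable]:
    """Return the ids of the k highest-scoring objects in one snapshot."""
    # Sort by descending score; ties are broken by id string for determinism.
    ranked = sorted(scores, key=lambda obj: (-scores[obj], str(obj)))
    return set(ranked[:k])


def durable_top_k(history: Dict[int, Snapshot], k: int,
                  start: int, end: int) -> Set[Hashable]:
    """Naive baseline: objects in the top-k at every snapshot in [start, end]."""
    times = [t for t in sorted(history) if start <= t <= end]
    if not times:
        return set()
    durable = top_k(history[times[0]], k)
    for t in times[1:]:
        durable &= top_k(history[t], k)
        if not durable:
            break  # no object can re-enter once the intersection is empty
    return durable


# Toy archive with three snapshots of per-object query scores (made-up data).
history = {
    1: {"a": 0.9, "b": 0.8, "c": 0.1},
    2: {"a": 0.7, "b": 0.2, "c": 0.6},
    3: {"a": 0.8, "b": 0.3, "c": 0.5},
}
result = durable_top_k(history, k=2, start=1, end=3)
```

In this toy data only object `a` stays in the top-2 across all three snapshots. The baseline ranks every snapshot in full, which is the per-snapshot cost that more efficient techniques aim to avoid.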

Keywords: Document Archives; Temporal Queries; Top-k Search
DOI: 10.1145/1807167.1807228
Language: English
Scopus ID: 2-s2.0-77954751022
Document Type: Conference paper
Collection: Department of Computer and Information Science; Faculty of Science and Technology
Affiliation: 1. The University of Hong Kong; 2. Max Planck Institut für Informatik
Recommended Citation
GB/T 7714: Hou U L., Mamoulis N., Berberich K., et al. Durable top-k search in document archives[C], 2010, 555-566.
APA: Hou U L., Mamoulis N., Berberich K., & Bedathur S. (2010). Durable top-k search in document archives. Proceedings of the ACM SIGMOD International Conference on Management of Data, 555-566.