Residential College | false |
Status | 已發表Published |
Data Reconstruction of Abandoned Websites | |
Iztok Fister Jr.1; Iztok Fister1; Simon Fong2; Yan Zhuang2 | |
2015-06-11 | |
Conference Name | 2014 2nd International Symposium on Computational and Business Intelligence, ISCBI 2014 |
Source Publication | Proceedings - 2014 2nd International Symposium on Computational and Business Intelligence, ISCBI 2014 |
Pages | 67-72 |
Conference Date | 7-8 Dec. 2014 |
Conference Place | New Delhi, India |
Author of Source | Institute of Electrical and Electronics Engineers Inc. |
Publisher | IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA |
Abstract | Nowadays, the Internet offers data to anyone at any time. Websites on the Internet have been warehousing data for many years ago, i.e., for 10 years and more. In the meantime, many websites have became obsolete. This means they no longer have owner because of either they have no-one to maintain them or they have become unavailable for indexing by spiders that retrieves information about documents to be referenced. As a result, these websites are lost for accessing from Internet browsers and are therefore, referred to as abandoned websites. This paper focuses on the problem of how to identify the abandoned websites and how to preserve and reconstruct the data they hold. We have mainly concentrated on abandoned sport websites that, in general, contains very important data about the results achieved at various sporting competitions in the past. The proposed solution consist of four steps: an analysis of the abandoned servers that held these websites, identifying the structure of the abandoned web page sets, web scrapping, and preserving and visualizing these page sets. In order to test prototype solution, some steps were applied in order to reconstruct and preserve the data on the abandoned web servers for tracking the results on running. Additionally, opportunities and challenges of applying data mining techniques on reconstructed website are listed. |
Keyword | Internet Reconstruction Abandoned Websites Web Scrapping Data Mining |
DOI | 10.1109/ISCBI.2014.22 |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Information Systems ; Computer Science, Interdisciplinary Applications |
WOS ID | WOS:000393510400015 |
Scopus ID | 2-s2.0-84937554175 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Iztok Fister Jr. |
Affiliation | 1.University of Maribor, Faculty of Electrical Engineering and Computer Science, Smetanova 17, Maribor; 2000, Slovenia; 2.University of Macau, Faculty of Science and Technology, Macau, Av. Padre Tomas Pereira, Taipa, China |
Recommended Citation GB/T 7714 | Iztok Fister Jr.,Iztok Fister,Simon Fong,et al. Data Reconstruction of Abandoned Websites[C]. Institute of Electrical and Electronics Engineers Inc.:IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2015, 67-72. |
APA | Iztok Fister Jr.., Iztok Fister., Simon Fong., & Yan Zhuang (2015). Data Reconstruction of Abandoned Websites. Proceedings - 2014 2nd International Symposium on Computational and Business Intelligence, ISCBI 2014, 67-72. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment