Residential College | false |
Status | 已發表Published |
Web Information Extraction | |
Man I Lam; Zhiguo Gong | |
2006-05-30 | |
Conference Name | 2005 IEEE International Conference on Information Acquisition |
Source Publication | Proceedings of the 2005 IEEE International Conference on Information Acquisition, June 27- July 3, Hong Kong and Macau, China |
Pages | 596-601 |
Conference Date | 27 June-3 July 2005 |
Conference Place | Hong Kong, China |
Abstract | Along with the continuous development of the Internet technologies, Web pages can provide a huge amount of information resource. It alters the traditional way of preserving and searching information. The queries target to the Web page becomes huge and more and more important. Now a day, search engine is a very popular method to search information on the Web. However, it only presents a list of documents other than the specific answers or piece of knowledge for the user's specific question. Therefore, the data extraction from the Web is becoming a hot topic. In this paper, we investigate the current development in the Web data extraction, the difficulties, and the objectives. In addition, we illustrate and analyze some examples and provide our solution for information extraction from the Web. |
Keyword | Web Page Web Extraction Html Xml |
DOI | 10.1109/ICIA.2005.1635157 |
Scopus ID | 2-s2.0-33947158635 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Faculty of Science and Technology University of Macau Macao, China |
First Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Man I Lam,Zhiguo Gong. Web Information Extraction[C], 2006, 596-601. |
APA | Man I Lam., & Zhiguo Gong (2006). Web Information Extraction. Proceedings of the 2005 IEEE International Conference on Information Acquisition, June 27- July 3, Hong Kong and Macau, China, 596-601. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment