カワイ ユキコ
KAWAI YUKIKO
河合 由起子 所属 京都産業大学 情報理工学部 情報理工学科 職種 教授 |
|
言語種別 | 英語 |
発行・発表の年月 | 2008 |
形態種別 | その他 |
査読 | 査読あり |
標題 | Utilizing past web for knowledge discovery |
執筆形態 | その他 |
掲載誌名 | Intelligence Integration in Distributed Knowledge Management |
出版社・発行元 | IGI Global |
巻・号・頁 | ISBN: 978-1-59904-576-4;3,pp.286-304 |
著者・共著者 | Adam Jatowt,Yukiko Kawai,Katsumi Tanaka |
概要 | The Web is a useful data source for knowledge extraction, as it provides diverse content virtually on any possible topic. Hence, a lot of research has been recently done for improving mining in the Web. However, relatively little research has been done taking directly into account the temporal aspects of the Web. In this chapter, we analyze data stored in Web archives, which preserve content of the Web, and investigate the methodology required for successful knowledge discovery from this data. We call the collection of such Web archives past Web
a temporal structure composed of the past copies of Web pages. First, we discuss the character of the data and explain some concepts related to utilizing the past Web, such as data collection, analysis and processing. Next, we introduce examples of two applications, temporal summarization and a browser for the past Web. © 2009, IGI Global. |
DOI | 10.4018/978-1-59904-576-4.ch017 |