Capítulo de livro Acesso aberto Revisado por pares

Creating a Dead Poets Society: Extracting a Social Network of Historical Persons from the Web

2007; Springer Science+Business Media; Linguagem: Inglês

10.1007/978-3-540-76298-0_12

ISSN

1611-3349

Autores

Gijs Geleijnse, Jan Korst,

Tópico(s)

Topic Modeling

Resumo

We present a simple method to extract information from search engine snippets. Although the techniques presented are domain independent, this work focuses on extracting biographical information of historical persons from multiple unstructured sources on the Web. We first similarly find a list of persons and their periods of life by querying the periods and scanning the retrieved snippets for person names. Subsequently, we find biographical information for the persons extracted. In order to get insight in the mutual relations among the persons identified, we create a social network using co-occurrences on the Web. Although we use uncontrolled and unstructured Web sources, the information extracted is reliable. Moreover we show that Web Information Extraction can be used to create both informative and enjoyable applications.

Referência(s)