Identification of user sessions with hierarchical agglomerative clustering

2006; Wiley; Volume: 43; Issue: 1 Linguagem: Inglês

10.1002/meet.14504301312

ISSN

0044-7870

Autores

Gabriel Murray, Jimmy Lin, Abdur Chowdhury,

Tópico(s)

Web Data Mining and Analysis

Resumo

Abstract We introduce a novel approach to identifying Web search user sessions based on the burstiness of users' activity. Our method is user‐centered rather than population‐centered or system‐centered and can be deployed in situations in which users choose to withhold personal content information. We adopt a hierarchical agglomerative clustering approach with a stopping criterion that is statistically motivated by users' activities. An evaluation based on extracts from AOL Search™ logs reveals that our algorithm achieves 98% accuracy in identifying session boundaries compared to human judgments.

Referência(s)