Artigo Acesso aberto Revisado por pares

NovelTM Datasets for English-Language Fiction, 1700-2009

2020; Volume: 5; Issue: 2 Linguagem: Inglês

10.22148/001c.13147

ISSN

2371-4549

Autores

Ted Underwood, Patrick Kimutis, Jessica Witte,

Tópico(s)

Computational and Text Analysis Methods

Resumo

This report accompanies a collection of 210,305 volumes, predicted to be fiction, that researchers are encouraged to borrow for their own work. We divide the collection into seven subsets with different emphases (for instance, one where books written by men and women are represented equally, and one composed of only the most prominent and widely-held books). Comparing the pictures produced by these different subsets allows us to assess the resilience or fragility of recent quantitative arguments about literary history. Readers can also simply browse the report as a description of English-language fiction in HathiTrust Digital Library.

Referência(s)