Artigo Acesso aberto Revisado por pares

Delayed Decision-making in Real-time Beatbox Percussion Classification

2010; Routledge; Volume: 39; Issue: 3 Linguagem: Inglês

10.1080/09298215.2010.512979

ISSN

1744-5027

Autores

Dan Stowell, Mark D. Plumbley,

Tópico(s)

Speech and Audio Processing

Resumo

Abstract Real-time classification applied to a vocal percussion signal holds potential as an interface for live musical control. In this article we propose a novel approach to resolving the tension between the needs for low-latency reaction and reliable classification, by deferring the final classification decision until after a response has been initiated. We introduce a new dataset of annotated human beatbox recordings, and use it to study the optimal delay for classification accuracy. We then investigate the effect of such delayed decision-making on the quality of the audio output of a typical reactive system, via a MUSHRA-type listening test. Our results show that the effect depends on the output audio type: for popular dance/pop drum sounds the acceptable delay is on the order of 12–35 ms.

Referência(s)