Artigo Acesso aberto Revisado por pares

Assessing the performances of different neural network architectures for the detection of screams and shouts in public transportation

2018; Elsevier BV; Volume: 117; Linguagem: Inglês

10.1016/j.eswa.2018.08.052

ISSN

1873-6793

Autores

Pierre Laffitte, Yun Wang, David Sodoyer, Laurent Girin,

Tópico(s)

Speech and Audio Processing

Resumo

As intelligent transportation systems are becoming more and more prevalent, the relevance of automatic surveillance systems grows larger. While such systems rely heavily on video signals, other types of signals can be used as well to monitor the security of passengers. The present article proposes an audio-based intelligent system for surveillance in public transportation, investigating the use of some state-of-the-art artificial intelligence methods for the automatic detection of screams and shouts. We present test results produced on a database of sounds occurring in subway trains in real working conditions, by classifying sounds into screams, shouts and other categories using different Neural Network architectures. The relevance of these architectures in the analysis of audio signals is analyzed. We report encouraging results, given the difficulty of the task, especially when a high level of surrounding noise is present.

Referência(s)