Assessing the performances of different neural network architectures for the detection of screams and shouts in public transportation
2018; Elsevier BV; Volume: 117; Linguagem: Inglês
10.1016/j.eswa.2018.08.052
ISSN1873-6793
AutoresPierre Laffitte, Yun Wang, David Sodoyer, Laurent Girin,
Tópico(s)Speech and Audio Processing
ResumoAs intelligent transportation systems are becoming more and more prevalent, the relevance of automatic surveillance systems grows larger. While such systems rely heavily on video signals, other types of signals can be used as well to monitor the security of passengers. The present article proposes an audio-based intelligent system for surveillance in public transportation, investigating the use of some state-of-the-art artificial intelligence methods for the automatic detection of screams and shouts. We present test results produced on a database of sounds occurring in subway trains in real working conditions, by classifying sounds into screams, shouts and other categories using different Neural Network architectures. The relevance of these architectures in the analysis of audio signals is analyzed. We report encouraging results, given the difficulty of the task, especially when a high level of surrounding noise is present.
Referência(s)