Monoaural Audio Source Separation Using Deep Convolutional Neural Networks

Capítulo de livro Acesso aberto Revisado por pares

Monoaural Audio Source Separation Using Deep Convolutional Neural Networks

2017; Springer Science+Business Media; Linguagem: Inglês

10.1007/978-3-319-53547-0_25

ISSN

1611-3349

Autores

Pritish Chandna, Marius Miron, Jordi Janer, Emília Gómez,

Tópico(s)

Blind Source Separation Techniques

Resumo

In this paper we introduce a low-latency monaural source separation framework using a Convolutional Neural Network (CNN). We use a CNN to estimate time-frequency soft masks which are applied for source separation. We evaluate the performance of the neural network on a database comprising of musical mixtures of three instruments: voice, drums, bass as well as other instruments which vary from song to song. The proposed architecture is compared to a Multilayer Perceptron (MLP), achieving on-par results and a significant improvement in processing time. The algorithm was submitted to source separation evaluation campaigns to test efficiency, and achieved competitive results.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Monoaural Audio Source Separation Using Deep Convolutional Neural Networks