Article Open access Peer-reviewed

Amazon Alexa traffic traces

2022; Elsevier BV; Volume: 205; Language: English

10.1016/j.comnet.2022.108782

ISSN

1872-7069

Authors

Rubén Barceló-Armada, Ismael Castell-Uroz, Pere Barlet‐Ros

Topic(s)

Internet Traffic Analysis and Secure E-voting

Abstract

The number of devices that make up the Internet of Things (IoT) has been increasing every year, including smart speakers such as Amazon Echo devices. These devices have become very popular around the world: the number of smart speaker users was estimated at about 83 million in 2020. However, there has also been great concern about how they can affect the privacy and security of their users [1]. Responding to voice commands requires the devices to listen continuously for the corresponding wake word, with the privacy implications that this entails. Additionally, the interactions that users have with the virtual assistant can reveal private information about them. In this document we publicly share two datasets that can help in conducting privacy and security studies of the Amazon Echo Dot smart speaker. The data include 300,000 raw PCAP traces containing all the communications between the device and Amazon servers for 100 different voice commands in two different languages. The data can be used to train machine learning algorithms to find patterns that characterize both the voice commands and the people using the device, as well as to identify Alexa as the device generating the traffic.

Reference(s)