Amazon Alexa traffic traces
2022; Elsevier BV; Volume: 205; Linguagem: Inglês
10.1016/j.comnet.2022.108782
ISSN1872-7069
AutoresRubén Barceló-Armada, Ismael Castell-Uroz, Pere Barlet‐Ros,
Tópico(s)Internet Traffic Analysis and Secure E-voting
ResumoThe number of devices that make up the Internet of Things (IoT) has been increasing every year, including smart speakers such as Amazon Echo devices. These devices have become very popular around the world where users with a smart speaker are estimated to be about 83 million in 2020. However, there has also been great concern about how they can affect the privacy and security of their users [1]. Responding to voice commands requires devices to continuously listen for the corresponding wake word, with the privacy implications that this entails. Additionally, the interactions that users may have with the virtual assistant can reveal private information about the user. In this document we publicly share two datasets that can help conduct privacy and security studies from the Amazon Echo Dot smart speaker. The included data contains 300.000 raw PCAP traces containing all the communications between the device and Amazon servers from 100 different voice commands on two different languages. The data can be used to train machine learning algorithms in order to find patterns that can characterize both, the voice commands and people using the device as well as Alexa as the device generating the traffic.
Referência(s)