Artigo Acesso aberto Revisado por pares

On Evaluating Floating Car Data Quality for Knowledge Discovery

2018; Institute of Electrical and Electronics Engineers; Volume: 19; Issue: 11 Linguagem: Inglês

10.1109/tits.2018.2867834

ISSN

1558-0016

Autores

Vítor Cerqueira, Luís Moreira-Matias, Jihed Khiari, Hans van Lint,

Tópico(s)

Traffic Prediction and Management Techniques

Resumo

Floating car data (FCD) denotes the type of data (location, speed, and destination) produced and broadcasted periodically by running vehicles. Increasingly, intelligent transportation systems take advantage of such data for prediction purposes as input to road and transit control and to discover useful mobility patterns with applications to transport service design and planning, to name just a few applications. However, there are considerable quality issues that affect the usefulness and efficacy of FCD in these many applications. In this paper, we propose a methodology to compute such quality indicators automatically for large FCD sets. It leverages on a set of statistical indicators (named Yuki-san) covering multiple dimensions of FCD such as spatio-temporal coverage, accuracy, and reliability. As such, the Yuki-san indicators provide a quick and intuitive means to assess the potential "value" and "veracity" characteristics of the data. Experimental results with two mobility-related data mining and supervised learning tasks on the basis of two real-world FCD sources show that the Yuki-san indicators are indeed consistent with how well the applications perform using the data. With a wider variety of FCD (e.g., from navigation systems and CAN buses) becoming available, further research and validation into the dimensions covered and the efficacy of the Yuki-San indicators is needed.

Referência(s)