Artigo Acesso aberto

Healthcare Data Quality Assessment for Cybersecurity Intelligence

2022; Institute of Electrical and Electronics Engineers; Volume: 19; Issue: 1 Linguagem: Inglês

10.1109/tii.2022.3190405

ISSN

1941-0050

Autores

Yang Li, Jiachen Yang, Zhuo Zhang, Jiabao Wen, Prabhat Kumar,

Tópico(s)

Digital Media Forensic Detection

Resumo

Considering the efficiency and security of healthcare data processing, indiscriminate data collection, annotation, and transmission are unwise. In this article, we propose the normalized double entropy (NDE) method to assess image data quality in the form of metatask. In specific, the probability entropy and distance entropy are both adopted and normalized to evaluate the data quality. The experimental results show the stable ability of the NDE to distinguish good and bad data in terms of information contribution. Furthermore, the model's diagnostic performances driven by selected good and bad data are compared, and a clear gap exists between them under the premise of the same amount of data. Screening 70% of the dataset can achieve almost the same accuracy as that based on all data. This article focuses on healthcare data quality and data redundancy and provides a practical evaluation tool to facilitate the identification and collection of valuable data, which is beneficial to improve efficiency and protect cybersecurity in healthcare systems.

Referência(s)