Artigo Produção Nacional Revisado por pares

Multivariate data quality assessment based on rotated factor scores and confidence ellipsoids

2019; Elsevier BV; Volume: 129; Linguagem: Inglês

10.1016/j.dss.2019.113173

ISSN

1873-5797

Autores

Fabrício Alves de Almeida, Rodrigo Reis Leite, Guilherme Ferreira Gomes, José Henrique de Freitas Gomes, Anderson Paulo de Paiva,

Tópico(s)

Advanced Statistical Methods and Models

Resumo

This study explores the nature of the correlation in data to estimate the data quality to be used in decision-making processes. The main contribution of this research is the introduction of a new multivariate method based on rotated factor scores by varimax strategy for the repeatability and reproducibility study to effectively identify possible data of poor quality leading to measurement errors. In addition, a new confidence ellipsoid-based decision support method is developed. The efficiency of the proposed method was demonstrated using the metallographic measurements of the geometric characteristics of the resistance spot welding process. To prove the efficiency of the proposed method, it was compared with other consolidated techniques such as the analysis of variance, weighted principal components method, and factor analysis without rotation. Thus, we verified that the proposed method performed better interpretation of the latent information, minimizing the dimensionality of the data, and separating the quality attributes analyzed by clusters. One response group was classified as acceptable, and the other as marginal. These results were verified by the confidence ellipsoids, in which the proposed method obeyed the Bonferroni bilateral limits, outlining the factors which demonstrated superior discriminatory power with non-overlapping ellipsoids avoiding the confounding and favoring the better data quality analysis for multicriteria decision-making. When compared with the other approaches, the proposed method demonstrated more reliable and robust results without such deficiencies as inversion of the groupings, neglection of the variance-covariance structure, and the variability attributed to the data within the measurement system.

Referência(s)
Altmetric
PlumX