Peer-reviewed article

Comparisons of decision tree methods using water data

2015; Taylor & Francis; Volume: 46; Issue: 4; Language: English

10.1080/03610918.2015.1066807

ISSN

1532-4141

Authors

Muhammad Azam, Muhammad Aslam, Khushnoor Khan, Anwar Mughal, Awais Inayat

Topic(s)

Neural Networks and Applications

Abstract

This article demonstrates the application of classification trees (decision trees), logistic regression (LR), and linear discriminant analysis (LDA) to classify water quality data (i.e., whether the water is fit or not fit for drinking). The water quality data were obtained from the Pakistan Council of Research in Water Resources (PCRWR) for two cities of Pakistan: one representing an industrial environment (Sialkot) and the other a non-industrial environment (Narowal). To classify the data, three statistical tools were employed using the R software library: the decision tree methodology using the Gini index, LR, and LDA. The results of the three techniques were compared using misclassification rates (the model with the lowest misclassification rate is preferred). LR performed better than the other two techniques, while decision trees and LDA performed equally well. For illustration purposes, however, decision trees are comparatively easy to draw and interpret.
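The two quantities the abstract relies on, the Gini index used to grow the tree and the misclassification rate used to compare models, can be illustrated with a minimal sketch. It is written in Python rather than the R library the authors used, and every name and label in it (`gini_impurity`, `split_gini`, `misclassification_rate`, `model_a`, `model_b`, the toy "fit"/"unfit" values) is a hypothetical choice made for illustration, not the paper's code, data, or results.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a node: 1 - sum over classes of p_k squared."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def split_gini(left, right):
    """Size-weighted Gini impurity of a binary split (lower is better)."""
    n = len(left) + len(right)
    if n == 0:
        raise ValueError("split must contain at least one observation")
    return (len(left) / n) * gini_impurity(left) + (len(right) / n) * gini_impurity(right)


def misclassification_rate(actual, predicted):
    """Share of observations whose predicted class differs from the actual class."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    wrong = sum(a != p for a, p in zip(actual, predicted))
    return wrong / len(actual)


# Toy labels invented for this sketch; they are NOT the PCRWR data.
actual = ["fit", "fit", "unfit", "unfit", "fit", "unfit"]

# Hypothetical predictions from two unnamed classifiers.
predictions = {
    "model_a": ["fit", "unfit", "unfit", "unfit", "fit", "fit"],
    "model_b": ["fit", "fit", "unfit", "unfit", "fit", "fit"],
}

# Compare the models the way the abstract describes: lowest rate is preferred.
rates = {name: misclassification_rate(actual, pred) for name, pred in predictions.items()}
best = min(rates, key=rates.get)
```

A tree-growing procedure of this kind would typically evaluate `split_gini` for each candidate split and keep the one with the lowest value, while `rates` supports the model comparison by misclassification rate.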
