Artigo Acesso aberto Revisado por pares

Peringkasan dan Support Vector Machine pada Klasifikasi Dokumen

2017; LPPM Institut Teknologi Telkom Purwokerto; Volume: 9; Issue: 4 Linguagem: Inglês

10.20895/infotel.v9i4.312

ISSN

2460-0997

Autores

Nelly Indriani Widiastuti, Ednawati Rainarli, Kania Evita Dewi,

Tópico(s)

Multimedia Learning Systems

Resumo

Classification is the process of grouping objects that have the same features or characteristics into several classes. The automatic documents classification use words frequency that appears on training data as features. The large number of documents cause the number of words that appears as a feature will increase. Therefore, summaries are chosen to reduce the number of words that used in classification. The classification uses multiclass Support Vector Machine (SVM) method. SVM was considered to have a good reputation in the classification. This research tests the effect of summary as selection features into documents classification. The summaries reduce text into 50%. A result obtained that the summaries did not affect value accuracy of classification of documents that use SVM. But, summaries improve the accuracy of Simple Logistic Classifier. The classification testing shows that the accuracy of Naïve Bayes Multinomial (NBM) better than SVM

Referência(s)