Artigo Revisado por pares

A new pattern representation scheme using data compression

2002; IEEE Computer Society; Volume: 24; Issue: 5 Linguagem: Inglês

10.1109/34.1000234

ISSN

2160-9292

Autores

T. Watanabe, K. Sugawara, Hiroki Sugihara,

Tópico(s)

Image Retrieval and Classification Techniques

Resumo

We propose the PRDC (Pattern Representation based on Data Compression) scheme for media data analysis. PRDC is composed of two parts: an encoder that translates input data into text and a set of text compressors to generate a compression-ratio vector (CV). The CV is used as a feature of the input data. By preparing a set of media-specific encoders, PRDC becomes widely applicable. Analysis tasks - both categorization (class formation) and recognition (classification) - can be realized using CVs. After a mathematical discussion on the realizability of PRDC, the wide applicability of this scheme is demonstrated through the automatic categorization and/or recognition of music, voices, genomes, handwritten sketches and color images.

Referência(s)