Artigo Acesso aberto

Discovered motifs with using parallel Mprefixspan method

2011; Academic Journals; Volume: 6; Issue: 20 Linguagem: Inglês

10.5897/sre11.212

ISSN

1992-2248

Autores

Alinejad Rokny Hamid, Mir Mohsen Pedram, Shirgahi Hossein,

Tópico(s)

Machine Learning in Bioinformatics

Resumo

The parallelization of a prefixspan method to discover motifs is proposed in this paper. The prefixspan method is used to extract the frequent pattern from a sequence database. This system requires the use of multiple computers connected in local area network. This algorithm includes multi-threads to achieve communication between a master process and multiple slave processes. This algorithm applies dynamic scheduling to avoid tasks idling. Moreover we employ a technique, called selective sampling. We implement this algorithm with using a 4G memory and AMD phenom X4. Our experimental results show that this algorithm attains good efficiencies on motifs extraction. Key words: Motif discovery, parallel mining, wild cards, task scheduling, sequence mining, thread scheduling, parallel tree, DNA sequences.

Referência(s)
Altmetric
PlumX