Book chapter, peer-reviewed

From NLP (Natural Language Processing) to MLP (Machine Language Processing)

2010; Springer Science+Business Media; Language: English

10.1007/978-3-642-14706-7_20

ISSN

1611-3349

Authors

Peter Teufl, Udo Payer, Guenter Lackner

Topic(s)

Semantic Web and Ontologies

Abstract

Natural Language Processing (NLP), in combination with machine learning techniques, plays an important role in automatic text analysis. Motivated by the successful use of NLP in solving text classification problems in the area of e-Participation, and inspired by our prior work on polymorphic shellcode detection, we tried classical NLP processes on the special case of malicious code analysis. Any malicious program is based on some kind of machine language, ranging from manually crafted assembler code that exploits a buffer overflow to high-level languages such as JavaScript used in web-based attacks. We argue that well-known NLP analysis processes can be modified and applied to the malware analysis domain. By analogy with NLP, we call this process Machine Language Processing (MLP). In this paper, we take our e-Participation analysis architecture, extract its various NLP techniques, and adapt them to the malware analysis process. As a proof of concept, we apply the adapted framework to malicious code examples from Metasploit.
