Article Open access Peer-reviewed

Automatic repair of regular expressions

2019; Association for Computing Machinery; Volume: 3; Issue: OOPSLA; Language: English

10.1145/3360565

ISSN

2475-1421

Authors

Rong Pan, Qinheping Hu, Gaowei Xu, Loris D’Antoni

Topic(s)

Network Packet Processing and Optimization

Abstract

We introduce RFixer, a tool for repairing complex regular expressions using examples. We consider only regular expressions without non-regular operators (e.g., negative lookahead). Given an incorrect regular expression and sets of positive and negative examples, RFixer synthesizes the regular expression closest to the original that is consistent with the examples. Automatically repairing regular expressions requires exploring a large search space because practical regular expressions: i) are large, ii) operate over very large alphabets (e.g., UTF-16), and iii) employ complex constructs (e.g., character classes and numerical quantifiers). RFixer's repair algorithm achieves scalability by exploiting structural properties of regular expressions to prune the search space, and it employs satisfiability modulo theories (SMT) solvers to explore the sets of possible character classes and numerical quantifiers efficiently and symbolically. RFixer successfully computed minimal repairs for regular expressions collected from a variety of sources, whereas existing tools either failed to produce any repair or produced overly complex repairs.
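The repair problem stated in the abstract can be illustrated with a minimal Python sketch. It is not RFixer's algorithm: the example strings, the date-like template, and the helper names `is_consistent` and `search_quantifiers` are all hypothetical, and a brute-force loop over quantifier bounds stands in for the symbolic SMT-based search the abstract describes.

```python
import re
from itertools import product


def is_consistent(pattern, positives, negatives):
    """True if pattern fully matches every positive and no negative example."""
    compiled = re.compile(pattern)
    accepts_all = all(compiled.fullmatch(s) is not None for s in positives)
    rejects_all = all(compiled.fullmatch(s) is None for s in negatives)
    return accepts_all and rejects_all


def search_quantifiers(positives, negatives, max_bound=4):
    """Brute-force stand-in for a symbolic search over numerical quantifiers.

    Tries every exact repetition count (a, b, c) for the hypothetical
    template \\d{a}-\\d{b}-\\d{c} and returns the first consistent pattern,
    or None if no bounds within max_bound work.
    """
    for a, b, c in product(range(1, max_bound + 1), repeat=3):
        candidate = rf"\d{{{a}}}-\d{{{b}}}-\d{{{c}}}"
        if is_consistent(candidate, positives, negatives):
            return candidate
    return None


# Hypothetical examples, chosen only for illustration.
positives = ["2019-10-23", "1999-01-05"]
negatives = ["2019-10-2", "19-10-23", "2019/10/23"]

# The "incorrect" input expression: it accepts some negative examples.
original = r"\d+-\d+-\d+"

original_ok = is_consistent(original, positives, negatives)
repaired = search_quantifiers(positives, negatives)
print(original_ok, repaired)
```

Here the original expression is inconsistent because `\d+` admits the wrong digit counts, while the search keeps the structure of the expression and changes only the quantifiers. Exhaustive enumeration like this grows quickly with the number of quantifiers and the size of the alphabet, which is the scalability problem the abstract attributes to practical regular expressions.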

Reference(s)