On the Declassification of Confidential Documents
2011; Springer Science+Business Media; Language: English
10.1007/978-3-642-22589-5_22
ISSN: 1611-3349
Authors: Daniel Abril, Guillermo Navarro-Arribas, Vicenç Torra
Topic(s): Digital and Cyber Forensics
Abstract: We introduce the anonymization of unstructured documents to lay the groundwork for the automatic declassification of confidential documents. Building on known ideas and methods from data privacy, we present the main issues of unstructured document anonymization and propose using named entity recognition techniques from natural language processing and information extraction to identify the entities in a document that need to be protected.
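The idea of locating entities and then protecting them can be illustrated with a minimal sketch. This is not the authors' method: it replaces a real named entity recognizer with a toy gazetteer lookup, and the names `GAZETTEER`, `find_entities`, and `redact`, along with the sample entities, are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical gazetteer (assumption): entity label -> known surface forms.
# A real system would use a trained named entity recognizer instead.
GAZETTEER = {
    "PERSON": ["Alice Smith", "Bob Jones"],
    "ORG": ["Acme Corp"],
    "LOCATION": ["Madrid"],
}


def find_entities(text, gazetteer=GAZETTEER):
    """Return non-overlapping (start, end, label) spans in text order."""
    spans = []
    for label, names in gazetteer.items():
        for name in names:
            pattern = r"\b" + re.escape(name) + r"\b"
            for match in re.finditer(pattern, text):
                spans.append((match.start(), match.end(), label))
    # Sort by start position; on ties, prefer the longer match.
    spans.sort(key=lambda s: (s[0], -(s[1] - s[0])))
    result = []
    last_end = 0
    for start, end, label in spans:
        if start >= last_end:  # skip spans overlapping an accepted one
            result.append((start, end, label))
            last_end = end
    return result


def redact(text, gazetteer=GAZETTEER):
    """Replace each detected entity with a bracketed type placeholder."""
    pieces = []
    pos = 0
    for start, end, label in find_entities(text, gazetteer):
        pieces.append(text[pos:start])
        pieces.append("[" + label + "]")
        pos = end
    pieces.append(text[pos:])
    return "".join(pieces)


sample = "Alice Smith of Acme Corp met Bob Jones in Madrid."
print(redact(sample))
```

Replacing an entity with its type label is only one possible protection step. Substituting a more general term or a pseudonym would slot into `redact` in the same place.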