Document anonymization using the Pipeline module
In this blog post, we show how to anonymize documents using the new Pipeline module. The module streamlines the process: the user defines how the anonymization should be performed, where the documents to be anonymized are located, and where the anonymized documents should be stored.
The pipeline will automatically extract the text from the documents, anonymize the text, and store the anonymized text in the output folder.
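To make the three steps concrete, here is a minimal sketch of the extract, anonymize, and store flow using only the Python standard library. It is not the Pipeline module's API: the function names (`extract_text`, `anonymize_text`, `run_pipeline`), the restriction to plain-text `.txt` files, and the e-mail-only masking rule are all assumptions made for illustration.

```python
import re
from pathlib import Path
from typing import List

# Hypothetical stand-in for a real anonymizer: it masks e-mail addresses only.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def extract_text(path: Path) -> str:
    """Step 1: extract text (this sketch only handles plain-text files)."""
    return path.read_text(encoding="utf-8")


def anonymize_text(text: str) -> str:
    """Step 2: replace every e-mail address with a placeholder."""
    return EMAIL_RE.sub("<EMAIL>", text)


def run_pipeline(input_dir: Path, output_dir: Path) -> List[Path]:
    """Step 3: write one anonymized file per input file to output_dir."""
    output_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for source in sorted(input_dir.glob("*.txt")):
        target = output_dir / source.name
        target.write_text(anonymize_text(extract_text(source)), encoding="utf-8")
        written.append(target)
    return written
```

Calling `run_pipeline(Path("input"), Path("output"))` would process every `.txt` file in a hypothetical `input` folder and leave the originals untouched. A real pipeline would replace `extract_text` with format-aware extraction and `anonymize_text` with a proper entity detector.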