Implementation of a distributed personal information recognition system that identifies sensitive personal data in text documents and anonymizes it while preserving the statistical value of the data.
Updated Mar 22, 2023 - Python
Redactify is a data redaction tool that secures sensitive text using NLP and rule-based methods. It combines transformer-based NER, regular expressions, and Presidio analysis to detect personal information, then masks it through either full redaction or partial masking, supporting compliance while preserving data utility.
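To make the difference between full redaction and partial masking concrete, here is a minimal sketch of the rule-based (regex) part only. It is not Redactify's code and does not use Presidio or a transformer model; the patterns, the `find_spans`, `partial_mask`, and `redact` names, and the choice to keep the last four characters are all assumptions made for illustration.

```python
import re

# Hypothetical, deliberately simple patterns; real detectors are far more robust.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def find_spans(text):
    """Return non-overlapping (start, end, label) matches, sorted by start."""
    spans = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    spans.sort()
    kept = []
    last_end = 0
    for start, end, label in spans:
        # Drop any match that overlaps one already kept.
        if start >= last_end:
            kept.append((start, end, label))
            last_end = end
    return kept


def partial_mask(value, keep=4):
    """Replace all but the last `keep` characters with asterisks."""
    if len(value) <= keep:
        return "*" * len(value)
    return "*" * (len(value) - keep) + value[-keep:]


def redact(text, mode="full"):
    """Mask detected spans: 'full' swaps in a label, 'partial' keeps a suffix."""
    if mode not in ("full", "partial"):
        raise ValueError("mode must be 'full' or 'partial'")
    pieces = []
    cursor = 0
    for start, end, label in find_spans(text):
        pieces.append(text[cursor:start])
        if mode == "full":
            pieces.append(f"<{label}>")
        else:
            pieces.append(partial_mask(text[start:end]))
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


sample = "Contact jane@example.com or 555-123-4567."
print(redact(sample, mode="full"))
print(redact(sample, mode="partial"))
```

Full redaction removes the value entirely and leaves only its type, while partial masking keeps a short suffix so records can still be told apart. Which one is appropriate depends on how much utility the downstream use needs versus how much residual information is acceptable.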
Demo of PII detection and redaction with Microsoft Presidio