Presidio - Data Protection and De-identification
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
https://github.com/microsoft/presidio
I don't have an immediate use for this now, but there's been times I sure wish I knew about or had this.