Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask, and synthesize PII in text.
Updated Sep 16, 2025 - Jupyter Notebook
De-identify people's names and gender-specific pronouns
Clean your text for statistical ML and language models
This repository contains the code and data for the text re-identification attack presented in B. Manzanares-Salor, D. Sánchez, P. Lison, Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack, Data Mining and Knowledge Discovery, 2024.
Simple project on HTML anonymization
This repository contains the code and data for the text anonymization enhancement method presented in B. Manzanares-Salor, D. Sánchez, Enhancing text anonymization via re-identification risk-based explainability, Submitted, 2024.
🛡️ Text Anonymizer using NuNER Zero-shot: an academic project that auto-detects and masks sensitive entities (names, orgs, locations) in .txt/.docx files using zero-shot NER. Features a CLI and a REST API. Replaces data with realistic fakes (Faker) or [REDACTED]. Built with Python, Flask, and GLiNER.
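As a rough illustration of the masking step these projects describe, the sketch below replaces already-detected entity spans with `[REDACTED]` or with a label placeholder. It is not taken from any of the listed repositories: the `redact` function, the `Span` type, and the hard-coded spans are all hypothetical, and a real pipeline would obtain the spans from an NER model rather than by hand.

```python
from typing import List, Tuple

# Hypothetical span format: (start, end, label) character offsets.
# In a real pipeline these would come from an NER model; here they
# are hard-coded purely for illustration.
Span = Tuple[int, int, str]


def redact(text: str, spans: List[Span], use_labels: bool = False) -> str:
    """Replace each span with [REDACTED], or with its label such as [PERSON]."""
    pieces = []
    cursor = 0
    for start, end, label in sorted(spans):
        if start < cursor:
            continue  # skip spans that overlap one already replaced
        pieces.append(text[cursor:start])  # keep the text before the entity
        pieces.append(f"[{label}]" if use_labels else "[REDACTED]")
        cursor = end
    pieces.append(text[cursor:])  # keep the text after the last entity
    return "".join(pieces)


text = "Alice Smith works at Acme Corp in Berlin."
spans = [(0, 11, "PERSON"), (21, 30, "ORG"), (34, 40, "LOCATION")]

print(redact(text, spans))
print(redact(text, spans, use_labels=True))
```

Keeping detection and replacement separate means the same replacement step could be reused with any detector, or swapped for one that substitutes realistic fake values instead of placeholders.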