How anonymize.today Works
Deterministic, regex-based PII detection that delivers 100% reproducible results. Same input, same output—every time. No AI, no guessing, just transparent pattern matching.
Why Regex, Not AI?
Our Approach
- 100% reproducible results
- Fully auditable for compliance
- No training data required
- Transparent decision making
- Fast, predictable performance
- No model drift over time
AI/ML Approaches
- Results vary between runs
- Black box decision making
- Requires training data
- Difficult to audit
- Higher compute costs
- Model drift over time
The 10-Step Process
From input to output, here's exactly what happens to your document
Input Text
Submit your document via web interface, API, or Word Add-in
Language Detection
System identifies the document language for optimal processing
Tokenization
Text is broken into tokens for pattern matching
Pattern Matching
Regex patterns scan for 256 entity types
Context Analysis
Surrounding text improves detection accuracy
Confidence Scoring
Each detection receives a confidence score
Entity Classification
Detected items are categorized by type
Review Results
See all detections with positions and scores
Apply Anonymization
Choose your method: Replace, Redact, Hash, Encrypt, or Mask
Output Document
Download your anonymized document
Frequently Asked Questions
Why does anonymize.today use regex instead of AI for PII detection?
How accurate is the PII detection?
Can I audit how anonymize.today processes my data?
What happens to my data during processing?
How does anonymize.today handle multiple languages in one text?
See It in Action
Try our PII detection and anonymization free with 300 tokens per month.