AnalyzerEngine | Philter detection (regex + dictionaries + PhEye lenses) | Same role: find entities in text. Philter's pattern layer covers Presidio's regex recognizers; PhEye covers Presidio's NLP-based recognizers. |
|---|
AnonymizerEngine | Filter strategies (mask, redact, encrypt, FPE, replace, abbreviate) | Philter offers more strategies, including format-preserving encryption and deterministic synthetic-value replacement. |
|---|
| Built-in recognizers (PERSON, EMAIL, SSN, etc.) | Default policy entities | Direct one-to-one mapping for most common entity types. Philter ships additional ones (medical record numbers, custom identifiers) out of the box. |
|---|
Custom PatternRecognizer | Custom identifier definitions in policy JSON | Define your own regex, dictionaries, or identifier patterns in the policy file. No Python class to subclass. |
|---|
| Context-aware recognizers (with NLP) | PhEye lenses (purpose-trained models) | Philter's lenses are purpose-trained for PII/PHI detection, not generic NER. Healthcare and other domain lenses are available out of the box. |
|---|
| Operator config (mask, redact, replace, hash) | Per-entity filter strategy in policy | Configured per entity type in the policy JSON. Conditional rules and severity scoring give you finer control. |
|---|
presidio-analyzer Python package | Philter API (HTTP) or Phileas library | Philter is a turnkey HTTP service. If you want an embedded library, use Phileas (Java, Python, .NET, Go). |
|---|
| Docker images on quay.io | AWS / GCP / Azure marketplace or self-built container | Philter is available on the cloud marketplaces for one-click deploy, or as a container image for custom builds. |
|---|