Talk to an Expert

Tell us about your stack and the privacy problems you're trying to solve. We typically respond within one business day.

Prefer to skip the form? Pick a time on our calendar →
or send a message

Please do not enter PII or PHI in this form. If you need to share an example, use a sanitized one.

← All lenses

PII Lens

General Purpose

Broad PII baseline for documents that don't fit a specific domain — customer-support tickets, internal correspondence, generic business records. The default lens loaded by PhEye when no other is specified.

  • Status available
  • License Apache-2.0
  • Version 1.0.0
  • Updated 2026-05-22
  • PhEye compatibility >=1.0.0
  • Languages en
  • Model size 180 MB
  • Author Philterd

Entities detected

  • PERSON
  • LOCATION
  • ORG
  • DATE
  • PHONE
  • EMAIL
  • URL
  • SSN

When to load this lens

Load this lens when the document mix is heterogeneous or when you don't yet know which specialized lens fits the workload. It's the floor for PII detection across the toolkit.

What this lens detects

Eight high-confidence entity classes that cover the vast majority of PII surface in generic business text: people’s names, location mentions, organization names, dates, phone numbers, email addresses, URLs, and SSN-shaped numeric sequences.

This is the default lens. If PhEye is started without an explicit lens configuration, this is the lens that loads.

When to use this

  • Customer-support tickets, internal correspondence, generic business records.
  • As the baseline in a multi-lens configuration: load General Purpose plus one or more specialized lenses, and the policy engine merges the detections.
  • When you don’t yet know which specialized lens fits your data — start here, measure recall against a labeled sample with Philter Scope, then add specialized lenses where General Purpose is weakest.

Known limitations

  • Domain entities are out of scope. Hospital names, medication mentions, case numbers, financial-account-specific patterns, foreign-language identifiers — General Purpose either misses these entirely or labels them as the closest generic class (e.g., a hospital name might come back as LOCATION instead of HOSPITAL).
  • Single-language by design. Trained on English. Documents in other languages get poor recall — load a language-specific lens alongside (Spanish PII, French PII, etc.) for non-English text.
  • SSN-shaped, not SSN-validated. The lens flags 9-digit sequences in SSN format; downstream validation (area-number ranges, group-number checks) belongs in the policy layer.

Use this lens with PhEye, Phileas, or Philter

PhEye loads this lens at configuration time and exposes it to Phileas and Philter automatically. Have questions about a specific deployment? Talk to the team.

About PhEye →