Structured PII (Regex Detection)
These types have well-defined patterns and are detected using optimized regular expressions with validation.

| Type | Description | Example | Validation |
|---|---|---|---|
| EMAIL | Email addresses | [email protected] | RFC 5322 format |
| PHONE | Phone numbers (international) | +49 30 123456 | E.164 patterns |
| IBAN | International Bank Account Numbers | DE89370400440532013000 | Checksum validation |
| BIC_SWIFT | Bank Identifier Codes | COBADEFFXXX | Format validation |
| CREDIT_CARD | Credit card numbers | 4111111111111111 | Luhn algorithm |
| IP_ADDRESS | IPv4 and IPv6 addresses | 192.168.1.1 | Format validation |
| URL | Web URLs | https://example.com | URI format |
| CASE_ID | Case/ticket numbers | CASE-12345 | Configurable pattern |
| CUSTOMER_ID | Customer identifiers | CUST-ABC123 | Configurable pattern |

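
As a rough illustration of how pattern matching and validation work together, the sketch below finds credit card and IBAN candidates with deliberately loose regular expressions and keeps only those that pass the Luhn or mod-97 checksum. The patterns and the function names (`luhn_valid`, `iban_valid`, `find_structured`) are assumptions made for this example; they are not Rehydra's actual expressions or API.

```python
import re

# Loose candidate patterns; illustrative only, not Rehydra's real expressions.
CARD_RE = re.compile(r"\b\d{13,19}\b")
IBAN_RE = re.compile(r"\b[A-Z]{2}\d{2}[A-Z0-9]{11,30}\b")


def luhn_valid(number: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    # Walk the digits right to left, doubling every second one.
    for i, ch in enumerate(reversed(number)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def iban_valid(iban: str) -> bool:
    """Return True if the IBAN passes the mod-97 check."""
    # Move the country code and check digits to the end.
    rearranged = iban[4:] + iban[:4]
    # Letters map to numbers: A=10, B=11, ..., Z=35.
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(digits) % 97 == 1


def find_structured(text: str) -> list[tuple[str, str]]:
    """Return (type, value) pairs for candidates that pass validation."""
    hits = []
    for m in CARD_RE.finditer(text):
        if luhn_valid(m.group()):
            hits.append(("CREDIT_CARD", m.group()))
    for m in IBAN_RE.finditer(text):
        if iban_valid(m.group()):
            hits.append(("IBAN", m.group()))
    return hits
```

The validation step is what keeps false positives down: a 16-digit order number matches the card pattern, but it is discarded unless its checksum also holds.
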
Soft PII (NER Detection)
These types require contextual understanding and are detected using a trained NER (Named Entity Recognition) model.

| Type | Description | Example | Semantic Attributes |
|---|---|---|---|
| PERSON | Person names | John Smith, Maria | gender (male/female/neutral) |
| ORG | Organization names | Acme Corp, Google | - |
| LOCATION | Places and locations | Berlin, Germany | scope (city/country/region) |
| ADDRESS | Physical addresses | 123 Main St | - |
| DATE_OF_BIRTH | Dates of birth | born on March 15, 1990 | - |

NER detection requires initializing with a model mode other than 'disabled'. See the NER Detection Guide for setup.

Priority Resolution
When multiple detections overlap (for example, an email address that also matches as a URL), Rehydra uses priority ordering to decide which detection to keep.

Confidence Thresholds
NER-detected entities carry confidence scores. You can configure minimum thresholds for them.

Type-Specific Detection Control
Specific PII types can be enabled or disabled.

Custom ID Patterns
Domain-specific patterns can be added for case IDs and customer IDs.

Placeholder Format
Detected PII is replaced with XML-like placeholder tags. This format is:

- Preserved by translation APIs: most services treat XML-like tags as non-translatable
- Parseable: easy to extract for rehydration
- Informative: type and attributes help with contextual translation
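
To make these properties concrete, here is a minimal sketch of building, parsing, and rehydrating such tags. The `<PII type="..." id="..."/>` tag shape, the attribute names, and the helper functions are assumptions for illustration only; the text above states just that placeholders are XML-like and carry a type and attributes.

```python
import re

# Assumed tag shape for this sketch: <PII type="PERSON" id="1" gender="female"/>
TAG_RE = re.compile(r"<PII\s+([^>]*?)/>")
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')


def make_placeholder(pii_type: str, pii_id: int, **attrs: str) -> str:
    """Build a self-closing placeholder tag with optional semantic attributes."""
    parts = [f'type="{pii_type}"', f'id="{pii_id}"']
    parts += [f'{key}="{value}"' for key, value in attrs.items()]
    return "<PII " + " ".join(parts) + "/>"


def parse_placeholders(text: str) -> list[dict[str, str]]:
    """Extract the attributes of every placeholder tag found in the text."""
    return [dict(ATTR_RE.findall(m.group(1))) for m in TAG_RE.finditer(text)]


def rehydrate(text: str, originals: dict[str, str]) -> str:
    """Swap each placeholder back for its original value, looked up by id."""
    def restore(match: re.Match) -> str:
        attrs = dict(ATTR_RE.findall(match.group(1)))
        # Leave the tag untouched if its id is unknown.
        return originals.get(attrs.get("id", ""), match.group(0))

    return TAG_RE.sub(restore, text)


tag = make_placeholder("PERSON", 1, gender="female")
masked = f"Hello {tag}, welcome."
restored = rehydrate(masked, {"1": "Maria"})
```

Because the tag is self-closing and keeps its data in attributes, a single regular expression is enough to locate every placeholder after translation, and the `id` attribute links each one back to the value it replaced.
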