Model Modes
| Mode | Description | Size | Use Case |
|---|---|---|---|
'disabled' | No NER, regex only | 0 | Fast processing, structured PII only |
'quantized' | Smaller quantized model | ~280 MB | Recommended for most use cases |
'standard' | Full-size model | ~1.1 GB | Maximum accuracy |
'custom' | Your own ONNX model | Varies | Domain-specific models |
Basic Setup
Download Progress
Track model download progress:Confidence Thresholds
NER entities have confidence scores (0.0-1.0). Configure minimum thresholds:Auto-Download Control
By default, models are downloaded automatically. To disable:Manual Model Management
Pre-download models or manage cache:Custom Models
Use your own ONNX model:Custom models must follow the same input/output format as the default models. See the model training guide for details.
Cache Locations
Models are cached locally for offline use:Node.js
| Platform | Location |
|---|---|
| macOS | ~/Library/Caches/rehydra/models/ |
| Linux | ~/.cache/rehydra/models/ |
| Windows | %LOCALAPPDATA%/rehydra/models/ |
Browser
In browsers, models are stored using:- Origin Private File System (OPFS) for large model files
- IndexedDB for metadata
NER-Detected Types
| Type | Examples |
|---|---|
PERSON | John Smith, Maria, Dr. Johnson |
ORG | Acme Corp, Google, United Nations |
LOCATION | Berlin, Germany, Central Park |
ADDRESS | 123 Main Street |
DATE_OF_BIRTH | born on March 15, 1990 |
Disabling Specific NER Types
Detect only certain entity types:Performance Tips
Reuse the anonymizer instance
Reuse the anonymizer instance
Model loading is expensive. Create once and reuse:
Use quantized model for most cases
Use quantized model for most cases
The quantized model is ~95% as accurate but 4x smaller:
| Model | Size | Inference Time |
|---|---|---|
| Standard | ~1.1 GB | ~120ms |
| Quantized | ~280 MB | ~100ms |
Skip NER for structured-only PII
Skip NER for structured-only PII
If you only need emails, phones, IBANs, etc.: