.github/workflows/codeql-analysis.yml
3 additions & 3 deletions
@@ -55,7 +55,7 @@ jobs:
 # your codebase is analyzed, see https://docs.github.com/en/code-security/code-scanning/creating-an-advanced-setup-for-code-scanning/codeql-code-scanning-for-compiled-languages
CHANGELOG.md
9 additions & 1 deletion
@@ -3,11 +3,19 @@
 All notable changes to this project will be documented in this file.
 
 ## [unreleased]
+### General
+#### Added
+- Published `presidio` as a PyPI meta-package that installs `presidio-analyzer` and `presidio-anonymizer`, making `pip install presidio` work as expected. Inspired by and thanks to Sakthi Santhosh Anumand and Harsha Vardhan for the original idea.
+
 ### Analyzer
 #### Added
-- UK Driving Licence Number (UK_DRIVING_LICENCE) recognizer with pattern matching and context support
+- Nigerian National Identification Number (NG_NIN) recognizer with Verhoeff checksum validation
+- Nigerian Vehicle Registration (NG_VEHICLE_REGISTRATION) recognizer for current format plates (2011+)
 - US_NPI recognizer for National Provider Identifier with Luhn checksum validation and context support (#1847) (Thanks @stevenelliottjr)
+- UK Driving Licence Number (UK_DRIVING_LICENCE) recognizer with pattern matching and context support
 - UK Postcode (UK_POSTCODE) recognizer with pattern matching and context support
+- UK Passport Number (UK_PASSPORT) recognizer for 2-letter + 7-digit format passports issued from 2015 onwards
+- UK Vehicle Registration Number (UK_VEHICLE_REGISTRATION) recognizer for current (2001+), prefix (1983-2001), and suffix (1963-1983) plate formats
GLiNERRecognizer supports using ONNX Runtime as a backend, which provides better CPU compatibility and can prevent crashes on older CPUs without AVX2 instruction set support (e.g., Intel Sandy Bridge).
+
+### Using ONNX Runtime Backend
+
+To use ONNX Runtime with GLiNER:
+
+```python
+from presidio_analyzer.predefined_recognizers import GLiNERRecognizer
+
+# Enable ONNX Runtime backend
+gliner_recognizer = GLiNERRecognizer(
+    model_name="urchade/gliner_multi_pii-v1",
+    entity_mapping=entity_mapping,
+    load_onnx_model=True,  # Enable ONNX Runtime
+    map_location="cpu",
+)
+```
+
+**Benefits of using ONNX Runtime:**
+- Works on older CPUs without AVX2 support
+- Prevents "Illegal instruction (core dumped)" crashes on incompatible hardware
+- Can provide better performance on certain CPU architectures
+
+**Note:** Make sure `onnxruntime` is installed when using this feature. It's included in the `gliner` extra dependencies.
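The note about `onnxruntime` being installed can be checked up front rather than discovered at model load time. The following is a minimal sketch using only the standard library; the helper names `onnx_backend_available` and `choose_load_onnx_model` are hypothetical and not part of Presidio or GLiNER.

```python
import importlib.util


def onnx_backend_available() -> bool:
    """Return True if the `onnxruntime` package can be found for import."""
    return importlib.util.find_spec("onnxruntime") is not None


def choose_load_onnx_model(prefer_onnx: bool = True) -> bool:
    """Hypothetical helper: pick a value for the `load_onnx_model` argument.

    Falls back to False when `onnxruntime` is not installed, so the caller
    can decide whether to continue with the default backend.
    """
    return prefer_onnx and onnx_backend_available()
```

The result of `choose_load_onnx_model()` could then be passed as `load_onnx_model=` when constructing the recognizer shown in the diff above.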
docs/supported_entities.md
9 additions & 1 deletion
@@ -42,10 +42,12 @@ For more information, refer to the [adding new recognizers documentation](analyz
 
 |Entity Type|Description|Detection Method|
 |--- |--- |--- |
+|UK_DRIVING_LICENCE|UK driving licence number issued by the DVLA. A 16-character alphanumeric string encoding surname, date of birth, and initials.|Pattern match, context and validation|
 |UK_NHS|A UK NHS number is 10 digits.|Pattern match, context and checksum|
 |UK_NINO|UK [National Insurance Number](https://en.wikipedia.org/wiki/National_Insurance_number) is a unique identifier used in the administration of National Insurance and tax.|Pattern match and context|
-|UK_DRIVING_LICENCE|UK driving licence number issued by the DVLA. A 16-character alphanumeric string encoding surname, date of birth, and initials.|Pattern match, context and validation|
+|UK_PASSPORT|A UK passport number consists of 2 letters followed by 7 digits (e.g., AB1234567), used in passports issued from 2015 onwards.|Pattern match and context|
 |UK_POSTCODE|A UK [postcode](https://en.wikipedia.org/wiki/Postcodes_in_the_United_Kingdom) is a 5-8 character alphanumeric code used by the Royal Mail for mail sorting.|Pattern match and context|
+|UK_VEHICLE_REGISTRATION|A UK vehicle registration number (number plate) in current (2001+), prefix (1983-2001), or suffix (1963-1983) format.|Pattern match, context and validation|
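To illustrate the "2 letters followed by 7 digits" format described in the UK_PASSPORT row, here is a small sketch using Python's `re` module. The regular expression and the function name are assumptions made for illustration; they are not the pattern the recognizer in this PR actually uses, and a pattern this loose would rely on context words to avoid false positives.

```python
import re
from typing import List

# Illustrative pattern only (an assumption, not the recognizer's real regex):
# two uppercase letters followed by exactly seven digits, on word boundaries.
UK_PASSPORT_SKETCH = re.compile(r"\b[A-Z]{2}\d{7}\b")


def find_uk_passport_candidates(text: str) -> List[str]:
    """Return every substring of `text` that matches the sketch pattern."""
    return UK_PASSPORT_SKETCH.findall(text)
```

Calling `find_uk_passport_candidates("Passport no. AB1234567")` picks out the example value from the table, while strings with one letter or eight digits are skipped.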
 
 ### Spain
 
@@ -111,6 +113,12 @@ For more information, refer to the [adding new recognizers documentation](analyz
 | KR_RRN | The Korean Resident Registration Number (RRN) is a 13-digit number issued to all Korean residents. | Pattern match, context and custom logic. |
| NG_NIN | The Nigerian National Identification Number (NIN) is a unique 11-digit number issued by the National Identity Management Commission (NIMC). | Pattern match, context, and checksum |
+| NG_VEHICLE_REGISTRATION | Nigerian vehicle registration plate number in the current format (2011+): 3 letters (LGA code), 3 digits (serial), 2 letters (year/batch). | Pattern match and context |
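The NG_NIN row lists a checksum among its detection methods, and the changelog entry names it as a Verhoeff checksum. As a rough sketch of what such a check involves, the block below implements the standard Verhoeff validation with its usual lookup tables. It is not taken from the Presidio recognizer, and the function names `verhoeff_is_valid` and `looks_like_ng_nin` are hypothetical.

```python
# Multiplication table of the dihedral group D5 used by the Verhoeff scheme.
_D = (
    (0, 1, 2, 3, 4, 5, 6, 7, 8, 9),
    (1, 2, 3, 4, 0, 6, 7, 8, 9, 5),
    (2, 3, 4, 0, 1, 7, 8, 9, 5, 6),
    (3, 4, 0, 1, 2, 8, 9, 5, 6, 7),
    (4, 0, 1, 2, 3, 9, 5, 6, 7, 8),
    (5, 9, 8, 7, 6, 0, 4, 3, 2, 1),
    (6, 5, 9, 8, 7, 1, 0, 4, 3, 2),
    (7, 6, 5, 9, 8, 2, 1, 0, 4, 3),
    (8, 7, 6, 5, 9, 3, 2, 1, 0, 4),
    (9, 8, 7, 6, 5, 4, 3, 2, 1, 0),
)

# Position-dependent permutation table (repeats every 8 positions).
_P = (
    (0, 1, 2, 3, 4, 5, 6, 7, 8, 9),
    (1, 5, 7, 6, 2, 8, 3, 0, 9, 4),
    (5, 8, 0, 3, 7, 9, 6, 1, 4, 2),
    (8, 9, 1, 6, 0, 4, 3, 5, 2, 7),
    (9, 4, 5, 3, 1, 2, 6, 8, 7, 0),
    (4, 2, 8, 6, 5, 7, 3, 9, 0, 1),
    (2, 7, 9, 3, 8, 0, 6, 4, 1, 5),
    (7, 0, 4, 6, 9, 1, 3, 2, 5, 8),
)


def verhoeff_is_valid(number: str) -> bool:
    """Return True if `number` (check digit last) passes the Verhoeff check."""
    if not (number.isascii() and number.isdigit()):
        return False
    checksum = 0
    # Process digits right to left; the check digit sits at position 0.
    for position, char in enumerate(reversed(number)):
        checksum = _D[checksum][_P[position % 8][int(char)]]
    return checksum == 0


def looks_like_ng_nin(candidate: str) -> bool:
    """Hypothetical helper: 11 digits that also pass the Verhoeff check."""
    return len(candidate) == 11 and verhoeff_is_valid(candidate)
```

For example, `verhoeff_is_valid("2363")` holds for the textbook value 236 with check digit 3, while changing the last digit to 4 makes the check fail. Unlike Luhn, the Verhoeff scheme catches all single-digit errors and all adjacent transpositions, which is a plausible reason for using it on an 11-digit identifier.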