Pull requests: huggingface/tokenizers

Pull requests list

perf: skip alignment tracking in encode_fast normalization
#2022 opened Apr 10, 2026 by ArthurZucker (Collaborator)
Reduce crate size
#2015 opened Apr 9, 2026 by ArthurZucker (Collaborator)
node: bump version to 0.22.2 for release
#2009 opened Apr 4, 2026 by MayCXC (Contributor)
feat(pattern): parallel regex find_matches for large inputs
#2003 opened Mar 31, 2026 by McPatate (Member)
fix: skip serializing ByteLevel fields at their default value
#2001 opened Mar 30, 2026 by ArthurZucker (Collaborator)
Regex split parity
#1991 opened Mar 27, 2026 by ArthurZucker (Collaborator)
feat: add new faster whitespace split pretok
#1985 opened Mar 26, 2026 by McPatate (Member)
Implementing Parity-aware BPE
#1974 opened Mar 21, 2026 by cimeister
feat: add pcre2 as optional feature
#1959 opened Mar 2, 2026 by wheynelau (Contributor)
Add get_special_tokens and is_special_token methods
#1945 opened Feb 5, 2026 by ArthurZucker (Collaborator)
2 tasks done
Add post_process_tokens and post_process_ids methods
#1944 opened Feb 5, 2026 by ArthurZucker (Collaborator)
3 tasks done
feat: add unk_token property to Unigram model
#1943 opened Feb 5, 2026 by ArthurZucker (Collaborator)
4 tasks done
🚨 feat: add role_to_token field for special token metadata
#1942 opened Feb 5, 2026 by ArthurZucker (Collaborator)
fix: added type hints in .py files
#1932 opened Jan 20, 2026 by ashmi8
Include license file into python wheels
#1931 opened Jan 20, 2026 by justeph
Upgrade GitHub Actions for Node 24 compatibility
#1916 opened Dec 20, 2025 by salmanmkc
Fix undefined names in docs/source/_ext/entities.py
#1895 opened Nov 28, 2025 by cclauss