-
- Downloads
regexp/syntax: recognize category aliases like \p{Letter}
The Unicode specification defines aliases for some of the general category names. For example the category "L" has alias "Letter". The regexp package supports \p{L} but not \p{Letter}, because there was nothing in the Unicode tables that lets regexp know about Letter. Now that package unicode provides CategoryAliases (see #70780), we can use it to provide \p{Letter} as well. This is the only feature missing from making package regexp suitable for use in a JSON-API Schema implementation. (The official test suite includes usage of aliases like \p{Letter} instead of \p{L}.) For better conformity with Unicode TR18, also accept case-insensitive matches for names and ignore underscores, hyphens, and spaces; and add Any, ASCII, and Assigned. Fixes #70781. Change-Id: I50ff024d99255338fa8d92663881acb47f1e92a5 Reviewed-on: https://go-review.googlesource.com/c/go/+/641377 LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com> Reviewed-by:Alan Donovan <adonovan@google.com>
Showing
- doc/next/6-stdlib/99-minor/regexp/syntax/70781.md 4 additions, 0 deletionsdoc/next/6-stdlib/99-minor/regexp/syntax/70781.md
- src/regexp/syntax/doc.go 2 additions, 1 deletionsrc/regexp/syntax/doc.go
- src/regexp/syntax/parse.go 101 additions, 8 deletionssrc/regexp/syntax/parse.go
- src/regexp/syntax/parse_test.go 6 additions, 0 deletionssrc/regexp/syntax/parse_test.go
Loading
Please register or sign in to comment