summaryrefslogtreecommitdiff
path: root/lisp/emacs-lisp/rx.el
diff options
context:
space:
mode:
authorEli Zaretskii <eliz@gnu.org>2015-04-14 18:47:04 +0300
committerEli Zaretskii <eliz@gnu.org>2015-04-14 18:47:04 +0300
commit6c284c6b5828bc4407f7201499e0507ce0e5a0a0 (patch)
tree0e89f736a55245dc7e59725e57effb3b36b6dfe8 /lisp/emacs-lisp/rx.el
parent8802474a219ad3be01825466a8837d3775f8b31b (diff)
downloademacs-6c284c6b5828bc4407f7201499e0507ce0e5a0a0.tar.gz
emacs-6c284c6b5828bc4407f7201499e0507ce0e5a0a0.tar.bz2
emacs-6c284c6b5828bc4407f7201499e0507ce0e5a0a0.zip
Make [:print:] support non-ASCII characters correctly
* src/regex.c (ISPRINT): Call 'printablep' for multibyte characters. (BIT_PRINT): New bit mask. (re_wctype_to_bit): Return BIT_PRINT for RECC_PRINT. * src/character.c (printablep): New function. * src/character.h (printablep): Add prototype. * lisp/emacs-lisp/rx.el (rx): Doc fix: document the new behavior of 'print', 'alnum', and 'alphabetic'. * doc/lispref/searching.texi (Char Classes): Document the new behavior of [:print:]. * etc/NEWS: Mention the new behavior of [:print:].
Diffstat (limited to 'lisp/emacs-lisp/rx.el')
-rw-r--r--lisp/emacs-lisp/rx.el12
1 files changed, 6 insertions, 6 deletions
diff --git a/lisp/emacs-lisp/rx.el b/lisp/emacs-lisp/rx.el
index 20af59f2abf..a5a228e5876 100644
--- a/lisp/emacs-lisp/rx.el
+++ b/lisp/emacs-lisp/rx.el
@@ -969,16 +969,16 @@ CHAR
space, and DEL.
`printing', `print'
- matches printing characters--everything except ASCII control chars
- and DEL.
+ matches printing characters--everything except ASCII and non-ASCII
+ control characters, surrogates, and codepoints unassigned by Unicode.
`alphanumeric', `alnum'
- matches letters and digits. (But at present, for multibyte characters,
- it matches anything that has word syntax.)
+ matches alphabetic characters and digits. (For multibyte characters,
+ it matches according to Unicode character properties.)
`letter', `alphabetic', `alpha'
- matches letters. (But at present, for multibyte characters,
- it matches anything that has word syntax.)
+ matches alphabetic characters. (For multibyte characters,
+ it matches according to Unicode character properties.)
`ascii'
matches ASCII (unibyte) characters.