diff options
author | Robert Pluim <rpluim@gmail.com> | 2023-05-28 16:36:44 +0200 |
---|---|---|
committer | Robert Pluim <rpluim@gmail.com> | 2023-05-28 16:40:39 +0200 |
commit | c0c7b947baf5e086a857055d18dab04ce13d1f6a (patch) | |
tree | 41b3b2a7668331ae6cde69e83dad3fca570c6c8b /admin/notes | |
parent | 3be33aff9638daf0012e339b19906581befb0247 (diff) | |
download | emacs-c0c7b947baf5e086a857055d18dab04ce13d1f6a.tar.gz emacs-c0c7b947baf5e086a857055d18dab04ce13d1f6a.tar.bz2 emacs-c0c7b947baf5e086a857055d18dab04ce13d1f6a.zip |
Add instructions and test file for VS-15/VS-16
* admin/notes/unicode: Add instructions for emoji-variation-sequences.txt
* admin/unidata/emoji-variation-sequences.txt: New file, imported from
Unicode 15.
Diffstat (limited to 'admin/notes')
-rw-r--r-- | admin/notes/unicode | 17 |
1 files changed, 16 insertions, 1 deletions
diff --git a/admin/notes/unicode b/admin/notes/unicode index 0e000365da7..31c850af8fd 100644 --- a/admin/notes/unicode +++ b/admin/notes/unicode @@ -22,6 +22,7 @@ Emacs uses the following files from the Unicode Character Database . confusables.txt . emoji-data.txt . emoji-zwj-sequences.txt + . emoji-variation-sequences.txt . emoji-sequences.txt . BidiCharacterTest.txt @@ -29,7 +30,7 @@ Emacs also uses the file emoji-test.txt which should be imported from the Unicode's Public/emoji/ directory, and IdnaMappingTable.txt from the Public/idna/ directory. -First, the first 14 files, emoji-test.txt and IdnaMappingTable.txt +First, the first 15 files, emoji-test.txt and IdnaMappingTable.txt need to be copied into admin/unidata/, and the file https://www.unicode.org/copyright.html should be copied over copyright.html in admin/unidata (some of them might need trailing @@ -142,6 +143,20 @@ generated for auto-composition-emoji-eligible-codepoints by admin/unidata/emoji-zwj.awk. Note that your emoji font might not have glyphs for the newest codepoints yet. +Visit "emoji-variation-sequences.txt", and run the following lisp +fragment to actually insert the described codepoints, then check that +all the text in parentheses displays correctly (it can be helpful to +have `glyphless-char-display-control' customized to show hex codes for +variation selectors). + +(save-excursion + (goto-char (point-min)) + (while (re-search-forward "^\\([0-9A-Z]+\\) \\([0-9A-Z]+\\).*(\\([^)]+\\))" nil t) + (let ((ch (string-to-number (match-string 1) 16)) + (sel (string-to-number (match-string 2) 16)) + (sp (match-string 3))) + (replace-match (format "%s %c%c " sp ch sel) nil nil nil 3)))) + Finally, etc/NEWS should be updated to announce the support for the new Unicode version. |