comparison README.unicode @ 89190:9693e41cc2fd
*** empty log message ***
author | Dave Love <fx@gnu.org> |
---|---|
date | Wed, 09 Oct 2002 22:51:44 +0000 |
parents | 52c1682e6353 |
children | 3b4e014cc159 |
89189:068e199a732e | 89190:9693e41cc2fd |
---|---|
1 -*-text-*- | 1 -*-text; coding: latin-1;-*- |
2 | 2 |
3 Problems, fixmes and other issues in the emacs-unicode branch | 3 Problems, fixmes and other issues in the emacs-unicode branch |
4 ------------------------------------------------------------- | 4 ------------------------------------------------------------- |
5 | 5 |
6 Notes by fx to record various things of variable importance. handa | 6 Notes by fx to record various things of variable importance. handa |
13 existing support and the extra stuff at | 13 existing support and the extra stuff at |
14 <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk. | 14 <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk. |
15 (Editing support is mostly orthogonal to the internal representation.) | 15 (Editing support is mostly orthogonal to the internal representation.) |
16 | 16 |
17 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has | 17 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has |
18 undesirable effects. | 18 undesirable effects. E.g.: |
19 (multibyte-string-p (let ((s "x")) (aset s 0 ?£) s)) => nil | |
20 (multibyte-string-p (concat [?£])) => nil | |
21 (text-char-description ?£) => "M-#" | |
19 | 22 |
20 * Rationalize character syntax and its relationship to the Unicode | 23 * Rationalize character syntax and its relationship to the Unicode |
21 database. Specifically, the latin-N.el files aren't consistent for | 24 database. (Applies mainly to symbol and punctuation syntax.) |
22 common characters (and obviously have redundancies except in | |
23 unibyte mode). | |
24 | 25 |
25 * Fontset handling and customization needs work. We want to relate | 26 * Fontset handling and customization needs work. We want to relate |
26 fonts to scripts, probably based on the Unicode blocks. The | 27 fonts to scripts, probably based on the Unicode blocks. The |
27 presence of small-repertoire 10646-encoded fonts in XFree 4 is a | 28 presence of small-repertoire 10646-encoded fonts in XFree 4 is a |
28 pain, not currently worked round. | 29 pain, not currently worked round. |
40 | 41 |
41 * Quail files need more work now the encoding is irrelevant. | 42 * Quail files need more work now the encoding is irrelevant. |
42 | 43 |
43 * What to do with the old coding categories stuff? | 44 * What to do with the old coding categories stuff? |
44 | 45 |
45 * Syntax for symbols &c in characters.el needs looking at. | |
46 | |
47 * The preferred-coding-system property of charsets should probably be | 46 * The preferred-coding-system property of charsets should probably be |
48 junked unless it can be made more useful now. | 47 junked unless it can be made more useful now. |
49 | 48 |
50 * find-coding-systems-for-charsets needs re-writing or removing. | |
51 | |
52 * find-multibyte-characters needs looking at. | 49 * find-multibyte-characters needs looking at. |
53 | 50 |
54 * Implement Korean cp949/UHC and any other important missing | 51 * Implement Korean cp949/UHC, BIG5-HKSCS and any other important missing |
55 charsets. | 52 charsets. |
56 | 53 |
57 * Check up on definitions of tcvn and alternativnj. | 54 * Check up on definition of alternativnj. |
58 | 55 |
59 * Lazy-load tables for unify-charset somehow? | 56 * Lazy-load tables for unify-charset somehow? |
60 | 57 |
61 Actually, Emacs clears out all charset maps and unify-map just | 58 Actually, Emacs clears out all charset maps and unify-map just |
62 before dumping, and they are loaded again on demand the | 59 before dumping, and they are loaded again on demand the |
87 specific to Unicode.) | 84 specific to Unicode.) |
88 | 85 |
89 * Need multibyte text in menus, e.g. for the above. (Not specific to | 86 * Need multibyte text in menus, e.g. for the above. (Not specific to |
90 Unicode.) | 87 Unicode.) |
91 | 88 |
92 * Still can't have case pairs which have different byte lengths -- | |
93 can that be fixed for Turkish, at least? | |
94 | |
95 * There's currently no support for Unicode normalization. | 89 * There's currently no support for Unicode normalization. |
96 | 90 |
97 * Populate char-width-table correctly for Unicode characters and | 91 * Populate char-width-table correctly for Unicode characters and |
98 worry about what happens when double-width charsets covering | 92 worry about what happens when double-width charsets covering |
99 non-CJK characters are unified. | 93 non-CJK characters are unified. |