comparison README.unicode @ 89190:9693e41cc2fd

*** empty log message ***
author Dave Love <fx@gnu.org>
date Wed, 09 Oct 2002 22:51:44 +0000
parents 52c1682e6353
children 3b4e014cc159
comparison
equal deleted inserted replaced
89189:068e199a732e 89190:9693e41cc2fd
1 -*-text-*- 1 -*-text; coding: latin-1;-*-
2 2
3 Problems, fixmes and other issues in the emacs-unicode branch 3 Problems, fixmes and other issues in the emacs-unicode branch
4 ------------------------------------------------------------- 4 -------------------------------------------------------------
5 5
6 Notes by fx to record various things of variable importance. handa 6 Notes by fx to record various things of variable importance. handa
13 existing support and the extra stuff at 13 existing support and the extra stuff at
14 <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk. 14 <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>, mostly now in the CVS trunk.
15 (Editing support is mostly orthogonal to the internal representation.) 15 (Editing support is mostly orthogonal to the internal representation.)
16 16
17 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has 17 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has
18 undesirable effects. 18 undesirable effects. E.g.:
19 (multibyte-string-p (let ((s "x")) (aset s 0 ?£) s)) => nil
20 (multibyte-string-p (concat [?£])) => nil
21 (text-char-description ?£) => "M-#"
19 22
20 * Rationalize character syntax and its relationship to the Unicode 23 * Rationalize character syntax and its relationship to the Unicode
21 database. Specifically, the latin-N.el files aren't consistent for 24 database. (Applies mainly to symbol an punctuation syntax.)
22 common characters (and obviously have redundancies except in
23 unibyte mode).
24 25
25 * Fontset handling and customization needs work. We want to relate 26 * Fontset handling and customization needs work. We want to relate
26 fonts to scripts, probably based on the Unicode blocks. The 27 fonts to scripts, probably based on the Unicode blocks. The
27 presence of small-repertoire 10646-encoded fonts in XFree 4 is a 28 presence of small-repertoire 10646-encoded fonts in XFree 4 is a
28 pain, not currently worked round. 29 pain, not currently worked round.
40 41
41 * Quail files need more work now the encoding is irrelevant. 42 * Quail files need more work now the encoding is irrelevant.
42 43
43 * What to do with the old coding categories stuff? 44 * What to do with the old coding categories stuff?
44 45
45 * Syntax for symbols &c in characters.el needs looking at.
46
47 * The preferred-coding-system property of charsets should probably be 46 * The preferred-coding-system property of charsets should probably be
48 junked unless it can be made more useful now. 47 junked unless it can be made more useful now.
49 48
50 * find-coding-systems-for-charsets needs re-writing or removing.
51
52 * find-multibyte-characters needs looking at. 49 * find-multibyte-characters needs looking at.
53 50
54 * Implement Korean cp949/UHC and any other important missing 51 * Implement Korean cp949/UHC, BIG5-HKSCS and any other important missing
55 charsets. 52 charsets.
56 53
57 * Check up on definitions of tcvn and alternativnj. 54 * Check up on definition of alternativnj.
58 55
59 * Lazy-load tables for unify-charset somehow? 56 * Lazy-load tables for unify-charset somehow?
60 57
61 Actually, Emacs clear out all charset maps and unify-map just 58 Actually, Emacs clear out all charset maps and unify-map just
62 before dumping, and their are loaded again on demand the 59 before dumping, and their are loaded again on demand the
87 specific to Unicode.) 84 specific to Unicode.)
88 85
89 * Need multibyte text in menus, e.g. for the above. (Not specific to 86 * Need multibyte text in menus, e.g. for the above. (Not specific to
90 Unicode.) 87 Unicode.)
91 88
92 * Still can't have case pairs which have different byte lengths --
93 can that be fixed for Turkish, at least?
94
95 * There's currently no support for Unicode normalization. 89 * There's currently no support for Unicode normalization.
96 90
97 * Populate char-width-table correctly for Unicode chanaracters and 91 * Populate char-width-table correctly for Unicode chanaracters and
98 worry about what happens when double-width charsets covering 92 worry about what happens when double-width charsets covering
99 non-CJK characters are unified. 93 non-CJK characters are unified.