comparison README.unicode @ 88740:62f922836c9b
*** empty log message ***
author | Dave Love <fx@gnu.org> |
---|---|
date | Tue, 11 Jun 2002 18:45:36 +0000 |
parents | |
children | 78a0b89ce5d6 |
88739:d32a7aa4afcb | 88740:62f922836c9b |
---|---|
1 -*-text-*- | |
2 | |
3 Problems, fixmes and other issues in the emacs-unicode branch | |
4 | |
5 Notes by fx to record a few things. handa needs to check them -- | |
6 don't take too seriously, especially with regard to completeness. | |
7 | |
8 Do take seriously that you don't want this CVS branch unless you're | |
9 actually working on it. If you just want to edit Unicode and/or unify | |
10 iso-8859 et al, see the existing support and the extra stuff at | |
11 <URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>. Editing support is mostly | |
12 orthogonal to the internal representation. | |
13 | |
14 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters. | |
15 | |
16 * Grok UTF-8 surrogates. | |
17 | |
18 * Rationalize character syntax and its relationship to the Unicode | |
19 database. Specifically, the latin-N.el files aren't consistent for | |
20 common characters. | |
21 | |
22 * Fontset handling and customization need work. | |
23 | |
24 * Likewise for charset and coding system priorities. | |
25 | |
26 * The relevant bits of latin1-disp.el need porting (and probably | |
27 re-naming/updating). See also cyril-util.el. | |
28 | |
29 * Quail files need work now the encoding is irrelevant. E.g. make | |
30 unified Latin pre- and post- methods. | |
31 | |
32 * What to do with the old coding categories stuff? | |
33 | |
34 * Something's wrong with categories, e.g. the result of C-u C-x e on | |
35 Lao in HELLO. | |
36 | |
37 * Syntax for symbols &c in characters needs looking at. | |
38 | |
39 * The preferred-coding-system property of charsets should probably be | |
40 junked unless it can be made more useful now. | |
41 | |
42 * find-coding-systems-for-charsets needs re-writing. | |
43 | |
44 * find-multibyte-characters needs looking at. | |
45 | |
46 * Implement Korean cp949/UHC and any other important missing | |
47 charsets. | |
48 | |
49 * Check up on tcvn and alternativnj. | |
50 | |
51 * Lazy-load tables for unify-charset somehow? | |
52 | |
53 * Should translation tables for {en,de}code and input work now or be | |
54 scrapped? | |
55 | |
56 * Defining CCL coding systems currently doesn't work. | |
57 | |
58 * iso-2022 charsets get unified on i/o. | |
59 | |
60 * Revisit locale processing: look at treating the language and | |
61 charset parts separately. (Language should affect things like | |
62 spelling and calendar, but that's not a Unicode issue.) | |
63 | |
64 * Handle Unicode combining characters usefully, e.g. diacritics, and | |
65 handle more scripts specifically (à la Devanagari). There are | |
66 issues with canonicalization. | |
67 | |
68 * Bidi is a separate issue. | |
69 | |
70 * DTRT with X keysyms. We should get the right Unicode character for a | |
71 given keysym, not decode raw bytes in some ill-defined coding system. | |
72 (fx has some data on keysyms v. unicodes.) | |
73 | |
74 * We need tabular input methods, e.g. for maths symbols. (Not | |
75 specific to Unicode.) | |
76 | |
77 * Need multibyte text in menus, e.g. for the above. (Not specific to | |
78 Unicode.) |
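
The item above on combining characters mentions "issues with canonicalization". As a rough illustration of what that refers to, here is a minimal sketch in Python (not Emacs code; the helper name `canonical_equal` is hypothetical and introduced only for this example). It assumes the standard `unicodedata` module and shows that a precomposed character and its base-plus-combining-mark spelling differ code point by code point yet are canonically equivalent.

```python
import unicodedata


def canonical_equal(a: str, b: str) -> bool:
    """Return True if a and b are canonically equivalent.

    Hypothetical helper: compares the NFD (canonical decomposition) forms.
    """
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


precomposed = "\u00e1"  # LATIN SMALL LETTER A WITH ACUTE
decomposed = "a\u0301"  # 'a' followed by COMBINING ACUTE ACCENT

# The two spellings are different code point sequences...
assert precomposed != decomposed
# ...but they are canonically equivalent.
assert canonical_equal(precomposed, decomposed)
# NFC composes the pair into the single precomposed character.
assert unicodedata.normalize("NFC", decomposed) == precomposed
# Combining marks have a non-zero canonical combining class.
assert unicodedata.combining("\u0301") != 0
```

An editor that searches, compares, or displays text has to decide which of these forms it treats as the same character, which is one reason combining characters need specific handling.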