Mercurial > emacs
annotate lisp/international/characters.el @ 89585:f89c9d4bd81c
(Fx_create_frame, x_create_tip_frame): Use new version of x_new_fontset.
(w32_load_system_font): Initialize charset as unicode.
font_info.encoding becomes encoding_type.
(w32_to_x_font): Use decode_coding_c_string in place of decode_coding.
(x_to_w32_font): Use encode_coding_object in place of encode_coding.
(syms_of_w32fns): Set get_font_repertory_func.
author | Jason Rumney <jasonr@gnu.org> |
---|---|
date | Sat, 11 Oct 2003 22:35:25 +0000 |
parents | 2f877ed80fa6 |
children | 1ad3832f1d1d |
rev | line source |
---|---|
17052 | 1 ;;; characters.el --- set syntax and category for multibyte characters |
2 | |
19958
f7ab34ae92bb
Set case-table for Cyrillic characters.
Richard M. Stallman <rms@gnu.org>
parents:
18561
diff
changeset
|
3 ;; Copyright (C) 1995, 1997 Electrotechnical Laboratory, JAPAN. |
89483 | 4 ;; Licensed to the Free Software Foundation. |
88612 | 5 ;; Copyright (C) 2001, 2002 Free Software Foundation, Inc. |
89483 | 6 ;; Copyright (C) 2003 |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
7 ;; National Institute of Advanced Industrial Science and Technology (AIST) |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
8 ;; Registration Number H13PRO009 |
17052 | 9 |
10 ;; Keywords: multibyte character, character set, syntax, category | |
11 | |
12 ;; This file is part of GNU Emacs. | |
13 | |
14 ;; GNU Emacs is free software; you can redistribute it and/or modify | |
15 ;; it under the terms of the GNU General Public License as published by | |
16 ;; the Free Software Foundation; either version 2, or (at your option) | |
17 ;; any later version. | |
18 | |
19 ;; GNU Emacs is distributed in the hope that it will be useful, | |
20 ;; but WITHOUT ANY WARRANTY; without even the implied warranty of | |
21 ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | |
22 ;; GNU General Public License for more details. | |
23 | |
24 ;; You should have received a copy of the GNU General Public License | |
17071 | 25 ;; along with GNU Emacs; see the file COPYING. If not, write to the |
26 ;; Free Software Foundation, Inc., 59 Temple Place - Suite 330, | |
27 ;; Boston, MA 02111-1307, USA. | |
17052 | 28 |
29 ;;; Commentary: | |
30 | |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
31 ;;; Code: |
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
32 |
17052 | 33 ;;; Predefined categories. |
34 | |
35 ;; For each character set. | |
36 | |
37 (define-category ?a "ASCII") | |
38 (define-category ?l "Latin") | |
39 (define-category ?t "Thai") | |
40 (define-category ?g "Greek") | |
41 (define-category ?b "Arabic") | |
42 (define-category ?w "Hebrew") | |
43 (define-category ?y "Cyrillic") | |
44 (define-category ?k "Japanese katakana") | |
45 (define-category ?r "Japanese roman") | |
46 (define-category ?c "Chinese") | |
47 (define-category ?j "Japanese") | |
48 (define-category ?h "Korean") | |
49 (define-category ?e "Ethiopic (Ge'ez)") | |
50 (define-category ?v "Vietnamese") | |
51 (define-category ?i "Indian") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
52 (define-category ?o "Lao") |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
53 (define-category ?q "Tibetan") |
17052 | 54 |
55 ;; For each group (row) of 2-byte character sets. | |
56 | |
22529 | 57 (define-category ?A "Alpha-numeric characters of 2-byte character sets") |
17052 | 58 (define-category ?C "Chinese (Han) characters of 2-byte character sets") |
22529 | 59 (define-category ?G "Greek characters of 2-byte character sets") |
17052 | 60 (define-category ?H "Japanese Hiragana characters of 2-byte character sets") |
61 (define-category ?K "Japanese Katakana characters of 2-byte character sets") | |
62 (define-category ?N "Korean Hangul characters of 2-byte character sets") | |
22529 | 63 (define-category ?Y "Cyrillic characters of 2-byte character sets") |
17052 | 64 (define-category ?I "Indian Glyphs") |
65 | |
66 ;; For phonetic classifications. | |
67 | |
68 (define-category ?0 "consonant") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
69 (define-category ?1 "base (independent) vowel") |
17052 | 70 (define-category ?2 "upper diacritical mark (including upper vowel)") |
71 (define-category ?3 "lower diacritical mark (including lower vowel)") | |
72 (define-category ?4 "tone mark") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
73 (define-category ?5 "symbol") |
17052 | 74 (define-category ?6 "digit") |
75 (define-category ?7 "vowel-modifying diacritical mark") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
76 (define-category ?8 "vowel-signs") |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
77 (define-category ?9 "semivowel lower") |
17052 | 78 |
79 ;; For filling. | |
80 (define-category ?| "While filling, we can break a line at this character.") | |
81 | |
20247 | 82 ;; For indentation calculation. |
49769
5ed59fd23bc4
Use new ?\s syntax instead of "? " followed by a comment.
Juanma Barranquero <lekktu@gmail.com>
parents:
49601
diff
changeset
|
83 (define-category ?\s |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
84 "This character counts as a space for indentation purposes.") |
20247 | 85 |
22529 | 86 ;; Keep the following for `kinsoku' processing. See comments in |
17052 | 87 ;; kinsoku.el. |
88 (define-category ?> "A character which can't be placed at beginning of line.") | |
89 (define-category ?< "A character which can't be placed at end of line.") | |
90 | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
91 ;; Combining |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
92 (define-category ?^ "Combining diacritic or mark") |
17052 | 93 |
94 ;;; Setting syntax and category. | |
95 | |
96 ;; ASCII | |
97 | |
88425
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
98 ;; All ASCII characters have the category `a' (ASCII) and `l' (Latin). |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
99 (modify-category-entry '(32 . 127) ?a) |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
100 (modify-category-entry '(32 . 127) ?l) |
17052 | 101 |
89314 | 102 ;; Deal with the CJK charsets first. Since the syntax of blocks is |
103 ;; defined per charset, and the charsets may contain e.g. Latin | |
104 ;; characters, we end up with the wrong syntax definitions if we're | |
105 ;; not careful. | |
17052 | 106 |
88752 | 107 ;; Chinese characters (Unicode) |
108 (modify-category-entry '(#x3400 . #x9FAF) ?C) | |
109 (modify-category-entry '(#x3400 . #x9FAF) ?c) | |
110 (modify-category-entry '(#x3400 . #x9FAF) ?|) | |
111 (modify-category-entry '(#xF900 . #xFAFF) ?C) | |
112 (modify-category-entry '(#xF900 . #xFAFF) ?c) | |
113 (modify-category-entry '(#xF900 . #xFAFF) ?|) | |
114 | |
17052 | 115 ;; Chinese character set (GB2312) |
116 | |
88752 | 117 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2121 #x217E) |
118 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2221 #x227E) | |
119 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2921 #x297E) | |
17052 | 120 |
88723 | 121 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?c) |
122 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?|) | |
89014 | 123 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2330 #x2339) |
124 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2341 #x235A) | |
125 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2361 #x237A) | |
88752 | 126 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?H #x2421 #x247E) |
127 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?K #x2521 #x257E) | |
128 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?G #x2621 #x267E) | |
129 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?Y #x2721 #x277E) | |
130 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?C #x3021 #x7E7E) | |
17052 | 131 |
132 ;; Chinese character set (BIG5) | |
133 | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
134 (map-charset-chars #'modify-category-entry 'big5 ?c) |
89014 | 135 (map-charset-chars #'modify-category-entry 'big5 ?C #xA259 #xA25F) |
136 (map-charset-chars #'modify-category-entry 'big5 ?C #xA440 #xC67E) | |
137 (map-charset-chars #'modify-category-entry 'big5 ?C #xC940 #xF9DF) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
138 (map-charset-chars #'modify-category-entry 'big5 ?|) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
139 |
17052 | 140 |
141 ;; Chinese character set (CNS11643) | |
142 | |
88723 | 143 (dolist (c '(chinese-cns11643-1 chinese-cns11643-2 chinese-cns11643-3 |
144 chinese-cns11643-4 chinese-cns11643-5 chinese-cns11643-6 | |
145 chinese-cns11643-7)) | |
146 (map-charset-chars #'modify-category-entry c ?c) | |
89014 | 147 (if (eq c 'chinese-cns11643-1) |
148 (map-charset-chars #'modify-category-entry c ?C #x4421 #x7E7E) | |
149 (map-charset-chars #'modify-category-entry c ?C)) | |
88723 | 150 (map-charset-chars #'modify-category-entry c ?|)) |
17052 | 151 |
89483 | 152 ;; Japanese character set (JISX0201, JISX0208, JISX0212, JISX0213) |
17052 | 153 |
88752 | 154 (map-charset-chars #'modify-category-entry 'katakana-jisx0201 ?k) |
155 | |
156 (map-charset-chars #'modify-category-entry 'latin-jisx0201 ?r) | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
157 |
89483 | 158 (dolist (l '(katakana-jisx0201 japanese-jisx0208 japanese-jisx0212 |
159 japanese-jisx0213-1 japanese-jisx0213-2)) | |
88752 | 160 (map-charset-chars #'modify-category-entry l ?j) |
89483 | 161 (if (eq l 'japanese-jisx0213-1) |
162 (map-charset-chars #'modify-category-entry l ?\| #x2E21 #x7E7F) | |
163 (map-charset-chars #'modify-category-entry l ?\|))) | |
17052 | 164 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
165 ;; Unicode equivalents of JISX0201-kana |
88752 | 166 (let ((range '(#xff61 . #xff9f))) |
167 (modify-category-entry range ?k) | |
168 (modify-category-entry range ?j) | |
169 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
170 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
171 ;; Katakana block |
88752 | 172 (let ((range '(#x30a0 . #x30ff))) |
173 ;; ?K is double width, ?k isn't specified | |
174 (modify-category-entry range ?K) | |
175 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
176 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
177 ;; Hiragana block |
89483 | 178 (let ((range '(#x3040 . #x309d))) |
88752 | 179 ;; ?H is actually defined to be double width |
180 ;;(modify-category-entry range ?H) | |
181 ;;(modify-category-entry range ?\|) | |
182 ) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
183 |
17052 | 184 ;; JISX0208 |
88752 | 185 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2121 #x227E) |
186 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2821 #x287E) | |
187 (let ((chars '(?ー ?ã‚› ?ã‚œ ?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) | |
88519
98ec5d812ccc
Set syntax/category for japanese-jisx0208.
Kenichi Handa <handa@m17n.org>
parents:
88492
diff
changeset
|
188 (dolist (elt chars) |
88612 | 189 (modify-syntax-entry (car chars) "w"))) |
17052 | 190 |
88752 | 191 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?A #x2321 #x237E) |
192 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?H #x2421 #x247E) | |
193 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?K #x2521 #x257E) | |
194 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?G #x2621 #x267E) | |
195 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?Y #x2721 #x277E) | |
196 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?C #x3021 #x7E7E) | |
197 (modify-category-entry ?ー ?K) | |
198 (let ((chars '(?゛ ?゜))) | |
17052 | 199 (while chars |
200 (modify-category-entry (car chars) ?K) | |
201 (modify-category-entry (car chars) ?H) | |
202 (setq chars (cdr chars)))) | |
88752 | 203 (let ((chars '(?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) |
17052 | 204 (while chars |
205 (modify-category-entry (car chars) ?C) | |
206 (setq chars (cdr chars)))) | |
207 | |
208 ;; JISX0212 | |
88723 | 209 |
88752 | 210 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0212 "_" #x2121 #x237E) |
17052 | 211 |
212 ;; JISX0201-Kana | |
88723 | 213 |
88612 | 214 (let ((chars '(?。 ?、 ?・))) |
17052 | 215 (while chars |
216 (modify-syntax-entry (car chars) ".") | |
217 (setq chars (cdr chars)))) | |
218 | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
219 (modify-syntax-entry ?\ï½¢ "(ï½£") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
220 (modify-syntax-entry ?\ï½£ "(ï½¢") |
36688
45b172fa5c17
Add category `j' for katakana-jisx0201. Fix syntaxes of
Kenichi Handa <handa@m17n.org>
parents:
34507
diff
changeset
|
221 |
17052 | 222 ;; Korean character set (KSC5601) |
223 | |
88723 | 224 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?h) |
88752 | 225 |
226 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2121 #x227E) | |
89117 | 227 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2621 #x277E) |
228 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2830 #x287E) | |
229 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2930 #x297E) | |
89014 | 230 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2330 #x2339) |
231 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2341 #x235A) | |
232 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2361 #x237A) | |
88752 | 233 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?G #x2521 #x257E) |
234 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?H #x2A21 #x2A7E) | |
235 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?K #x2B21 #x2B7E) | |
236 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?Y #x2C21 #x2C7E) | |
17052 | 237 |
89314 | 238 ;; These are in more than one charset. |
89483 | 239 (let ((parens (concat "〈〉《》「ã€ã€Žã€ã€ã€‘〔〕〖〗〘〙〚〛" |
240 "︵︶︷︸︹︺︻︼︽︾︿﹀ï¹ï¹‚﹃﹄" | |
241 "()[]{ï½")) | |
242 open close) | |
243 (dotimes (i (/ (length parens) 2)) | |
244 (setq open (aref parens (* i 2)) | |
245 close (aref parens (1+ (* i 2)))) | |
246 (modify-syntax-entry open (format "(%c" close)) | |
247 (modify-syntax-entry close (format ")%c" open)))) | |
89314 | 248 |
249 ;; Arabic character set | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
250 |
89314 | 251 (let ((charsets '(arabic-iso8859-6 |
252 arabic-digit | |
253 arabic-1-column | |
254 arabic-2-column))) | |
255 (while charsets | |
256 (map-charset-chars #'modify-category-entry (car charsets) ?b) | |
257 (setq charsets (cdr charsets)))) | |
258 (modify-category-entry '(#x600 . #x6ff) ?b) | |
259 (modify-category-entry '(#xfb50 . #xfdff) ?b) | |
260 (modify-category-entry '(#xfe70 . #xfefe) ?b) | |
261 | |
262 ;; Cyrillic character set (ISO-8859-5) | |
263 | |
264 (modify-syntax-entry ?â„– ".") | |
265 | |
266 ;; Ethiopic character set | |
267 | |
268 (modify-category-entry '(#x1200 . #x137b) ?e) | |
269 (let ((chars '(?á¡ ?ᢠ?ᣠ?ᤠ?ᥠ?ᦠ?᧠?ᨠ?ö ‡€ ?ö ‡‹ ?ö ‡Œ ?ö ‡ ?ö ‡Ž ?ö ‡))) | |
270 (while chars | |
271 (modify-syntax-entry (car chars) ".") | |
272 (setq chars (cdr chars)))) | |
273 (map-charset-chars #'modify-category-entry 'ethiopic ?e) | |
274 | |
275 ;; Hebrew character set (ISO-8859-8) | |
276 | |
277 (modify-syntax-entry #x5be ".") ; MAQAF | |
278 (modify-syntax-entry #x5c0 ".") ; PASEQ | |
279 (modify-syntax-entry #x5c3 ".") ; SOF PASUQ | |
280 (modify-syntax-entry #x5f3 ".") ; GERESH | |
281 (modify-syntax-entry #x5f4 ".") ; GERSHAYIM | |
282 | |
283 ;; Indian character set (IS 13194 and other Emacs original Indian charsets) | |
284 | |
285 (modify-category-entry '(#x901 . #x970) ?i) | |
286 (map-charset-chars #'modify-category-entry 'indian-is13194 ?i) | |
287 (map-charset-chars #'modify-category-entry 'indian-2-column ?i) | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
288 |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
289 ;; Lao character set |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
290 |
88612 | 291 (modify-category-entry '(#xe80 . #xeff) ?o) |
292 (map-charset-chars #'modify-category-entry 'lao ?o) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
293 |
88612 | 294 (let ((deflist '(("àº-ຮ" "w" ?0) ; consonant |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
295 ("ະາຳຽເ-ໄ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
296 ("ັິ-ືົà»" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
297 ("ຸູ" "w" ?3) ; vowel lower |
89483 | 298 ("່-໋" "w" ?4) ; tone mark |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
299 ("ຼຽ" "w" ?9) ; semivowel lower |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
300 ("à»-à»™" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
301 ("ຯໆ" "_" ?5) ; symbol |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
302 )) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
303 elm chars len syntax category to ch i) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
304 (while deflist |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
305 (setq elm (car deflist)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
306 (setq chars (car elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
307 len (length chars) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
308 syntax (nth 1 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
309 category (nth 2 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
310 i 0) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
311 (while (< i len) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
312 (if (= (aref chars i) ?-) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
313 (setq i (1+ i) |
20843 | 314 to (aref chars i)) |
315 (setq ch (aref chars i) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
316 to ch)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
317 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
318 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
319 (modify-syntax-entry ch syntax)) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
320 (modify-category-entry ch category) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
321 (setq ch (1+ ch))) |
20843 | 322 (setq i (1+ i))) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
323 (setq deflist (cdr deflist)))) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
324 |
17052 | 325 ;; Thai character set (TIS620) |
326 | |
88612 | 327 (modify-category-entry '(#xe00 . #xe7f) ?t) |
328 (map-charset-chars #'modify-category-entry 'thai-tis620 ?t) | |
17052 | 329 |
330 (let ((deflist '(;; chars syntax category | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
331 ("à¸-รลว-ฮ" "w" ?0) ; consonant |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
332 ("ฤฦะาำเ-ๅ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
333 ("ัิ-ื็๎" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
334 ("ุ-ฺ" "w" ?3) ; vowel lower |
89483 | 335 ("่-à¹" "w" ?4) ; tone mark |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
336 ("à¹-๙" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
337 ("ฯๆ฿à¹à¹šà¹›" "_" ?5) ; symbol |
17052 | 338 )) |
339 elm chars len syntax category to ch i) | |
340 (while deflist | |
341 (setq elm (car deflist)) | |
342 (setq chars (car elm) | |
343 len (length chars) | |
344 syntax (nth 1 elm) | |
345 category (nth 2 elm) | |
346 i 0) | |
347 (while (< i len) | |
348 (if (= (aref chars i) ?-) | |
349 (setq i (1+ i) | |
20843 | 350 to (aref chars i)) |
351 (setq ch (aref chars i) | |
17052 | 352 to ch)) |
353 (while (<= ch to) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
354 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
355 (modify-syntax-entry ch syntax)) |
17052 | 356 (modify-category-entry ch category) |
357 (setq ch (1+ ch))) | |
20843 | 358 (setq i (1+ i))) |
17052 | 359 (setq deflist (cdr deflist)))) |
360 | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
361 ;; Tibetan character set |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
362 |
88612 | 363 (modify-category-entry '(#xf00 . #xfff) ?q) |
364 (map-charset-chars #'modify-category-entry 'tibetan ?q) | |
365 (map-charset-chars #'modify-category-entry 'tibetan-1-column ?q) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
366 |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
367 (let ((deflist '(;; chars syntax category |
88711 | 368 ("ཀ-ཀྵཪ" "w" ?0) ; consonant |
369 ("à¾-ྐྵྺྻྼö€ö€‚" "w" ?0) ; | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
370 ("ö„š-ö…ž" "w" ?0) ; |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
371 ("ö…¸-ö‡Œ" "w" ?0) ; |
88711 | 372 ("ིེཻོཽྀ" "w" ?2) ; upper vowel |
373 ("ཾྂྃ྆྇ྈྉྊྋ" "w" ?2) ; upper modifier | |
374 ("༙ö‚Žà½±à½´à¾„༵༷" "w" ?3) ; lowel vowel/modifier | |
89483 | 375 ("཰" "w" ?3) ; invisible vowel a |
88711 | 376 ("༠-༩༪-༳" "w" ?6) ; digit |
377 ("་à¼-༒༔ཿ" "." ?|) ; line-break char | |
378 ("་à¼à¼à¼à¼‘༔ཿ" "." ?|) ; | |
379 ("༈་à¼-༒༔ཿ༽༴" "." ?>) ; prohibition | |
380 ("་à¼à¼à¼à¼‘༔ཿ" "." ?>) ; | |
381 ("ༀ-༊༼à¿à¿‚྅" "." ?<) ; prohibition | |
382 ("༓༕-༘༚-༟༶༸-༻༾༿྾྿-à¿" "." ?q) ; others | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
383 )) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
384 elm chars len syntax category to ch i) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
385 (while deflist |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
386 (setq elm (car deflist)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
387 (setq chars (car elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
388 len (length chars) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
389 syntax (nth 1 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
390 category (nth 2 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
391 i 0) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
392 (while (< i len) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
393 (if (= (aref chars i) ?-) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
394 (setq i (1+ i) |
20843 | 395 to (aref chars i)) |
396 (setq ch (aref chars i) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
397 to ch)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
398 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
399 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
400 (modify-syntax-entry ch syntax)) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
401 (modify-category-entry ch category) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
402 (setq ch (1+ ch))) |
20843 | 403 (setq i (1+ i))) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
404 (setq deflist (cdr deflist)))) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
405 |
17052 | 406 ;; Vietnamese character set |
407 | |
88612 | 408 ;; To make a word with Latin characters |
409 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?l) | |
410 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?v) | |
411 | |
412 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?l) | |
413 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?v) | |
17052 | 414 |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
415 (let ((tbl (standard-case-table)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
416 (i 32)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
417 (while (< i 128) |
88711 | 418 (let* ((char (decode-char 'vietnamese-viscii-upper i)) |
419 (charl (decode-char 'vietnamese-viscii-lower i)) | |
420 (uc (encode-char char 'ucs)) | |
421 (lc (encode-char charl 'ucs))) | |
422 (set-case-syntax-pair char (decode-char 'vietnamese-viscii-lower i) | |
423 tbl) | |
424 (if uc (modify-category-entry uc ?v)) | |
425 (if lc (modify-category-entry lc ?v))) | |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
426 (setq i (1+ i)))) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
427 |
89314 | 428 |
429 ;; Latin | |
430 | |
431 (modify-category-entry '(#x80 . #x024F) ?l) | |
432 | |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
433 (let ((tbl (standard-case-table)) c) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
434 |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
435 ;; In some languages, U+0049 LATIN CAPITAL LETTER I and U+0131 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
436 ;; SMALL LETTER DOTLESS I make a case pair, and so do U+0130 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
437 ;; CAPITAL LETTER I WITH DOT ABOVE and U+0069 LATIN SMALL LETTER I. |
89188 | 438 ;; See the Turkish language environment. |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
439 |
89146 | 440 ;; Latin-1 |
441 | |
442 ;; Fixme: Some of the non-word syntaxes here perhaps should be | |
443 ;; reviewed. (Note that the following all implicitly have word | |
444 ;; syntax: ¢£¤¥¨ª¯²³´¶¸¹º.) There should be a well-defined way of | |
445 ;; relating Unicode categories to Emacs syntax codes. | |
446 (set-case-syntax ? " " tbl) ; dubious | |
447 (set-case-syntax ?¡ "." tbl) | |
448 (set-case-syntax ?¦ "_" tbl) | |
449 (set-case-syntax ?§ "." tbl) | |
450 (set-case-syntax ?© "_" tbl) | |
451 (set-case-syntax-delims 171 187 tbl) ; « » | |
452 (set-case-syntax ?¬ "_" tbl) | |
453 (set-case-syntax ? "_" tbl) | |
454 (set-case-syntax ?® "_" tbl) | |
455 (set-case-syntax ?° "_" tbl) | |
456 (set-case-syntax ?± "_" tbl) | |
457 (set-case-syntax ?µ "_" tbl) | |
458 (set-case-syntax ?· "_" tbl) | |
459 (set-case-syntax ?¼ "_" tbl) | |
460 (set-case-syntax ?½ "_" tbl) | |
461 (set-case-syntax ?¾ "_" tbl) | |
462 (set-case-syntax ?¿ "." tbl) | |
463 (let ((c 192)) | |
464 (while (<= c 222) | |
465 (set-case-syntax-pair c (+ c 32) tbl) | |
466 (setq c (1+ c)))) | |
467 (set-case-syntax ?× "_" tbl) | |
468 (set-case-syntax ?ß "w" tbl) | |
469 (set-case-syntax ?÷ "_" tbl) | |
470 ;; See below for ÿ. | |
471 | |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
472 ;; Latin Extended-A, Latin Extended-B |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
473 (setq c #x0100) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
474 (while (<= c #x0233) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
475 (and (or (<= c #x012e) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
476 (and (>= c #x014a) (<= c #x0177))) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
477 (zerop (% c 2)) |
88612 | 478 (set-case-syntax-pair c (1+ c) tbl)) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
479 (and (>= c #x013a) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
480 (<= c #x0148) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
481 (zerop (% c 2)) |
88612 | 482 (set-case-syntax-pair (1- c) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
483 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
484 (set-case-syntax-pair ?IJ ?ij tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
485 (set-case-syntax-pair ?Ĵ ?ĵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
486 (set-case-syntax-pair ?Ķ ?ķ tbl) |
89084 | 487 (set-case-syntax-pair ?Ÿ ?ÿ tbl) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
488 (set-case-syntax-pair ?Ź ?ź tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
489 (set-case-syntax-pair ?Ż ?ż tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
490 (set-case-syntax-pair ?Ž ?ž tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
491 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
492 ;; Latin Extended-B |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
493 (set-case-syntax-pair ?Æ ?É“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
494 (set-case-syntax-pair ?Ƃ ?ƃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
495 (set-case-syntax-pair ?Æ„ ?Æ… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
496 (set-case-syntax-pair ?Ɔ ?ɔ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
497 (set-case-syntax-pair ?Ƈ ?ƈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
498 (set-case-syntax-pair ?Ɖ ?ɖ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
499 (set-case-syntax-pair ?ÆŠ ?É— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
500 (set-case-syntax-pair ?Ƌ ?ƌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
501 (set-case-syntax-pair ?ÆŽ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
502 (set-case-syntax-pair ?Æ ?É™ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
503 (set-case-syntax-pair ?Æ ?É› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
504 (set-case-syntax-pair ?Æ‘ ?Æ’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
505 (set-case-syntax-pair ?Æ“ ?É tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
506 (set-case-syntax-pair ?Æ” ?É£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
507 (set-case-syntax-pair ?Æ– ?É© tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
508 (set-case-syntax-pair ?Ɨ ?ɨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
509 (set-case-syntax-pair ?Ƙ ?ƙ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
510 (set-case-syntax-pair ?Ɯ ?ɯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
511 (set-case-syntax-pair ?Æ ?ɲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
512 (set-case-syntax-pair ?Ɵ ?ɵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
513 (set-case-syntax-pair ?Æ ?Æ¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
514 (set-case-syntax-pair ?Æ¢ ?Æ£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
515 (set-case-syntax-pair ?Ƥ ?ƥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
516 (set-case-syntax-pair ?Ʀ ?ʀ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
517 (set-case-syntax-pair ?Ƨ ?ƨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
518 (set-case-syntax-pair ?Ʃ ?ʃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
519 (set-case-syntax-pair ?Ƭ ?Æ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
520 (set-case-syntax-pair ?Ʈ ?ʈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
521 (set-case-syntax-pair ?Ư ?ư tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
522 (set-case-syntax-pair ?Ʊ ?ʊ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
523 (set-case-syntax-pair ?Ʋ ?ʋ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
524 (set-case-syntax-pair ?Ƴ ?ƴ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
525 (set-case-syntax-pair ?Ƶ ?ƶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
526 (set-case-syntax-pair ?Æ· ?Ê’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
527 (set-case-syntax-pair ?Ƹ ?ƹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
528 (set-case-syntax-pair ?Ƽ ?ƽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
529 (set-case-syntax-pair ?DŽ ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
530 (set-case-syntax-pair ?Dž ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
531 (set-case-syntax-pair ?LJ ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
532 (set-case-syntax-pair ?Lj ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
533 (set-case-syntax-pair ?NJ ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
534 (set-case-syntax-pair ?Nj ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
535 (set-case-syntax-pair ?Ç ?ÇŽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
536 (set-case-syntax-pair ?Ç ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
537 (set-case-syntax-pair ?Ç‘ ?Ç’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
538 (set-case-syntax-pair ?Ç“ ?Ç” tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
539 (set-case-syntax-pair ?Ç• ?Ç– tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
540 (set-case-syntax-pair ?Ǘ ?ǘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
541 (set-case-syntax-pair ?Ç™ ?Çš tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
542 (set-case-syntax-pair ?Ǜ ?ǜ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
543 (set-case-syntax-pair ?Çž ?ÇŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
544 (set-case-syntax-pair ?Ç ?Ç¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
545 (set-case-syntax-pair ?Ç¢ ?Ç£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
546 (set-case-syntax-pair ?Ǥ ?ǥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
547 (set-case-syntax-pair ?Ǧ ?ǧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
548 (set-case-syntax-pair ?Ǩ ?ǩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
549 (set-case-syntax-pair ?Ǫ ?ǫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
550 (set-case-syntax-pair ?Ǭ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
551 (set-case-syntax-pair ?Ǯ ?ǯ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
552 ;; 01F0; F; 006A 030C; # LATIN SMALL LETTER J WITH CARON |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
553 (set-case-syntax-pair ?DZ ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
554 (set-case-syntax-pair ?Dz ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
555 (set-case-syntax-pair ?Ǵ ?ǵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
556 (set-case-syntax-pair ?Ƕ ?ƕ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
557 (set-case-syntax-pair ?Ç· ?Æ¿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
558 (set-case-syntax-pair ?Ǹ ?ǹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
559 (set-case-syntax-pair ?Ǻ ?ǻ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
560 (set-case-syntax-pair ?Ǽ ?ǽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
561 (set-case-syntax-pair ?Ǿ ?ǿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
562 (set-case-syntax-pair ?È€ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
563 (set-case-syntax-pair ?Ȃ ?ȃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
564 (set-case-syntax-pair ?È„ ?È… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
565 (set-case-syntax-pair ?Ȇ ?ȇ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
566 (set-case-syntax-pair ?Ȉ ?ȉ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
567 (set-case-syntax-pair ?ÈŠ ?È‹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
568 (set-case-syntax-pair ?ÈŒ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
569 (set-case-syntax-pair ?ÈŽ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
570 (set-case-syntax-pair ?È ?È‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
571 (set-case-syntax-pair ?È’ ?È“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
572 (set-case-syntax-pair ?È” ?È• tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
573 (set-case-syntax-pair ?È– ?È— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
574 (set-case-syntax-pair ?Ș ?ș tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
575 (set-case-syntax-pair ?Èš ?È› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
576 (set-case-syntax-pair ?Èœ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
577 (set-case-syntax-pair ?Èž ?ÈŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
578 (set-case-syntax-pair ?È¢ ?È£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
579 (set-case-syntax-pair ?Ȥ ?ȥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
580 (set-case-syntax-pair ?Ȧ ?ȧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
581 (set-case-syntax-pair ?Ȩ ?ȩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
582 (set-case-syntax-pair ?Ȫ ?ȫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
583 (set-case-syntax-pair ?Ȭ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
584 (set-case-syntax-pair ?Ȯ ?ȯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
585 (set-case-syntax-pair ?Ȱ ?ȱ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
586 (set-case-syntax-pair ?Ȳ ?ȳ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
587 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
588 ;; Latin Extended Additional |
88612 | 589 (modify-category-entry '(#x1e00 . #x1ef9) ?l) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
590 (setq c #x1e00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
591 (while (<= c #x1ef9) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
592 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
593 (or (<= c #x1e94) (>= c #x1ea0)) |
88612 | 594 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
595 (setq c (1+ c))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
596 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
597 ;; Greek |
88612 | 598 (modify-category-entry '(#x0370 . #x03ff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
599 (setq c #x0370) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
600 (while (<= c #x03ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
601 (if (or (and (>= c #x0391) (<= c #x03a1)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
602 (and (>= c #x03a3) (<= c #x03ab))) |
88612 | 603 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
604 (and (>= c #x03da) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
605 (<= c #x03ee) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
606 (zerop (% c 2)) |
88612 | 607 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
608 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
609 (set-case-syntax-pair ?Ά ?ά tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
610 (set-case-syntax-pair ?Έ ?Πtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
611 (set-case-syntax-pair ?Ή ?ή tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
612 (set-case-syntax-pair ?Ί ?ί tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
613 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
614 (set-case-syntax-pair ?ÎŽ ?Ï tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
615 (set-case-syntax-pair ?Î ?ÏŽ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
616 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
617 ;; Armenian |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
618 (setq c #x531) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
619 (while (<= c #x556) |
88612 | 620 (set-case-syntax-pair c (+ c #x30) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
621 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
622 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
623 ;; Greek Extended |
88612 | 624 (modify-category-entry '(#x1f00 . #x1fff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
625 (setq c #x1f00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
626 (while (<= c #x1fff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
627 (and (<= (logand c #x000f) 7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
628 (<= c #x1fa7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
629 (not (memq c '(#x1f50 #x1f52 #x1f54 #x1f56))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
630 (/= (logand c #x00f0) 7) |
88612 | 631 (set-case-syntax-pair (+ c 8) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
632 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
633 (set-case-syntax-pair ?Ᾰ ?ᾰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
634 (set-case-syntax-pair ?á¾¹ ?á¾± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
635 (set-case-syntax-pair ?Ὰ ?ὰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
636 (set-case-syntax-pair ?á¾» ?á½± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
637 (set-case-syntax-pair ?á¾¼ ?á¾³ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
638 (set-case-syntax-pair ?Ὲ ?ὲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
639 (set-case-syntax-pair ?Έ ?έ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
640 (set-case-syntax-pair ?á¿Š ?á½´ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
641 (set-case-syntax-pair ?á¿‹ ?á½µ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
642 (set-case-syntax-pair ?ῌ ?ῃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
643 (set-case-syntax-pair ?Ῐ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
644 (set-case-syntax-pair ?á¿™ ?á¿‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
645 (set-case-syntax-pair ?Ὶ ?ὶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
646 (set-case-syntax-pair ?á¿› ?á½· tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
647 (set-case-syntax-pair ?Ῠ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
648 (set-case-syntax-pair ?á¿© ?á¿¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
649 (set-case-syntax-pair ?Ὺ ?ὺ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
650 (set-case-syntax-pair ?á¿« ?á½» tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
651 (set-case-syntax-pair ?Ῥ ?ῥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
652 (set-case-syntax-pair ?Ὸ ?ὸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
653 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
654 (set-case-syntax-pair ?Ὼ ?ὼ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
655 (set-case-syntax-pair ?á¿» ?á½½ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
656 (set-case-syntax-pair ?ῼ ?ῳ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
657 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
658 ;; cyrillic |
88612 | 659 (modify-category-entry '(#x0400 . #x04FF) ?y) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
660 (setq c #x0400) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
661 (while (<= c #x04ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
662 (and (>= c #x0400) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
663 (<= c #x040f) |
88612 | 664 (set-case-syntax-pair c (+ c 80) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
665 (and (>= c #x0410) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
666 (<= c #x042f) |
88612 | 667 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
668 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
669 (or (and (>= c #x0460) (<= c #x0480)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
670 (and (>= c #x048c) (<= c #x04be)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
671 (and (>= c #x04d0) (<= c #x04f4))) |
89483 | 672 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
673 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
674 (set-case-syntax-pair ?Ó ?Ó‚ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
675 (set-case-syntax-pair ?Óƒ ?Ó„ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
676 (set-case-syntax-pair ?Ó‡ ?Óˆ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
677 (set-case-syntax-pair ?Ӌ ?ӌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
678 (set-case-syntax-pair ?Ó¸ ?Ó¹ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
679 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
680 ;; general punctuation |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
681 (setq c #x2000) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
682 (while (<= c #x200b) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
683 (set-case-syntax c " " tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
684 (setq c (1+ c))) |
89119 | 685 (while (<= c #x200F) |
686 (set-case-syntax c "." tbl) | |
687 (setq c (1+ c))) | |
688 ;; Fixme: These aren't all right: | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
689 (while (<= c #x2027) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
690 (set-case-syntax c "_" tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
691 (setq c (1+ c))) |
89119 | 692 (while (<= c #x206F) |
693 (set-case-syntax c "." tbl) | |
694 (setq c (1+ c))) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
695 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
696 ;; Roman numerals |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
697 (setq c #x2160) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
698 (while (<= c #x216f) |
88612 | 699 (set-case-syntax-pair c (+ c #x10) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
700 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
701 |
89146 | 702 ;; Fixme: The following blocks might be better as symbol rather than |
703 ;; punctuation. | |
89119 | 704 ;; Arrows |
705 (setq c #x2190) | |
89121 | 706 (while (<= c #x21FF) |
707 (set-case-syntax c "." tbl) | |
89119 | 708 (setq c (1+ c))) |
709 ;; Mathematical Operators | |
710 (while (<= c #x22FF) | |
89121 | 711 (set-case-syntax c "." tbl) |
89119 | 712 (setq c (1+ c))) |
713 ;; Miscellaneous Technical | |
714 (while (<= c #x23FF) | |
89121 | 715 (set-case-syntax c "." tbl) |
89119 | 716 (setq c (1+ c))) |
717 ;; Control Pictures | |
718 (while (<= c #x243F) | |
89121 | 719 (set-case-syntax c "_" tbl) |
720 (setq c (1+ c))) | |
89119 | 721 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
722 ;; Circled Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
723 (setq c #x24b6) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
724 (while (<= c #x24cf) |
88612 | 725 (set-case-syntax-pair c (+ c 26) tbl) |
726 (modify-category-entry c ?l) | |
727 (modify-category-entry (+ c 26) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
728 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
729 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
730 ;; Fullwidth Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
731 (setq c #xff21) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
732 (while (<= c #xff3a) |
88612 | 733 (set-case-syntax-pair c (+ c #x20) tbl) |
734 (modify-category-entry c ?l) | |
735 (modify-category-entry (+ c #x20) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
736 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
737 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
738 ;; Combining diacritics |
88612 | 739 (modify-category-entry '(#x300 . #x362) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
740 ;; Combining marks |
88612 | 741 (modify-category-entry '(#x20d0 . #x20e3) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
742 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
743 ;; Fixme: syntax for symbols &c |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
744 ) |
17052 | 745 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
746 ;; For each character set, put the information of the most proper |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
747 ;; coding system to encode it by `preferred-coding-system' property. |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
748 |
88612 | 749 ;; Fixme: should this be junked? |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
750 (let ((l '((latin-iso8859-1 . iso-latin-1) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
751 (latin-iso8859-2 . iso-latin-2) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
752 (latin-iso8859-3 . iso-latin-3) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
753 (latin-iso8859-4 . iso-latin-4) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
754 (thai-tis620 . thai-tis620) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
755 (greek-iso8859-7 . greek-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
756 (arabic-iso8859-6 . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
757 (hebrew-iso8859-8 . hebrew-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
758 (katakana-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
759 (latin-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
760 (cyrillic-iso8859-5 . cyrillic-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
761 (latin-iso8859-9 . iso-latin-5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
762 (japanese-jisx0208-1978 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
763 (chinese-gb2312 . cn-gb-2312) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
764 (japanese-jisx0208 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
765 (korean-ksc5601 . iso-2022-kr) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
766 (japanese-jisx0212 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
767 (chinese-cns11643-1 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
768 (chinese-cns11643-2 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
769 (chinese-big5-1 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
770 (chinese-big5-2 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
771 (chinese-sisheng . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
772 (ipa . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
773 (vietnamese-viscii-lower . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
774 (vietnamese-viscii-upper . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
775 (arabic-digit . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
776 (arabic-1-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
777 (lao . lao) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
778 (arabic-2-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
779 (indian-is13194 . devanagari) |
41466
b364f798b789
Adjusted for the change of indian-1-column -> indian-glyph.
Richard M. Stallman <rms@gnu.org>
parents:
38414
diff
changeset
|
780 (indian-glyph . devanagari) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
781 (tibetan-1-column . tibetan) |
24256
47e6aa168a66
Let ethiopic use iso-2022-7bit.
Kenichi Handa <handa@m17n.org>
parents:
24208
diff
changeset
|
782 (ethiopic . iso-2022-7bit) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
783 (chinese-cns11643-3 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
784 (chinese-cns11643-4 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
785 (chinese-cns11643-5 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
786 (chinese-cns11643-6 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
787 (chinese-cns11643-7 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
788 (indian-2-column . devanagari) |
26013 | 789 (tibetan . tibetan) |
790 (latin-iso8859-14 . iso-latin-8) | |
791 (latin-iso8859-15 . iso-latin-9)))) | |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
792 (while l |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
793 (put-charset-property (car (car l)) 'preferred-coding-system (cdr (car l))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
794 (setq l (cdr l)))) |
23493 | 795 |
796 | |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
797 ;; Setup auto-fill-chars for charsets that should invoke auto-filling. |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
798 ;; SPACE and NEWLINE are already set. Also put `nospace-between-words' |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
799 ;; property on the charsets. |
23493 | 800 (let ((l '(katakana-jisx0201 |
801 japanese-jisx0208 japanese-jisx0212 | |
802 chinese-gb2312 chinese-big5-1 chinese-big5-2))) | |
803 (while l | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
804 ;;(aset auto-fill-chars (make-char (car l)) t) |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
805 (put-charset-property (car l) 'nospace-between-words t) |
23493 | 806 (setq l (cdr l)))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
807 |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
808 |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
809 ;; CJK double width characters. |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
810 (let ((l '((#x1100 . #x11FF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
811 (#x2E80 . #x9FAF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
812 (#xAC00 . #xD7AF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
813 (#xF900 . #xFAFF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
814 (#xFE30 . #xFE4F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
815 (#xFF00 . #xFF5F) |
89483 | 816 (#xFFE0 . #xFFEF) |
817 (#x20000 . #x2AFFF) | |
818 (#x2F800 . #x2FFFF)))) | |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
819 (dolist (elt l) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
820 (set-char-table-range char-width-table |
88612 | 821 (cons (car elt) (cdr elt)) |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
822 2))) |
89117 | 823 ;; Fixme: Doing this affects non-CJK characters through unification, |
824 ;; but presumably CJK users expect those characters to be | |
825 ;; double-width when using these charsets. | |
826 ;; (map-charset-chars | |
827 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
828 ;; 'japanese-jisx0208) | |
829 ;; (map-charset-chars | |
830 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
831 ;; 'japanese-jisx0212) | |
832 ;; (map-charset-chars | |
833 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
834 ;; 'japanese-jisx0213-1) | |
835 ;; (map-charset-chars | |
836 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
837 ;; 'japanese-jisx0213-2) | |
838 ;; (map-charset-chars | |
839 ;; (lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
840 ;; 'korean-ksc5601) | |
88642
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
841 |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
842 ;; Other double width |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
843 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
844 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
845 'ethiopic) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
846 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
847 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
848 'tibetan) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
849 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
850 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
851 'indian-2-column) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
852 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
853 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
854 'arabic-2-column) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
855 |
88723 | 856 (optimize-char-table (standard-case-table)) |
857 (optimize-char-table char-width-table) | |
858 (optimize-char-table (standard-category-table)) | |
859 (optimize-char-table (standard-syntax-table)) | |
860 | |
89119 | 861 ;; The Unicode blocks actually extend past some of these ranges with |
862 ;; undefined codepoints. | |
88892 | 863 (let ((script-list nil)) |
864 (dolist | |
865 (elt | |
89008
3e08629c1469
(script-list): Treat ascii characters as `latin' script.
Kenichi Handa <handa@m17n.org>
parents:
88976
diff
changeset
|
866 '((#x0000 #x007F latin) |
88892 | 867 (#x00A0 #x036F latin) |
868 (#x0370 #x03E1 greek) | |
869 (#x03E2 #x03EF coptic) | |
870 (#x03F0 #x03F3 greek) | |
871 (#x0400 #x04FF cyrillic) | |
872 (#x0530 #x058F armenian) | |
873 (#x0590 #x05FF hebrew) | |
874 (#x0600 #x06FF arabic) | |
875 (#x0700 #x074F syriac) | |
876 (#x0780 #x07BF thaana) | |
877 (#x0900 #x097F devanagari) | |
878 (#x0980 #x09FF bengali) | |
879 (#x0A00 #x0A7F gurmukhi) | |
880 (#x0A80 #x0AFF gujarati) | |
881 (#x0B00 #x0B7F oriya) | |
882 (#x0B80 #x0BFF tamil) | |
883 (#x0C00 #x0C7F telugu) | |
884 (#x0C80 #x0CFF kannada) | |
885 (#x0D00 #x0D7F malayalam) | |
886 (#x0D80 #x0DFF sinhala) | |
887 (#x0E00 #x0E5F thai) | |
888 (#x0E80 #x0EDF lao) | |
889 (#x0F00 #x0FFF tibetan) | |
890 (#x1000 #x105F myanmar) | |
891 (#x10A0 #x10FF georgian) | |
892 (#x1100 #x11FF hangul) | |
893 (#x1200 #x137F ethiopic) | |
894 (#x13A0 #x13FF cherokee) | |
895 (#x1400 #x167F canadian-aboriginal) | |
896 (#x1680 #x169F ogham) | |
897 (#x16A0 #x16FF runic) | |
898 (#x1780 #x17FF khmer) | |
899 (#x1800 #x18AF mongolian) | |
900 (#x1E00 #x1EFF latin) | |
901 (#x1F00 #x1FFF greek) | |
902 (#x20A0 #x20AF currency) | |
903 (#x2800 #x28FF braille) | |
904 (#x2E80 #x2FDF han) | |
905 (#x2FF0 #x2FFF ideographic-description) | |
906 (#x3000 #x303F cjk-misc) | |
907 (#x3040 #x30FF kana) | |
908 (#x3100 #x312F bopomofo) | |
909 (#x3130 #x318F hangul) | |
910 (#x3190 #x319F kanbun) | |
911 (#x31A0 #x31BF bopomofo) | |
912 (#x3400 #x9FAF han) | |
913 (#xA000 #xA4CF yi) | |
914 (#xAC00 #xD7AF hangul) | |
915 (#xF900 #xFA5F han) | |
916 (#xFB1D #xFB4F hebrew) | |
917 (#xFB50 #xFDFF arabic) | |
918 (#xFE70 #xFEFC arabic) | |
919 (#xFF00 #xFF5F cjk-misc) | |
920 (#xFF61 #xFF9F kana) | |
921 (#xFFE0 #xFFE6 cjk-misc) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
922 (#x20000 #x2AFFF han) |
88892 | 923 (#x2F800 #x2FFFF han))) |
924 (set-char-table-range char-script-table | |
925 (cons (car elt) (nth 1 elt)) (nth 2 elt)) | |
926 (or (memq (nth 2 elt) script-list) | |
927 (setq script-list (cons (nth 2 elt) script-list)))) | |
928 (set-char-table-extra-slot char-script-table 0 (nreverse script-list))) | |
929 | |
89483 | 930 (map-charset-chars |
89291
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
931 #'(lambda (range ignore) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
932 (set-char-table-range char-script-table range 'tibetan)) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
933 'tibetan) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
934 |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
935 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
936 ;;; Setting word boundary. |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
937 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
938 (defun next-word-boundary-han (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
939 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
940 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
941 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
942 (looking-at "\\cC+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
943 (goto-char (match-end 0)) |
88976
fc96b539a43c
(next-word-boundary-han): Don't
Kenichi Handa <handa@m17n.org>
parents:
88971
diff
changeset
|
944 (if (looking-at "\\cH+") |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
945 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
946 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
947 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
948 (eq (aref char-script-table (char-after (1- pos))) 'han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
949 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
950 pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
951 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
952 (defun next-word-boundary-kana (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
953 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
954 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
955 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
956 (if (looking-at "\\cK+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
957 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
958 (if (looking-at "\\cH+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
959 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
960 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
961 (let ((category-set (char-category-set (char-after pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
962 category) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
963 (if (aref category-set ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
964 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
965 (aref (char-category-set (char-after (1- pos))) ?K)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
966 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
967 (while (and (> pos limit) |
89483 | 968 (aref (setq category-set |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
969 (char-category-set (char-after (1- pos)))) ?H)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
970 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
971 (setq category (cond ((aref category-set ?C) ?C) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
972 ((aref category-set ?K) ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
973 ((aref category-set ?A) ?A))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
974 (when category |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
975 (setq pos (1- pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
976 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
977 (aref (char-category-set (char-after (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
978 category)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
979 (setq pos (1- pos))))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
980 pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
981 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
982 (map-char-table |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
983 #'(lambda (char script) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
984 (cond ((eq script 'han) |
89483 | 985 (set-char-table-range find-word-boundary-function-table |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
986 char #'next-word-boundary-han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
987 ((eq script 'kana) |
89483 | 988 (set-char-table-range find-word-boundary-function-table |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
989 char #'next-word-boundary-kana)))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
990 char-script-table) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
991 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
992 (setq word-combining-categories |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
993 '((?l . ?l))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
994 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
995 (setq word-separating-categories ; (2-byte character sets) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
996 '((?A . ?K) ; Alpha numeric - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
997 (?A . ?C) ; Alpha numeric - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
998 (?H . ?A) ; Hiragana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
999 (?H . ?K) ; Hiragana - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1000 (?H . ?C) ; Hiragana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1001 (?K . ?A) ; Katakana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1002 (?K . ?C) ; Katakana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1003 (?C . ?A) ; Chinese - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1004 (?C . ?K) ; Chinese - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1005 )) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1006 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1007 ;;; Local Variables: |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
1008 ;;; coding: utf-8-emacs |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1009 ;;; End: |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1010 |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
1011 ;;; characters.el ends here |