Mercurial > emacs
annotate lisp/international/characters.el @ 89735:9de89ee4b8c6
(define-charset): If :code-space is not given, generate it from
:dimension.
author | Kenichi Handa <handa@m17n.org> |
---|---|
date | Sun, 25 Jan 2004 07:33:44 +0000 |
parents | 9f9a2db0bc5a |
children | 186081a2790b |
rev | line source |
---|---|
17052 | 1 ;;; characters.el --- set syntax and category for multibyte characters |
2 | |
19958
f7ab34ae92bb
Set case-table for Cyrillic characters.
Richard M. Stallman <rms@gnu.org>
parents:
18561
diff
changeset
|
3 ;; Copyright (C) 1995, 1997 Electrotechnical Laboratory, JAPAN. |
89483 | 4 ;; Licensed to the Free Software Foundation. |
88612 | 5 ;; Copyright (C) 2001, 2002 Free Software Foundation, Inc. |
89483 | 6 ;; Copyright (C) 2003 |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
7 ;; National Institute of Advanced Industrial Science and Technology (AIST) |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
8 ;; Registration Number H13PRO009 |
17052 | 9 |
10 ;; Keywords: multibyte character, character set, syntax, category | |
11 | |
12 ;; This file is part of GNU Emacs. | |
13 | |
14 ;; GNU Emacs is free software; you can redistribute it and/or modify | |
15 ;; it under the terms of the GNU General Public License as published by | |
16 ;; the Free Software Foundation; either version 2, or (at your option) | |
17 ;; any later version. | |
18 | |
19 ;; GNU Emacs is distributed in the hope that it will be useful, | |
20 ;; but WITHOUT ANY WARRANTY; without even the implied warranty of | |
21 ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | |
22 ;; GNU General Public License for more details. | |
23 | |
24 ;; You should have received a copy of the GNU General Public License | |
17071 | 25 ;; along with GNU Emacs; see the file COPYING. If not, write to the |
26 ;; Free Software Foundation, Inc., 59 Temple Place - Suite 330, | |
27 ;; Boston, MA 02111-1307, USA. | |
17052 | 28 |
29 ;;; Commentary: | |
30 | |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
31 ;;; Code: |
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
32 |
17052 | 33 ;;; Predefined categories. |
34 | |
35 ;; For each character set. | |
36 | |
37 (define-category ?a "ASCII") | |
38 (define-category ?l "Latin") | |
39 (define-category ?t "Thai") | |
40 (define-category ?g "Greek") | |
41 (define-category ?b "Arabic") | |
42 (define-category ?w "Hebrew") | |
43 (define-category ?y "Cyrillic") | |
44 (define-category ?k "Japanese katakana") | |
45 (define-category ?r "Japanese roman") | |
46 (define-category ?c "Chinese") | |
47 (define-category ?j "Japanese") | |
48 (define-category ?h "Korean") | |
49 (define-category ?e "Ethiopic (Ge'ez)") | |
50 (define-category ?v "Vietnamese") | |
51 (define-category ?i "Indian") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
52 (define-category ?o "Lao") |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
53 (define-category ?q "Tibetan") |
17052 | 54 |
55 ;; For each group (row) of 2-byte character sets. | |
56 | |
22529 | 57 (define-category ?A "Alpha-numeric characters of 2-byte character sets") |
17052 | 58 (define-category ?C "Chinese (Han) characters of 2-byte character sets") |
22529 | 59 (define-category ?G "Greek characters of 2-byte character sets") |
17052 | 60 (define-category ?H "Japanese Hiragana characters of 2-byte character sets") |
61 (define-category ?K "Japanese Katakana characters of 2-byte character sets") | |
62 (define-category ?N "Korean Hangul characters of 2-byte character sets") | |
22529 | 63 (define-category ?Y "Cyrillic characters of 2-byte character sets") |
17052 | 64 (define-category ?I "Indian Glyphs") |
65 | |
66 ;; For phonetic classifications. | |
67 | |
68 (define-category ?0 "consonant") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
69 (define-category ?1 "base (independent) vowel") |
17052 | 70 (define-category ?2 "upper diacritical mark (including upper vowel)") |
71 (define-category ?3 "lower diacritical mark (including lower vowel)") | |
72 (define-category ?4 "tone mark") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
73 (define-category ?5 "symbol") |
17052 | 74 (define-category ?6 "digit") |
75 (define-category ?7 "vowel-modifying diacritical mark") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
76 (define-category ?8 "vowel-signs") |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
77 (define-category ?9 "semivowel lower") |
17052 | 78 |
79 ;; For filling. | |
80 (define-category ?| "While filling, we can break a line at this character.") | |
81 | |
20247 | 82 ;; For indentation calculation. |
49769
5ed59fd23bc4
Use new ?\s syntax instead of "? " followed by a comment.
Juanma Barranquero <lekktu@gmail.com>
parents:
49601
diff
changeset
|
83 (define-category ?\s |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
84 "This character counts as a space for indentation purposes.") |
20247 | 85 |
22529 | 86 ;; Keep the following for `kinsoku' processing. See comments in |
17052 | 87 ;; kinsoku.el. |
88 (define-category ?> "A character which can't be placed at beginning of line.") | |
89 (define-category ?< "A character which can't be placed at end of line.") | |
90 | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
91 ;; Combining |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
92 (define-category ?^ "Combining diacritic or mark") |
17052 | 93 |
94 ;;; Setting syntax and category. | |
95 | |
96 ;; ASCII | |
97 | |
88425
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
98 ;; All ASCII characters have the category `a' (ASCII) and `l' (Latin). |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
99 (modify-category-entry '(32 . 127) ?a) |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
100 (modify-category-entry '(32 . 127) ?l) |
17052 | 101 |
89314 | 102 ;; Deal with the CJK charsets first. Since the syntax of blocks is |
103 ;; defined per charset, and the charsets may contain e.g. Latin | |
104 ;; characters, we end up with the wrong syntax definitions if we're | |
105 ;; not careful. | |
17052 | 106 |
88752 | 107 ;; Chinese characters (Unicode) |
108 (modify-category-entry '(#x3400 . #x9FAF) ?C) | |
109 (modify-category-entry '(#x3400 . #x9FAF) ?c) | |
110 (modify-category-entry '(#x3400 . #x9FAF) ?|) | |
111 (modify-category-entry '(#xF900 . #xFAFF) ?C) | |
112 (modify-category-entry '(#xF900 . #xFAFF) ?c) | |
113 (modify-category-entry '(#xF900 . #xFAFF) ?|) | |
89680
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
114 (modify-category-entry '(#x20000 . #x2AFFF) ?|) |
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
115 (modify-category-entry '(#x2F800 . #x2FFFF) ?|) |
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
116 |
88752 | 117 |
17052 | 118 ;; Chinese character set (GB2312) |
119 | |
88752 | 120 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2121 #x217E) |
121 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2221 #x227E) | |
122 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2921 #x297E) | |
17052 | 123 |
88723 | 124 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?c) |
89014 | 125 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2330 #x2339) |
126 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2341 #x235A) | |
127 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2361 #x237A) | |
88752 | 128 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?H #x2421 #x247E) |
129 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?K #x2521 #x257E) | |
130 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?G #x2621 #x267E) | |
131 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?Y #x2721 #x277E) | |
132 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?C #x3021 #x7E7E) | |
17052 | 133 |
134 ;; Chinese character set (BIG5) | |
135 | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
136 (map-charset-chars #'modify-category-entry 'big5 ?c) |
89014 | 137 (map-charset-chars #'modify-category-entry 'big5 ?C #xA259 #xA25F) |
138 (map-charset-chars #'modify-category-entry 'big5 ?C #xA440 #xC67E) | |
139 (map-charset-chars #'modify-category-entry 'big5 ?C #xC940 #xF9DF) | |
17052 | 140 |
141 ;; Chinese character set (CNS11643) | |
142 | |
88723 | 143 (dolist (c '(chinese-cns11643-1 chinese-cns11643-2 chinese-cns11643-3 |
144 chinese-cns11643-4 chinese-cns11643-5 chinese-cns11643-6 | |
145 chinese-cns11643-7)) | |
146 (map-charset-chars #'modify-category-entry c ?c) | |
89014 | 147 (if (eq c 'chinese-cns11643-1) |
148 (map-charset-chars #'modify-category-entry c ?C #x4421 #x7E7E) | |
89680
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
149 (map-charset-chars #'modify-category-entry c ?C))) |
17052 | 150 |
89483 | 151 ;; Japanese character set (JISX0201, JISX0208, JISX0212, JISX0213) |
17052 | 152 |
88752 | 153 (map-charset-chars #'modify-category-entry 'katakana-jisx0201 ?k) |
154 | |
155 (map-charset-chars #'modify-category-entry 'latin-jisx0201 ?r) | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
156 |
89483 | 157 (dolist (l '(katakana-jisx0201 japanese-jisx0208 japanese-jisx0212 |
158 japanese-jisx0213-1 japanese-jisx0213-2)) | |
89680
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
159 (map-charset-chars #'modify-category-entry l ?j)) |
17052 | 160 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
161 ;; Unicode equivalents of JISX0201-kana |
88752 | 162 (let ((range '(#xff61 . #xff9f))) |
163 (modify-category-entry range ?k) | |
164 (modify-category-entry range ?j) | |
165 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
166 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
167 ;; Katakana block |
88752 | 168 (let ((range '(#x30a0 . #x30ff))) |
169 ;; ?K is double width, ?k isn't specified | |
170 (modify-category-entry range ?K) | |
171 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
172 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
173 ;; Hiragana block |
89483 | 174 (let ((range '(#x3040 . #x309d))) |
88752 | 175 ;; ?H is actually defined to be double width |
176 ;;(modify-category-entry range ?H) | |
89680
9f9a2db0bc5a
Fix setting of category `|'.
Kenichi Handa <handa@m17n.org>
parents:
89635
diff
changeset
|
177 (modify-category-entry range ?\|) |
88752 | 178 ) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
179 |
17052 | 180 ;; JISX0208 |
88752 | 181 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2121 #x227E) |
182 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2821 #x287E) | |
183 (let ((chars '(?ー ?ã‚› ?ã‚œ ?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) | |
88519
98ec5d812ccc
Set syntax/category for japanese-jisx0208.
Kenichi Handa <handa@m17n.org>
parents:
88492
diff
changeset
|
184 (dolist (elt chars) |
88612 | 185 (modify-syntax-entry (car chars) "w"))) |
17052 | 186 |
88752 | 187 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?A #x2321 #x237E) |
188 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?H #x2421 #x247E) | |
189 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?K #x2521 #x257E) | |
190 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?G #x2621 #x267E) | |
191 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?Y #x2721 #x277E) | |
192 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?C #x3021 #x7E7E) | |
193 (modify-category-entry ?ー ?K) | |
194 (let ((chars '(?゛ ?゜))) | |
17052 | 195 (while chars |
196 (modify-category-entry (car chars) ?K) | |
197 (modify-category-entry (car chars) ?H) | |
198 (setq chars (cdr chars)))) | |
88752 | 199 (let ((chars '(?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) |
17052 | 200 (while chars |
201 (modify-category-entry (car chars) ?C) | |
202 (setq chars (cdr chars)))) | |
203 | |
204 ;; JISX0212 | |
88723 | 205 |
88752 | 206 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0212 "_" #x2121 #x237E) |
17052 | 207 |
208 ;; JISX0201-Kana | |
88723 | 209 |
88612 | 210 (let ((chars '(?。 ?、 ?・))) |
17052 | 211 (while chars |
212 (modify-syntax-entry (car chars) ".") | |
213 (setq chars (cdr chars)))) | |
214 | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
215 (modify-syntax-entry ?\ï½¢ "(ï½£") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
216 (modify-syntax-entry ?\ï½£ "(ï½¢") |
36688
45b172fa5c17
Add category `j' for katakana-jisx0201. Fix syntaxes of
Kenichi Handa <handa@m17n.org>
parents:
34507
diff
changeset
|
217 |
17052 | 218 ;; Korean character set (KSC5601) |
219 | |
88723 | 220 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?h) |
88752 | 221 |
222 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2121 #x227E) | |
89117 | 223 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2621 #x277E) |
224 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2830 #x287E) | |
225 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2930 #x297E) | |
89014 | 226 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2330 #x2339) |
227 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2341 #x235A) | |
228 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2361 #x237A) | |
88752 | 229 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?G #x2521 #x257E) |
230 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?H #x2A21 #x2A7E) | |
231 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?K #x2B21 #x2B7E) | |
232 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?Y #x2C21 #x2C7E) | |
17052 | 233 |
89314 | 234 ;; These are in more than one charset. |
89483 | 235 (let ((parens (concat "〈〉《》「ã€ã€Žã€ã€ã€‘〔〕〖〗〘〙〚〛" |
236 "︵︶︷︸︹︺︻︼︽︾︿﹀ï¹ï¹‚﹃﹄" | |
237 "()[]{ï½")) | |
238 open close) | |
239 (dotimes (i (/ (length parens) 2)) | |
240 (setq open (aref parens (* i 2)) | |
241 close (aref parens (1+ (* i 2)))) | |
242 (modify-syntax-entry open (format "(%c" close)) | |
243 (modify-syntax-entry close (format ")%c" open)))) | |
89314 | 244 |
245 ;; Arabic character set | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
246 |
89314 | 247 (let ((charsets '(arabic-iso8859-6 |
248 arabic-digit | |
249 arabic-1-column | |
250 arabic-2-column))) | |
251 (while charsets | |
252 (map-charset-chars #'modify-category-entry (car charsets) ?b) | |
253 (setq charsets (cdr charsets)))) | |
254 (modify-category-entry '(#x600 . #x6ff) ?b) | |
255 (modify-category-entry '(#xfb50 . #xfdff) ?b) | |
256 (modify-category-entry '(#xfe70 . #xfefe) ?b) | |
257 | |
258 ;; Cyrillic character set (ISO-8859-5) | |
259 | |
260 (modify-syntax-entry ?â„– ".") | |
261 | |
262 ;; Ethiopic character set | |
263 | |
89600
1ad3832f1d1d
Fix category for Ethiopic.
Kenichi Handa <handa@m17n.org>
parents:
89483
diff
changeset
|
264 (modify-category-entry '(#x1200 . #x137c) ?e) |
89314 | 265 (let ((chars '(?á¡ ?ᢠ?ᣠ?ᤠ?ᥠ?ᦠ?᧠?ᨠ?ö ‡€ ?ö ‡‹ ?ö ‡Œ ?ö ‡ ?ö ‡Ž ?ö ‡))) |
266 (while chars | |
267 (modify-syntax-entry (car chars) ".") | |
268 (setq chars (cdr chars)))) | |
269 (map-charset-chars #'modify-category-entry 'ethiopic ?e) | |
270 | |
271 ;; Hebrew character set (ISO-8859-8) | |
272 | |
273 (modify-syntax-entry #x5be ".") ; MAQAF | |
274 (modify-syntax-entry #x5c0 ".") ; PASEQ | |
275 (modify-syntax-entry #x5c3 ".") ; SOF PASUQ | |
276 (modify-syntax-entry #x5f3 ".") ; GERESH | |
277 (modify-syntax-entry #x5f4 ".") ; GERSHAYIM | |
278 | |
279 ;; Indian character set (IS 13194 and other Emacs original Indian charsets) | |
280 | |
281 (modify-category-entry '(#x901 . #x970) ?i) | |
282 (map-charset-chars #'modify-category-entry 'indian-is13194 ?i) | |
283 (map-charset-chars #'modify-category-entry 'indian-2-column ?i) | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
284 |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
285 ;; Lao character set |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
286 |
88612 | 287 (modify-category-entry '(#xe80 . #xeff) ?o) |
288 (map-charset-chars #'modify-category-entry 'lao ?o) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
289 |
88612 | 290 (let ((deflist '(("àº-ຮ" "w" ?0) ; consonant |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
291 ("ະາຳຽເ-ໄ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
292 ("ັິ-ືົà»" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
293 ("ຸູ" "w" ?3) ; vowel lower |
89483 | 294 ("່-໋" "w" ?4) ; tone mark |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
295 ("ຼຽ" "w" ?9) ; semivowel lower |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
296 ("à»-à»™" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
297 ("ຯໆ" "_" ?5) ; symbol |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
298 )) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
299 elm chars len syntax category to ch i) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
300 (while deflist |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
301 (setq elm (car deflist)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
302 (setq chars (car elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
303 len (length chars) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
304 syntax (nth 1 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
305 category (nth 2 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
306 i 0) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
307 (while (< i len) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
308 (if (= (aref chars i) ?-) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
309 (setq i (1+ i) |
20843 | 310 to (aref chars i)) |
311 (setq ch (aref chars i) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
312 to ch)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
313 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
314 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
315 (modify-syntax-entry ch syntax)) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
316 (modify-category-entry ch category) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
317 (setq ch (1+ ch))) |
20843 | 318 (setq i (1+ i))) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
319 (setq deflist (cdr deflist)))) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
320 |
17052 | 321 ;; Thai character set (TIS620) |
322 | |
88612 | 323 (modify-category-entry '(#xe00 . #xe7f) ?t) |
324 (map-charset-chars #'modify-category-entry 'thai-tis620 ?t) | |
17052 | 325 |
326 (let ((deflist '(;; chars syntax category | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
327 ("à¸-รลว-ฮ" "w" ?0) ; consonant |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
328 ("ฤฦะาำเ-ๅ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
329 ("ัิ-ื็๎" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
330 ("ุ-ฺ" "w" ?3) ; vowel lower |
89483 | 331 ("่-à¹" "w" ?4) ; tone mark |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
332 ("à¹-๙" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
333 ("ฯๆ฿à¹à¹šà¹›" "_" ?5) ; symbol |
17052 | 334 )) |
335 elm chars len syntax category to ch i) | |
336 (while deflist | |
337 (setq elm (car deflist)) | |
338 (setq chars (car elm) | |
339 len (length chars) | |
340 syntax (nth 1 elm) | |
341 category (nth 2 elm) | |
342 i 0) | |
343 (while (< i len) | |
344 (if (= (aref chars i) ?-) | |
345 (setq i (1+ i) | |
20843 | 346 to (aref chars i)) |
347 (setq ch (aref chars i) | |
17052 | 348 to ch)) |
349 (while (<= ch to) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
350 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
351 (modify-syntax-entry ch syntax)) |
17052 | 352 (modify-category-entry ch category) |
353 (setq ch (1+ ch))) | |
20843 | 354 (setq i (1+ i))) |
17052 | 355 (setq deflist (cdr deflist)))) |
356 | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
357 ;; Tibetan character set |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
358 |
88612 | 359 (modify-category-entry '(#xf00 . #xfff) ?q) |
360 (map-charset-chars #'modify-category-entry 'tibetan ?q) | |
361 (map-charset-chars #'modify-category-entry 'tibetan-1-column ?q) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
362 |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
363 (let ((deflist '(;; chars syntax category |
88711 | 364 ("ཀ-ཀྵཪ" "w" ?0) ; consonant |
365 ("à¾-ྐྵྺྻྼö€ö€‚" "w" ?0) ; | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
366 ("ö„š-ö…ž" "w" ?0) ; |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
367 ("ö…¸-ö‡Œ" "w" ?0) ; |
88711 | 368 ("ིེཻོཽྀ" "w" ?2) ; upper vowel |
369 ("ཾྂྃ྆྇ྈྉྊྋ" "w" ?2) ; upper modifier | |
370 ("༙ö‚Žà½±à½´à¾„༵༷" "w" ?3) ; lowel vowel/modifier | |
89483 | 371 ("཰" "w" ?3) ; invisible vowel a |
88711 | 372 ("༠-༩༪-༳" "w" ?6) ; digit |
373 ("་à¼-༒༔ཿ" "." ?|) ; line-break char | |
374 ("་à¼à¼à¼à¼‘༔ཿ" "." ?|) ; | |
375 ("༈་à¼-༒༔ཿ༽༴" "." ?>) ; prohibition | |
376 ("་à¼à¼à¼à¼‘༔ཿ" "." ?>) ; | |
377 ("ༀ-༊༼à¿à¿‚྅" "." ?<) ; prohibition | |
378 ("༓༕-༘༚-༟༶༸-༻༾༿྾྿-à¿" "." ?q) ; others | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
379 )) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
380 elm chars len syntax category to ch i) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
381 (while deflist |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
382 (setq elm (car deflist)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
383 (setq chars (car elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
384 len (length chars) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
385 syntax (nth 1 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
386 category (nth 2 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
387 i 0) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
388 (while (< i len) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
389 (if (= (aref chars i) ?-) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
390 (setq i (1+ i) |
20843 | 391 to (aref chars i)) |
392 (setq ch (aref chars i) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
393 to ch)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
394 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
395 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
396 (modify-syntax-entry ch syntax)) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
397 (modify-category-entry ch category) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
398 (setq ch (1+ ch))) |
20843 | 399 (setq i (1+ i))) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
400 (setq deflist (cdr deflist)))) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
401 |
17052 | 402 ;; Vietnamese character set |
403 | |
88612 | 404 ;; To make a word with Latin characters |
405 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?l) | |
406 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?v) | |
407 | |
408 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?l) | |
409 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?v) | |
17052 | 410 |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
411 (let ((tbl (standard-case-table)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
412 (i 32)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
413 (while (< i 128) |
88711 | 414 (let* ((char (decode-char 'vietnamese-viscii-upper i)) |
415 (charl (decode-char 'vietnamese-viscii-lower i)) | |
416 (uc (encode-char char 'ucs)) | |
417 (lc (encode-char charl 'ucs))) | |
418 (set-case-syntax-pair char (decode-char 'vietnamese-viscii-lower i) | |
419 tbl) | |
420 (if uc (modify-category-entry uc ?v)) | |
421 (if lc (modify-category-entry lc ?v))) | |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
422 (setq i (1+ i)))) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
423 |
89314 | 424 |
425 ;; Latin | |
426 | |
427 (modify-category-entry '(#x80 . #x024F) ?l) | |
428 | |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
429 (let ((tbl (standard-case-table)) c) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
430 |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
431 ;; In some languages, U+0049 LATIN CAPITAL LETTER I and U+0131 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
432 ;; SMALL LETTER DOTLESS I make a case pair, and so do U+0130 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
433 ;; CAPITAL LETTER I WITH DOT ABOVE and U+0069 LATIN SMALL LETTER I. |
89188 | 434 ;; See the Turkish language environment. |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
435 |
89146 | 436 ;; Latin-1 |
437 | |
438 ;; Fixme: Some of the non-word syntaxes here perhaps should be | |
439 ;; reviewed. (Note that the following all implicitly have word | |
440 ;; syntax: ¢£¤¥¨ª¯²³´¶¸¹º.) There should be a well-defined way of | |
441 ;; relating Unicode categories to Emacs syntax codes. | |
442 (set-case-syntax ? " " tbl) ; dubious | |
443 (set-case-syntax ?¡ "." tbl) | |
444 (set-case-syntax ?¦ "_" tbl) | |
445 (set-case-syntax ?§ "." tbl) | |
446 (set-case-syntax ?© "_" tbl) | |
447 (set-case-syntax-delims 171 187 tbl) ; « » | |
448 (set-case-syntax ?¬ "_" tbl) | |
449 (set-case-syntax ? "_" tbl) | |
450 (set-case-syntax ?® "_" tbl) | |
451 (set-case-syntax ?° "_" tbl) | |
452 (set-case-syntax ?± "_" tbl) | |
453 (set-case-syntax ?µ "_" tbl) | |
454 (set-case-syntax ?· "_" tbl) | |
455 (set-case-syntax ?¼ "_" tbl) | |
456 (set-case-syntax ?½ "_" tbl) | |
457 (set-case-syntax ?¾ "_" tbl) | |
458 (set-case-syntax ?¿ "." tbl) | |
459 (let ((c 192)) | |
460 (while (<= c 222) | |
461 (set-case-syntax-pair c (+ c 32) tbl) | |
462 (setq c (1+ c)))) | |
463 (set-case-syntax ?× "_" tbl) | |
464 (set-case-syntax ?ß "w" tbl) | |
465 (set-case-syntax ?÷ "_" tbl) | |
466 ;; See below for ÿ. | |
467 | |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
468 ;; Latin Extended-A, Latin Extended-B |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
469 (setq c #x0100) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
470 (while (<= c #x0233) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
471 (and (or (<= c #x012e) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
472 (and (>= c #x014a) (<= c #x0177))) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
473 (zerop (% c 2)) |
88612 | 474 (set-case-syntax-pair c (1+ c) tbl)) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
475 (and (>= c #x013a) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
476 (<= c #x0148) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
477 (zerop (% c 2)) |
88612 | 478 (set-case-syntax-pair (1- c) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
479 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
480 (set-case-syntax-pair ?IJ ?ij tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
481 (set-case-syntax-pair ?Ĵ ?ĵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
482 (set-case-syntax-pair ?Ķ ?ķ tbl) |
89084 | 483 (set-case-syntax-pair ?Ÿ ?ÿ tbl) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
484 (set-case-syntax-pair ?Ź ?ź tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
485 (set-case-syntax-pair ?Ż ?ż tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
486 (set-case-syntax-pair ?Ž ?ž tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
487 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
488 ;; Latin Extended-B |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
489 (set-case-syntax-pair ?Æ ?É“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
490 (set-case-syntax-pair ?Ƃ ?ƃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
491 (set-case-syntax-pair ?Æ„ ?Æ… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
492 (set-case-syntax-pair ?Ɔ ?ɔ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
493 (set-case-syntax-pair ?Ƈ ?ƈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
494 (set-case-syntax-pair ?Ɖ ?ɖ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
495 (set-case-syntax-pair ?ÆŠ ?É— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
496 (set-case-syntax-pair ?Ƌ ?ƌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
497 (set-case-syntax-pair ?ÆŽ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
498 (set-case-syntax-pair ?Æ ?É™ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
499 (set-case-syntax-pair ?Æ ?É› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
500 (set-case-syntax-pair ?Æ‘ ?Æ’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
501 (set-case-syntax-pair ?Æ“ ?É tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
502 (set-case-syntax-pair ?Æ” ?É£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
503 (set-case-syntax-pair ?Æ– ?É© tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
504 (set-case-syntax-pair ?Ɨ ?ɨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
505 (set-case-syntax-pair ?Ƙ ?ƙ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
506 (set-case-syntax-pair ?Ɯ ?ɯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
507 (set-case-syntax-pair ?Æ ?ɲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
508 (set-case-syntax-pair ?Ɵ ?ɵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
509 (set-case-syntax-pair ?Æ ?Æ¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
510 (set-case-syntax-pair ?Æ¢ ?Æ£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
511 (set-case-syntax-pair ?Ƥ ?ƥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
512 (set-case-syntax-pair ?Ʀ ?ʀ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
513 (set-case-syntax-pair ?Ƨ ?ƨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
514 (set-case-syntax-pair ?Ʃ ?ʃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
515 (set-case-syntax-pair ?Ƭ ?Æ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
516 (set-case-syntax-pair ?Ʈ ?ʈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
517 (set-case-syntax-pair ?Ư ?ư tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
518 (set-case-syntax-pair ?Ʊ ?ʊ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
519 (set-case-syntax-pair ?Ʋ ?ʋ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
520 (set-case-syntax-pair ?Ƴ ?ƴ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
521 (set-case-syntax-pair ?Ƶ ?ƶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
522 (set-case-syntax-pair ?Æ· ?Ê’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
523 (set-case-syntax-pair ?Ƹ ?ƹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
524 (set-case-syntax-pair ?Ƽ ?ƽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
525 (set-case-syntax-pair ?DŽ ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
526 (set-case-syntax-pair ?Dž ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
527 (set-case-syntax-pair ?LJ ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
528 (set-case-syntax-pair ?Lj ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
529 (set-case-syntax-pair ?NJ ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
530 (set-case-syntax-pair ?Nj ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
531 (set-case-syntax-pair ?Ç ?ÇŽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
532 (set-case-syntax-pair ?Ç ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
533 (set-case-syntax-pair ?Ç‘ ?Ç’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
534 (set-case-syntax-pair ?Ç“ ?Ç” tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
535 (set-case-syntax-pair ?Ç• ?Ç– tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
536 (set-case-syntax-pair ?Ǘ ?ǘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
537 (set-case-syntax-pair ?Ç™ ?Çš tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
538 (set-case-syntax-pair ?Ǜ ?ǜ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
539 (set-case-syntax-pair ?Çž ?ÇŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
540 (set-case-syntax-pair ?Ç ?Ç¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
541 (set-case-syntax-pair ?Ç¢ ?Ç£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
542 (set-case-syntax-pair ?Ǥ ?ǥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
543 (set-case-syntax-pair ?Ǧ ?ǧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
544 (set-case-syntax-pair ?Ǩ ?ǩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
545 (set-case-syntax-pair ?Ǫ ?ǫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
546 (set-case-syntax-pair ?Ǭ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
547 (set-case-syntax-pair ?Ǯ ?ǯ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
548 ;; 01F0; F; 006A 030C; # LATIN SMALL LETTER J WITH CARON |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
549 (set-case-syntax-pair ?DZ ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
550 (set-case-syntax-pair ?Dz ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
551 (set-case-syntax-pair ?Ǵ ?ǵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
552 (set-case-syntax-pair ?Ƕ ?ƕ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
553 (set-case-syntax-pair ?Ç· ?Æ¿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
554 (set-case-syntax-pair ?Ǹ ?ǹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
555 (set-case-syntax-pair ?Ǻ ?ǻ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
556 (set-case-syntax-pair ?Ǽ ?ǽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
557 (set-case-syntax-pair ?Ǿ ?ǿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
558 (set-case-syntax-pair ?È€ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
559 (set-case-syntax-pair ?Ȃ ?ȃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
560 (set-case-syntax-pair ?È„ ?È… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
561 (set-case-syntax-pair ?Ȇ ?ȇ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
562 (set-case-syntax-pair ?Ȉ ?ȉ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
563 (set-case-syntax-pair ?ÈŠ ?È‹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
564 (set-case-syntax-pair ?ÈŒ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
565 (set-case-syntax-pair ?ÈŽ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
566 (set-case-syntax-pair ?È ?È‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
567 (set-case-syntax-pair ?È’ ?È“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
568 (set-case-syntax-pair ?È” ?È• tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
569 (set-case-syntax-pair ?È– ?È— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
570 (set-case-syntax-pair ?Ș ?ș tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
571 (set-case-syntax-pair ?Èš ?È› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
572 (set-case-syntax-pair ?Èœ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
573 (set-case-syntax-pair ?Èž ?ÈŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
574 (set-case-syntax-pair ?È¢ ?È£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
575 (set-case-syntax-pair ?Ȥ ?ȥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
576 (set-case-syntax-pair ?Ȧ ?ȧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
577 (set-case-syntax-pair ?Ȩ ?ȩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
578 (set-case-syntax-pair ?Ȫ ?ȫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
579 (set-case-syntax-pair ?Ȭ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
580 (set-case-syntax-pair ?Ȯ ?ȯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
581 (set-case-syntax-pair ?Ȱ ?ȱ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
582 (set-case-syntax-pair ?Ȳ ?ȳ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
583 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
584 ;; Latin Extended Additional |
88612 | 585 (modify-category-entry '(#x1e00 . #x1ef9) ?l) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
586 (setq c #x1e00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
587 (while (<= c #x1ef9) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
588 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
589 (or (<= c #x1e94) (>= c #x1ea0)) |
88612 | 590 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
591 (setq c (1+ c))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
592 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
593 ;; Greek |
88612 | 594 (modify-category-entry '(#x0370 . #x03ff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
595 (setq c #x0370) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
596 (while (<= c #x03ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
597 (if (or (and (>= c #x0391) (<= c #x03a1)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
598 (and (>= c #x03a3) (<= c #x03ab))) |
88612 | 599 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
600 (and (>= c #x03da) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
601 (<= c #x03ee) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
602 (zerop (% c 2)) |
88612 | 603 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
604 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
605 (set-case-syntax-pair ?Ά ?ά tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
606 (set-case-syntax-pair ?Έ ?Πtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
607 (set-case-syntax-pair ?Ή ?ή tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
608 (set-case-syntax-pair ?Ί ?ί tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
609 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
610 (set-case-syntax-pair ?ÎŽ ?Ï tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
611 (set-case-syntax-pair ?Î ?ÏŽ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
612 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
613 ;; Armenian |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
614 (setq c #x531) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
615 (while (<= c #x556) |
88612 | 616 (set-case-syntax-pair c (+ c #x30) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
617 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
618 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
619 ;; Greek Extended |
88612 | 620 (modify-category-entry '(#x1f00 . #x1fff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
621 (setq c #x1f00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
622 (while (<= c #x1fff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
623 (and (<= (logand c #x000f) 7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
624 (<= c #x1fa7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
625 (not (memq c '(#x1f50 #x1f52 #x1f54 #x1f56))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
626 (/= (logand c #x00f0) 7) |
88612 | 627 (set-case-syntax-pair (+ c 8) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
628 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
629 (set-case-syntax-pair ?Ᾰ ?ᾰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
630 (set-case-syntax-pair ?á¾¹ ?á¾± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
631 (set-case-syntax-pair ?Ὰ ?ὰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
632 (set-case-syntax-pair ?á¾» ?á½± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
633 (set-case-syntax-pair ?á¾¼ ?á¾³ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
634 (set-case-syntax-pair ?Ὲ ?ὲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
635 (set-case-syntax-pair ?Έ ?έ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
636 (set-case-syntax-pair ?á¿Š ?á½´ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
637 (set-case-syntax-pair ?á¿‹ ?á½µ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
638 (set-case-syntax-pair ?ῌ ?ῃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
639 (set-case-syntax-pair ?Ῐ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
640 (set-case-syntax-pair ?á¿™ ?á¿‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
641 (set-case-syntax-pair ?Ὶ ?ὶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
642 (set-case-syntax-pair ?á¿› ?á½· tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
643 (set-case-syntax-pair ?Ῠ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
644 (set-case-syntax-pair ?á¿© ?á¿¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
645 (set-case-syntax-pair ?Ὺ ?ὺ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
646 (set-case-syntax-pair ?á¿« ?á½» tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
647 (set-case-syntax-pair ?Ῥ ?ῥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
648 (set-case-syntax-pair ?Ὸ ?ὸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
649 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
650 (set-case-syntax-pair ?Ὼ ?ὼ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
651 (set-case-syntax-pair ?á¿» ?á½½ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
652 (set-case-syntax-pair ?ῼ ?ῳ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
653 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
654 ;; cyrillic |
88612 | 655 (modify-category-entry '(#x0400 . #x04FF) ?y) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
656 (setq c #x0400) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
657 (while (<= c #x04ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
658 (and (>= c #x0400) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
659 (<= c #x040f) |
88612 | 660 (set-case-syntax-pair c (+ c 80) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
661 (and (>= c #x0410) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
662 (<= c #x042f) |
88612 | 663 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
664 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
665 (or (and (>= c #x0460) (<= c #x0480)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
666 (and (>= c #x048c) (<= c #x04be)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
667 (and (>= c #x04d0) (<= c #x04f4))) |
89483 | 668 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
669 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
670 (set-case-syntax-pair ?Ó ?Ó‚ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
671 (set-case-syntax-pair ?Óƒ ?Ó„ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
672 (set-case-syntax-pair ?Ó‡ ?Óˆ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
673 (set-case-syntax-pair ?Ӌ ?ӌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
674 (set-case-syntax-pair ?Ó¸ ?Ó¹ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
675 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
676 ;; general punctuation |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
677 (setq c #x2000) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
678 (while (<= c #x200b) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
679 (set-case-syntax c " " tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
680 (setq c (1+ c))) |
89119 | 681 (while (<= c #x200F) |
682 (set-case-syntax c "." tbl) | |
683 (setq c (1+ c))) | |
684 ;; Fixme: These aren't all right: | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
685 (while (<= c #x2027) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
686 (set-case-syntax c "_" tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
687 (setq c (1+ c))) |
89119 | 688 (while (<= c #x206F) |
689 (set-case-syntax c "." tbl) | |
690 (setq c (1+ c))) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
691 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
692 ;; Roman numerals |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
693 (setq c #x2160) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
694 (while (<= c #x216f) |
88612 | 695 (set-case-syntax-pair c (+ c #x10) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
696 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
697 |
89146 | 698 ;; Fixme: The following blocks might be better as symbol rather than |
699 ;; punctuation. | |
89119 | 700 ;; Arrows |
701 (setq c #x2190) | |
89121 | 702 (while (<= c #x21FF) |
703 (set-case-syntax c "." tbl) | |
89119 | 704 (setq c (1+ c))) |
705 ;; Mathematical Operators | |
706 (while (<= c #x22FF) | |
89121 | 707 (set-case-syntax c "." tbl) |
89119 | 708 (setq c (1+ c))) |
709 ;; Miscellaneous Technical | |
710 (while (<= c #x23FF) | |
89121 | 711 (set-case-syntax c "." tbl) |
89119 | 712 (setq c (1+ c))) |
713 ;; Control Pictures | |
714 (while (<= c #x243F) | |
89121 | 715 (set-case-syntax c "_" tbl) |
716 (setq c (1+ c))) | |
89119 | 717 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
718 ;; Circled Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
719 (setq c #x24b6) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
720 (while (<= c #x24cf) |
88612 | 721 (set-case-syntax-pair c (+ c 26) tbl) |
722 (modify-category-entry c ?l) | |
723 (modify-category-entry (+ c 26) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
724 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
725 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
726 ;; Fullwidth Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
727 (setq c #xff21) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
728 (while (<= c #xff3a) |
88612 | 729 (set-case-syntax-pair c (+ c #x20) tbl) |
730 (modify-category-entry c ?l) | |
731 (modify-category-entry (+ c #x20) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
732 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
733 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
734 ;; Combining diacritics |
88612 | 735 (modify-category-entry '(#x300 . #x362) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
736 ;; Combining marks |
88612 | 737 (modify-category-entry '(#x20d0 . #x20e3) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
738 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
739 ;; Fixme: syntax for symbols &c |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
740 ) |
17052 | 741 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
742 ;; For each character set, put the information of the most proper |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
743 ;; coding system to encode it by `preferred-coding-system' property. |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
744 |
88612 | 745 ;; Fixme: should this be junked? |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
746 (let ((l '((latin-iso8859-1 . iso-latin-1) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
747 (latin-iso8859-2 . iso-latin-2) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
748 (latin-iso8859-3 . iso-latin-3) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
749 (latin-iso8859-4 . iso-latin-4) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
750 (thai-tis620 . thai-tis620) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
751 (greek-iso8859-7 . greek-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
752 (arabic-iso8859-6 . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
753 (hebrew-iso8859-8 . hebrew-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
754 (katakana-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
755 (latin-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
756 (cyrillic-iso8859-5 . cyrillic-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
757 (latin-iso8859-9 . iso-latin-5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
758 (japanese-jisx0208-1978 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
759 (chinese-gb2312 . cn-gb-2312) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
760 (japanese-jisx0208 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
761 (korean-ksc5601 . iso-2022-kr) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
762 (japanese-jisx0212 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
763 (chinese-cns11643-1 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
764 (chinese-cns11643-2 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
765 (chinese-big5-1 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
766 (chinese-big5-2 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
767 (chinese-sisheng . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
768 (ipa . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
769 (vietnamese-viscii-lower . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
770 (vietnamese-viscii-upper . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
771 (arabic-digit . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
772 (arabic-1-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
773 (lao . lao) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
774 (arabic-2-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
775 (indian-is13194 . devanagari) |
41466
b364f798b789
Adjusted for the change of indian-1-column -> indian-glyph.
Richard M. Stallman <rms@gnu.org>
parents:
38414
diff
changeset
|
776 (indian-glyph . devanagari) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
777 (tibetan-1-column . tibetan) |
24256
47e6aa168a66
Let ethiopic use iso-2022-7bit.
Kenichi Handa <handa@m17n.org>
parents:
24208
diff
changeset
|
778 (ethiopic . iso-2022-7bit) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
779 (chinese-cns11643-3 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
780 (chinese-cns11643-4 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
781 (chinese-cns11643-5 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
782 (chinese-cns11643-6 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
783 (chinese-cns11643-7 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
784 (indian-2-column . devanagari) |
26013 | 785 (tibetan . tibetan) |
786 (latin-iso8859-14 . iso-latin-8) | |
787 (latin-iso8859-15 . iso-latin-9)))) | |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
788 (while l |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
789 (put-charset-property (car (car l)) 'preferred-coding-system (cdr (car l))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
790 (setq l (cdr l)))) |
23493 | 791 |
792 | |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
793 ;; Setup auto-fill-chars for charsets that should invoke auto-filling. |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
794 ;; SPACE and NEWLINE are already set. Also put `nospace-between-words' |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
795 ;; property on the charsets. |
23493 | 796 (let ((l '(katakana-jisx0201 |
797 japanese-jisx0208 japanese-jisx0212 | |
798 chinese-gb2312 chinese-big5-1 chinese-big5-2))) | |
799 (while l | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
800 ;;(aset auto-fill-chars (make-char (car l)) t) |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
801 (put-charset-property (car l) 'nospace-between-words t) |
23493 | 802 (setq l (cdr l)))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
803 |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
804 |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
805 ;; CJK double width characters. |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
806 (let ((l '((#x1100 . #x11FF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
807 (#x2E80 . #x9FAF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
808 (#xAC00 . #xD7AF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
809 (#xF900 . #xFAFF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
810 (#xFE30 . #xFE4F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
811 (#xFF00 . #xFF5F) |
89483 | 812 (#xFFE0 . #xFFEF) |
813 (#x20000 . #x2AFFF) | |
814 (#x2F800 . #x2FFFF)))) | |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
815 (dolist (elt l) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
816 (set-char-table-range char-width-table |
88612 | 817 (cons (car elt) (cdr elt)) |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
818 2))) |
89117 | 819 ;; Fixme: Doing this affects non-CJK characters through unification, |
820 ;; but presumably CJK users expect those characters to be | |
821 ;; double-width when using these charsets. | |
822 ;; (map-charset-chars | |
823 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
824 ;; 'japanese-jisx0208) | |
825 ;; (map-charset-chars | |
826 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
827 ;; 'japanese-jisx0212) | |
828 ;; (map-charset-chars | |
829 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
830 ;; 'japanese-jisx0213-1) | |
831 ;; (map-charset-chars | |
832 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
833 ;; 'japanese-jisx0213-2) | |
834 ;; (map-charset-chars | |
835 ;; (lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
836 ;; 'korean-ksc5601) | |
88642
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
837 |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
838 ;; Other double width |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
839 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
840 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
841 'ethiopic) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
842 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
843 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
844 'tibetan) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
845 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
846 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
847 'indian-2-column) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
848 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
849 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
850 'arabic-2-column) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
851 |
88723 | 852 (optimize-char-table (standard-case-table)) |
853 (optimize-char-table char-width-table) | |
854 (optimize-char-table (standard-category-table)) | |
855 (optimize-char-table (standard-syntax-table)) | |
856 | |
89119 | 857 ;; The Unicode blocks actually extend past some of these ranges with |
858 ;; undefined codepoints. | |
88892 | 859 (let ((script-list nil)) |
860 (dolist | |
861 (elt | |
89008
3e08629c1469
(script-list): Treat ascii characters as `latin' script.
Kenichi Handa <handa@m17n.org>
parents:
88976
diff
changeset
|
862 '((#x0000 #x007F latin) |
88892 | 863 (#x00A0 #x036F latin) |
864 (#x0370 #x03E1 greek) | |
865 (#x03E2 #x03EF coptic) | |
866 (#x03F0 #x03F3 greek) | |
867 (#x0400 #x04FF cyrillic) | |
868 (#x0530 #x058F armenian) | |
869 (#x0590 #x05FF hebrew) | |
870 (#x0600 #x06FF arabic) | |
871 (#x0700 #x074F syriac) | |
872 (#x0780 #x07BF thaana) | |
873 (#x0900 #x097F devanagari) | |
874 (#x0980 #x09FF bengali) | |
875 (#x0A00 #x0A7F gurmukhi) | |
876 (#x0A80 #x0AFF gujarati) | |
877 (#x0B00 #x0B7F oriya) | |
878 (#x0B80 #x0BFF tamil) | |
879 (#x0C00 #x0C7F telugu) | |
880 (#x0C80 #x0CFF kannada) | |
881 (#x0D00 #x0D7F malayalam) | |
882 (#x0D80 #x0DFF sinhala) | |
883 (#x0E00 #x0E5F thai) | |
884 (#x0E80 #x0EDF lao) | |
885 (#x0F00 #x0FFF tibetan) | |
886 (#x1000 #x105F myanmar) | |
887 (#x10A0 #x10FF georgian) | |
888 (#x1100 #x11FF hangul) | |
889 (#x1200 #x137F ethiopic) | |
890 (#x13A0 #x13FF cherokee) | |
891 (#x1400 #x167F canadian-aboriginal) | |
892 (#x1680 #x169F ogham) | |
893 (#x16A0 #x16FF runic) | |
894 (#x1780 #x17FF khmer) | |
895 (#x1800 #x18AF mongolian) | |
896 (#x1E00 #x1EFF latin) | |
897 (#x1F00 #x1FFF greek) | |
89635
3d2f4775a302
(script-list): Include `currency' in `symbol'.
Kenichi Handa <handa@m17n.org>
parents:
89600
diff
changeset
|
898 (#x2000 #x27FF symbol) |
88892 | 899 (#x2800 #x28FF braille) |
900 (#x2E80 #x2FDF han) | |
901 (#x2FF0 #x2FFF ideographic-description) | |
902 (#x3000 #x303F cjk-misc) | |
903 (#x3040 #x30FF kana) | |
904 (#x3100 #x312F bopomofo) | |
905 (#x3130 #x318F hangul) | |
906 (#x3190 #x319F kanbun) | |
907 (#x31A0 #x31BF bopomofo) | |
908 (#x3400 #x9FAF han) | |
909 (#xA000 #xA4CF yi) | |
910 (#xAC00 #xD7AF hangul) | |
911 (#xF900 #xFA5F han) | |
912 (#xFB1D #xFB4F hebrew) | |
913 (#xFB50 #xFDFF arabic) | |
914 (#xFE70 #xFEFC arabic) | |
915 (#xFF00 #xFF5F cjk-misc) | |
916 (#xFF61 #xFF9F kana) | |
917 (#xFFE0 #xFFE6 cjk-misc) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
918 (#x20000 #x2AFFF han) |
88892 | 919 (#x2F800 #x2FFFF han))) |
920 (set-char-table-range char-script-table | |
921 (cons (car elt) (nth 1 elt)) (nth 2 elt)) | |
922 (or (memq (nth 2 elt) script-list) | |
923 (setq script-list (cons (nth 2 elt) script-list)))) | |
924 (set-char-table-extra-slot char-script-table 0 (nreverse script-list))) | |
925 | |
89483 | 926 (map-charset-chars |
89291
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
927 #'(lambda (range ignore) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
928 (set-char-table-range char-script-table range 'tibetan)) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
929 'tibetan) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
930 |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
931 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
932 ;;; Setting word boundary. |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
933 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
934 (defun next-word-boundary-han (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
935 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
936 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
937 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
938 (looking-at "\\cC+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
939 (goto-char (match-end 0)) |
88976
fc96b539a43c
(next-word-boundary-han): Don't
Kenichi Handa <handa@m17n.org>
parents:
88971
diff
changeset
|
940 (if (looking-at "\\cH+") |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
941 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
942 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
943 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
944 (eq (aref char-script-table (char-after (1- pos))) 'han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
945 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
946 pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
947 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
948 (defun next-word-boundary-kana (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
949 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
950 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
951 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
952 (if (looking-at "\\cK+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
953 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
954 (if (looking-at "\\cH+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
955 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
956 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
957 (let ((category-set (char-category-set (char-after pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
958 category) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
959 (if (aref category-set ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
960 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
961 (aref (char-category-set (char-after (1- pos))) ?K)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
962 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
963 (while (and (> pos limit) |
89483 | 964 (aref (setq category-set |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
965 (char-category-set (char-after (1- pos)))) ?H)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
966 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
967 (setq category (cond ((aref category-set ?C) ?C) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
968 ((aref category-set ?K) ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
969 ((aref category-set ?A) ?A))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
970 (when category |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
971 (setq pos (1- pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
972 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
973 (aref (char-category-set (char-after (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
974 category)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
975 (setq pos (1- pos))))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
976 pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
977 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
978 (map-char-table |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
979 #'(lambda (char script) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
980 (cond ((eq script 'han) |
89483 | 981 (set-char-table-range find-word-boundary-function-table |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
982 char #'next-word-boundary-han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
983 ((eq script 'kana) |
89483 | 984 (set-char-table-range find-word-boundary-function-table |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
985 char #'next-word-boundary-kana)))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
986 char-script-table) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
987 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
988 (setq word-combining-categories |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
989 '((?l . ?l))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
990 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
991 (setq word-separating-categories ; (2-byte character sets) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
992 '((?A . ?K) ; Alpha numeric - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
993 (?A . ?C) ; Alpha numeric - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
994 (?H . ?A) ; Hiragana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
995 (?H . ?K) ; Hiragana - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
996 (?H . ?C) ; Hiragana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
997 (?K . ?A) ; Katakana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
998 (?K . ?C) ; Katakana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
999 (?C . ?A) ; Chinese - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1000 (?C . ?K) ; Chinese - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1001 )) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1002 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1003 ;;; Local Variables: |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
1004 ;;; coding: utf-8-emacs |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1005 ;;; End: |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1006 |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
1007 ;;; characters.el ends here |