Mercurial > emacs
changeset 103578:63a1307441f9
For the UNICODE format files, do reverse sort
and don't compact the map. This is to prefer the first one in the
duplicated mappings (e.g. 0x20->U+0020, 0x20->U+00A0).
author | Kenichi Handa <handa@m17n.org> |
---|---|
date | Wed, 24 Jun 2009 13:02:50 +0000 |
parents | 520b069d6504 |
children | ddf4d6535beb |
files | admin/charsets/mapconv |
diffstat | 1 files changed, 4 insertions(+), 2 deletions(-) [+] |
line wrap: on
line diff
--- a/admin/charsets/mapconv Wed Jun 24 05:12:51 2009 +0000 +++ b/admin/charsets/mapconv Wed Jun 24 13:02:50 2009 +0000 @@ -30,7 +30,7 @@ # $1: source map file # $2: address pattern for sed (optionally with substitution command) # $3: format of source map file -# GLIBC-1 GLIBC-2 GLIBC-2-7 CZYBORRA IANA UNICODE YASUOKA +# GLIBC-1 GLIBC-2 GLIBC-2-7 CZYBORRA IANA UNICODE UNICODE2 YASUOKA # $4: awk script FILE="admin/charsets/$1" @@ -115,9 +115,11 @@ elif [ "$3" = "UNICODE" ] ; then # Source format is: # YYYY XX + # We perform reverse sort to prefer the first one in the + # duplicated mappings (e.g. 0x20->U+0020, 0x20->U+00A0). zcat $1 | sed -n -e "$2 p" \ | sed -e 's/\([0-9A-F]*\)[^0-9A-F]*\([0-9A-F]*\).*/0x\2 0x\1/' \ - | sort | ${AWKPROG} + | sort -r elif [ "$3" = "UNICODE2" ] ; then # Source format is: # 0xXXXX 0xYYYY # ...