svn commit: r196067 - user/edwin/locale/tools
Edwin Groothuis
edwin at FreeBSD.org
Wed Aug 5 07:44:31 UTC 2009
Author: edwin
Date: Wed Aug 5 07:44:30 2009
New Revision: 196067
URL: http://svn.freebsd.org/changeset/base/196067
Log:
show what a certain unicode name can be
Added:
user/edwin/locale/tools/whatis.pl (contents, props changed)
Added: user/edwin/locale/tools/whatis.pl
==============================================================================
--- /dev/null 00:00:00 1970 (empty, because file is newly added)
+++ user/edwin/locale/tools/whatis.pl Wed Aug 5 07:44:30 2009 (r196067)
@@ -0,0 +1,29 @@
+#!/bin/sh
+
+UNIDATA=/home/edwin/unicode/UNIDATA/5.2.0/UnicodeData.txt
+CHARMAPS=/home/edwin/svn/edwin/locale/tools/charmaps
+UTF8=~/unicode/cldr/1.7.1/posix/UTF-8.cm
+
+if [ -z "$1" ]; then
+ echo "Usage: $0 <unicode string>"
+ exit
+fi
+
+UCS=$*
+UCS_=$(echo $* | sed -e 's/ /./g')
+echo UCS: ${UCS}
+
+echo UTF-8.cm:
+grep "${UCS_}" ${UTF8} | sed -e 's/ */ /g'
+
+echo UNIDATA:
+grep "${UCS_}" ${UNIDATA}
+L=$(grep "${UCS_}" ${UNIDATA})
+
+echo UCC:
+grep "${UCS_}" ${UNIDATA} | awk -F\; '{ print $1 }'
+
+
+echo CHARMAPS:
+grep ${UCS_} ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"
+grep ${UCC} ${CHARMAPS}/* | sed -e "s|${CHARMAPS}/||g"
More information about the svn-src-user
mailing list