16f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Copyright (C) 2004-2013, International Business Machines 26f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Corporation and others. All Rights Reserved. 36f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 46f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file name: changes.txt 56f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* encoding: US-ASCII 66f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* tab size: 8 (not used) 76f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* indentation:4 86f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 96f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* created on: 2004may06 106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* created by: Markus W. Scherer 116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* change log for Unicode updates 136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.3 update 176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri249/ -- beta review 196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/uax-proposed-updates.html 206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/versions/beta-6.3.0.html#notable_issues 216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr44/tr44-11.html 226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac 246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10128: update ICU to Unicode 6.3 beta 266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10168: update ICU to Unicode 6.3 final 276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni63 at r33552 from trunk at r33551 286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni63 at r33550 from trunk at r33553 296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10142: implement Unicode 6.3 bidi algorithm additions 316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (configure.in & configure: have been modified to extract the version from uchar.h) 366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo 376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.dev.test.lang.UCharacterTest.VERSION_ 386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Run ICU4C "configure" _after_ updating the Unicode version number in uchar.h 406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org so that the makefiles see the new version number. 416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download UCD, UCA & IDNA files 476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py 486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org includes a copy of the latest IdnaMappingTable.txt (can be in some subfolder) 496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- modify preparseucd.py: 506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org parse new file BidiBrackets.txt 516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org with new properties bpb=Bidi_Paired_Bracket and bpt=Bidi_Paired_Bracket_Type 526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/src/unicode$ py/preparseucd.py ~/unidata/uni63/20130425 ~/svn.icu/uni63/src ~/svn.icutools/trunk/src 536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders. 546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines; 556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out. 566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyAliases.txt changes 586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Enumerated Property 596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bpt ; Bidi_Paired_Bracket_Type 606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UProperty.java & UCharacter.BidiPairedBracketType 616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> ubidi_props.h & .c & UBiDiProps.java 626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> remember to write the max value at UBIDI_MAX_VALUES_INDEX 636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uprops.cpp 646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> change ubidi.icu format version from 2.0 to 2.1 656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Miscellaneous Property 666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bpb ; Bidi_Paired_Bracket 676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UProperty.java 686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> ppucd.h & .cpp 696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes 716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 Bidi_Paired_Bracket_Type (bpt) values: 726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bpt; c ; Close 736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bpt; n ; None 746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bpt; o ; Open 756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.BidiPairedBracketType 766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> ubidi_props.h & .c & UBiDiProps.java 776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> change ubidi.icu format version from 2.0 to 2.1 786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 4 new Bidi_Class (bc) values: 796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bc ; FSI ; First_Strong_Isolate 806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bc ; LRI ; Left_To_Right_Isolate 816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bc ; RLI ; Right_To_Left_Isolate 826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org bc ; PDI ; Pop_Directional_Isolate 836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacterEnums.ECharacterDirection 846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> until the bidi code gets updated, 856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Roozbeh suggests mapping the new bc values to ON (Other_Neutral) 866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 new Word_Break (WB) values: 876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; HL ; Hebrew_Letter 886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; SQ ; Single_Quote 896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; DQ ; Double_Quote 906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.WordBreak 916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> first time Word_Break numeric constants exceed 4 bits (now 17 values) 926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html 936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added 2012-10-16) 946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Aghb 239 Caucasian Albanian 956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Mahj 314 Mahajani 966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uscript.h 976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> com.ibm.icu.lang.UScript 986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find USCRIPT_([^ ]+) *= ([0-9]+),(.+) 996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final int \1 = \2;\3 1006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> preparseucd.py _scripts_only_in_iso15924 1016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI() 1026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 1036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> update Script metadata: SCRIPT_PROPS[] in uscript_props.cpp & UScript.ScriptMetadata 1046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (not strictly necessary for NOT_ENCODED scripts) 1056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files 1076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ export LD_LIBRARY_PATH=~/svn.icu/uni63/dbg/lib 1086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ SRC_DATA_IN=~/svn.icu/uni63/src/source/data/in 1096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ UNIDATA=~/svn.icu/uni63/src/source/data/unidata 1106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm -s $UNIDATA/norm2 nfc.txt 1116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt 1126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt 1136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm -s $UNIDATA/norm2 nfc.txt uts46.txt 1146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install) 1166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org so that the tools build can pick up the new definitions from the installed header files. 1176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/uni63/dbg$ echo;echo;make -j5 install > out.txt 2>&1 ; tail -n 20 out.txt 1196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make 1216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/src/unicode/c/icudefs.txt: 1236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Location (--prefix) of where ICU was installed. 1256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgset(ICU_INST_DIR /home/mscherer/svn.icu/uni63/inst) 1266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Location of the ICU source tree. 1276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgset(ICU_SRC_DIR /home/mscherer/svn.icu/uni63/src) 1286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/dbg/unicode/c$ cmake ../../../src/unicode/c 1306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/dbg/unicode/c$ make 1316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files 1336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/dbg/unicode/c$ genprops/genprops ~/svn.icu/uni63/src 1346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/dbg/unicode/c$ genuca/genuca -i ~/svn.icu/uni63/dbg/data/out/build/icudt52l ~/svn.icu/uni63/src 1356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools 1366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca again (see step above) so that it picks up the new case mappings and nfc.nrm 1376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools 1386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to 1406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar) 1416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters 1426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.3: U+2260, U+226E, U+226F 1436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.3, no test file to update 1446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files 1466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe 1476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt 1486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j 1496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 1506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org output: 1516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ... 1526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode .icu files built to ./out/build/icudt52l 1536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt52b 1546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt52b 1556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt 1566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH ../bin/icupkg ./out/tmp/icudt52l.dat ./out/icu4j/icudt52b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt52l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt52b 1576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mv ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt52b" 1586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt52b/ 1596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 1606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data 1616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt52b/ 1626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 1636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data 1646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org make[1]: Leaving directory `/home/mscherer/svn.icu/uni63/dbg/data' 1656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location, 1666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org separate from the other data files 1676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll 1686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/brkitr 1696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt52b 1706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/cnvalias.icu 1716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt52b 1726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll 1736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/brkitr 1746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J 1756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt52b 1766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files 1786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode 1796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA -- mostly skipped for ICU 52 / Unicode 6.3, except update coll/* files 1816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in http://www.unicode.org/Public/UCA/<beta version>/ 1836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CLDR root files for ICU are in CollationAuxiliary.zip; unpack that 1846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt 1856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt 1866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (note removing the underscore before "Rules") 1876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU4C)/source/test/testdata/CollationTest_*.txt 1886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt 1896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt) 1906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines; 1916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out 1926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani 1936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca, see command line above 1946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C 1956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data: 1966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (subset of instructions above for properties data refresh, except copies all coll/*) 1976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 1986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll 1996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll 2006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni63/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt52b 2016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging) 2026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then 2036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org utility/MultithreadTest/TestCollators will fail as well; 2046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix the conformance test before looking into the multi-thread test 2056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU, fix test code where necessary 2076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C 2096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 2106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data 2116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor 2126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install 2136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 2156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- skipped for Unicode 6.3: no new scripts 2166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk 2186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar, 2196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org instead rebuild them from merged & tested ICU4C 2206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 2226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.2 update 2246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri230/ 2266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/versions/beta-6.2.0.html 2276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr44/tr44-9.html#Unicode_6.2.0 2286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri227/ Changes to Script Extensions Property Values 2296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri228/ Changing some common characters from Punctuation to Symbol 2306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri229/ Linebreaking Changes for Pictographic Symbols 2316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr46/tr46-8.html IDNA 2326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://unicode.org/Public/idna/6.2.0/ 2336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac 2356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9515: Unicode 6.2: final ICU update 2376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9514: UCA 6.2: fix UCARules.txt 2396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9437: update ICU to Unicode 6.2 2416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni62 at r32050 from trunk at r32041 2426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni62 at r32068 from trunk at r32066 2436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 2456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 2466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 2476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (configure.in & configure: have been modified to extract the version from uchar.h) 2486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo 2496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.dev.test.lang.UCharacterTest.VERSION_ 2506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 2526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 2546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download UCD, UCA & IDNA files 2566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py 2576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org includes a copy of the latest IdnaMappingTable.txt (can be in some subfolder) 2586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- modify preparseucd.py: NamesList.txt is now in UTF-8 2596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ py/preparseucd.py ~/uni62/20120816 ~/svn.icu/uni62/src ~/svn.icu/tools/trunk/src 2606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders. 2616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines; 2626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out. 2636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes 2656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Line_Break (lb) value: 2666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org lb ; RI ; Regional_Indicator 2676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.LineBreak 2686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Word_Break (WB) value: 2696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; RI ; Regional_Indicator 2706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.WordBreak 2716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Grapheme_Cluster_Break (GCB) value: 2726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org GCB; RI ; Regional_Indicator 2736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.GraphemeClusterBreak 2746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 3 new numeric values 2766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org The new value -1, which was really supposed to be NaN but that would have required 2776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new UnicodeData.txt syntax, can already be represented as a "fraction" of -1/1, 2786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org but encodeNumericValue() in corepropsbuilder.cpp had to be fixed. 2796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp;12456;na=CUNEIFORM NUMERIC SIGN NIGIDAMIN;nv=-1 2806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp;12457;na=CUNEIFORM NUMERIC SIGN NIGIDAESH;nv=-1 2816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org The two new values 216000 and 432000 require an addition to the encoding of numeric values. 2826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp;12432;na=CUNEIFORM NUMERIC SIGN SHAR2 TIMES GAL PLUS DISH;nv=216000 2836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp;12433;na=CUNEIFORM NUMERIC SIGN SHAR2 TIMES GAL PLUS MIN;nv=432000 2846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uprops.h, uchar.c & UCharacterProperty.java 2856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> cucdtst.c & UCharacterTest.java 2866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files 2886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ export LD_LIBRARY_PATH=~/svn.icu/uni62/dbg/lib 2896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ SRC_DATA_IN=~/svn.icu/uni62/src/source/data/in 2906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ UNIDATA=~/svn.icu/uni62/src/source/data/unidata 2916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm -s $UNIDATA/norm2 nfc.txt 2926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt 2936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt 2946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm -s $UNIDATA/norm2 nfc.txt uts46.txt 2956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install) 2976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org so that the tools build can pick up the new definitions from the installed header files. 2986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make 2996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files 3016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genprops/genprops ~/svn.icu/uni62/src 3026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- in initial bootstrapping, change the UCA version 3036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in source/data/unidata/FractionalUCA.txt to match the new Unicode version 3046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genuca/genuca -i ~/svn.icu/uni62/dbg/data/out/build/icudt50l ~/svn.icu/uni62/src 3056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools 3066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + if genrb fails to build coll/root.res with an U_INVALID_FORMAT_ERROR, 3076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org check if the UCA version in FractionalUCA.txt matches the new Unicode version 3086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (see step above) 3096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca again (see step above) so that it picks up the new case mappings and nfc.nrm 3106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools 3116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to 3136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar) 3146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters 3156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.2: U+2260, U+226E, U+226F 3166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.2, no test file to update 3176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files 3196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe 3206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt 3216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j 3226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 3236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org output: 3246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ... 3256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode .icu files built to ./out/build/icudt50l 3266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt50b 3276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt50b 3286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt 3296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH ../bin/icupkg ./out/tmp/icudt50l.dat ./out/icu4j/icudt50b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt50l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt50b 3306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mv ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt50b" 3316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt50b/ 3326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 3336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data 3346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt50b/ 3356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 3366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data 3376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org make[1]: Leaving directory `/home/mscherer/svn.icu/uni62/dbg/data' 3386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location, 3396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org separate from the other data files 3406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll 3416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/brkitr 3426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt50b 3436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/cnvalias.icu 3446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt50b 3456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll 3466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/brkitr 3476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J 3486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt50b 3496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files 3516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode 3526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA 3546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in http://www.unicode.org/Public/UCA/<beta version>/ 3566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CLDR root files for ICU are in CollationAuxiliary.zip; unpack that 3576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt 3586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt 3596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (note removing the underscore before "Rules") 3606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU4C)/source/test/testdata/CollationTest_*.txt 3616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt 3626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt) 3636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines; 3646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out 3656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani 3666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca, see command line above 3676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C 3686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data: 3696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (subset of instructions above for properties data refresh, except copies all coll/*) 3706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 3716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/bld$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll 3726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll 3736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/uni62/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt50b 3746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging) 3756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then 3766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org utility/MultithreadTest/TestCollators will fail as well; 3776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix the conformance test before looking into the multi-thread test 3786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU, fix test code where necessary 3806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C 3826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 3836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data 3846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor 3856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install 3866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 3886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- skipped for Unicode 6.2: no new scripts 3896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk 3916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar, 3926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org instead rebuild them from merged & tested ICU4C 3936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 3956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgFuture Unicode update 3976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 3986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgTools simplified since the Unicode 6.1 update. See 3996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- http://site.icu-project.org/design/props/ppucd 4006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- http://bugs.icu-project.org/trac/wiki/Markus/ReviewTicket8972 4016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Unicode version numbers 4036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- icutools/unicode/makedefs.sh was deleted, so one fewer place for version & path updates 4046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 4066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdcopy.py, idna2nrm.py and genpname/preparse.pl replaced by preparseucd.py: 4076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ py/preparseucd.py ~/uni61/20120118 ~/svn.icu/trunk/src ~/svn.icu/tools/trunk/src 4086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders. 4096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines; 4106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out. 4116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes 4136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Script codes that are in ISO 15924 but not in Unicode are now listed in 4146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org preparseucd.py, in the _scripts_only_in_iso15924 variable. 4156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org If there are new ISO codes, then add them. 4166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org If Unicode adds some of them, then remove them from the .py variable. 4176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes 4196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- No more manual changes for CJK ranges for algorithmic names; 4206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org those are now written to ppucd.txt and genprops reads them from there. 4216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files (makeprops.sh was deleted) 4236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genprops/genprops ~/svn.icu/trunk/src 4246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* no more manual updates of source/data/unidata/norm2/nfkc_cf.txt 4266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- it is now generated by preparseucd.py 4276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* no more separate idna2nrm.py run and manual copying to generate source/data/unidata/norm2/uts46.txt 4296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- it is now generated by preparseucd.py 4306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py 4316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org includes a copy of http://www.unicode.org/Public/idna/6.1.0/IdnaMappingTable.txt 4326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (can be in some subfolder) 4336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files 4356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ export LD_LIBRARY_PATH=~/svn.icu/trunk/dbg/lib 4366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ SRC_DATA_IN=~/svn.icu/trunk/src/source/data/in 4376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ UNIDATA=~/svn.icu/trunk/src/source/data/unidata 4386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm -s $UNIDATA/norm2 nfc.txt 4396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt 4406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt 4416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm -s $UNIDATA/norm2 nfc.txt uts46.txt 4426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install) 4446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make 4456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* new way to call genuca (makeuca.sh was deleted) 4476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genuca/genuca -i ~/svn.icu/trunk/dbg/data/out/build/icudt49l ~/svn.icu/trunk/src 4486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 4506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.1 update 4526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac 4546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8995 final update to Unicode 6.1 4566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8994 regenerate source/layout/CanonData.cpp 4576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8961 support Unicode "Age" value *names* 4596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8963 support multiple character name aliases & types 4606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8827 "update ICU to Unicode 6.1" 4626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni61 at r30864 from trunk at r30843 4636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni61 at r30865 from trunk at r30863 4646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 4666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 4676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 4686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (configure.in & configure: have been modified to extract the version from uchar.h) 4696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo 4706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- icutools/unicode/makedefs.sh 4716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + also review & update other definitions in that file, 4726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org e.g. the ICU version in this path: BLD_DATA_FILES=$ICU_BLD/data/out/build/icudt49l 4736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 4756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 4776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/tools/trunk/src/unicode/c/genprops/misc$ ./ucdcopy.py ~/uni61/20111205/ucd ~/uni61/processed 4796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This prepares both unidata and testdata files in respective output subfolders. 4806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines; 4816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out. 4826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 4836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes 4846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 11 new block names: 4856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Arabic_Extended_A 4866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Arabic_Mathematical_Alphabetic_Symbols 4876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Chakma 4886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Meetei_Mayek_Extensions 4896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Meroitic_Cursive 4906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Meroitic_Hieroglyphs 4916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Miao 4926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Sharada 4936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Sora_Sompeng 4946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Sundanese_Supplement 4956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Takri 4966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to uchar.h 4976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to UCharacter.UnicodeBlock IDs 4986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Eclipse find UBLOCK_([^ ]+) = ([0-9]+), (/.+) 4996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final int \1_ID = \2; \3 5006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to UCharacter.UnicodeBlock objects 5016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Eclipse find UBLOCK_([^ ]+) = [0-9]+, (/.+) 5026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2 5036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Joining_Group (jg) value: 5046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Rohingya_Yeh 5056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.JoiningGroup 5066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new Line_Break (lb) values: 5076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CJ=Conditional_Japanese_Starter 5086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org HL=Hebrew_Letter 5096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.LineBreak 5106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 7 new scripts: 5116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Cakm ; Chakma 5126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Merc ; Meroitic_Cursive 5136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Mero ; Meroitic_Hieroglyphs 5146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Plrd ; Miao 5156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Shrd ; Sharada 5166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Sora ; Sora_Sompeng 5176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Takr ; Takri 5186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> remove these from SyntheticPropertyValueAliases.txt 5196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> fix expectedLong names in cucdapi.c/TestUScriptCodeAPI() 5206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 5216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html 5226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added 2011-06-21) 5236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Khoj 322 Khojki 5246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Tirh 326 Tirhuta 5256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and another one added 2011-12-09 5266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Hluw 080 Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) 5276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uscript.h 5286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> com.ibm.icu.lang.UScript 5296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find USCRIPT_([^ ]+) *= ([0-9]+),(.+) 5306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final int \1 = \2;\3 5316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> SyntheticPropertyValueAliases.txt 5326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI() 5336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 5346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes 5366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- the last Unihan code point changes from U+9FCB to U+9FCC 5376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org search for both 9FCB (end) and 9FCC (limit) (regex 9FC[BC], case-insensitive) 5386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change gennames.c 5396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change swapCJK() in ucol.cpp & ImplicitCEGenerator.java 5406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* DerivedBidiClass.txt changes 5426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new default-AL blocks: 5436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Arabic Extended-A: U+08A0 - U+08FF (was default-R) 5446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Arabic Mathematical Alphabetic Symbols: 5456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# U+1EE00 - U+1EEFF (was default-R) 5466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new default-R blocks: 5476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Meroitic Hieroglyphs: 5486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# U+10980 - U+1099F 5496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Meroitic Cursive: U+109A0 - U+109FF 5506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> should be picked up by the explicit data in the file 5516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* NameAliases.txt changes 5536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- from 5546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # Each line has two fields 5556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # First field: Code point 5566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # Second field: Alias 5576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- to 5586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # Each line has three fields, as described here: 5596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # 5606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # First field: Code point 5616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # Second field: Alias 5626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org # Third field: Type 5636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Also, the file previously allowed multiple aliases but only now does it 5646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org actually provide multiple, even multiple of the same type. For example, 5656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org FEFF;BYTE ORDER MARK;alternate 5666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org FEFF;BOM;abbreviation 5676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org FEFF;ZWNBSP;abbreviation 5686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This breaks our gennames parser, unames.icu data structure, and API. 5696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Fix gennames to only pick up "correction" aliases. 5706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org New ticket #8963 for further changes. 5716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux) 5736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname 5746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 5756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl ~/svn.icu/trunk/src > out.txt 5766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl shows no errors, out.txt Info and Warning lines look ok 5776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install) 5796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org so that the tools build can pick up the new definitions from the installed header files. 5806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools (at least genpname) using CMake+make 5816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname 5836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (builds both pnames.icu and propname_data.h) 5846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in 5856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/common --csource 5866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install) 5886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make 5896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/nfkc_cf.txt 5916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- follow the instructions in nfkc_cf.txt for updating it from DerivedNormalizationProps.txt 5926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 5936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/uts46.txt 5946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download http://www.unicode.org/Public/idna/6.1.0/IdnaMappingTable.txt 5956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org to ~/svn.icu/tools/trunk/src/unicode/py 5966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- adjust idna2nrm.py to remove "; NV8": For UTS #46, we do not care about "not valid in IDNA2008". 5976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ ./idna2nrm.py 5986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ cp uts46.txt ~/svn.icu/trunk/src/source/data/unidata/norm2 5996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to 6016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar) 6026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters 6036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.1: U+2260, U+226E, U+226F 6046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.1, no test file to update 6056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files 6076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- in initial bootstrapping, change the UCA version 6086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in source/data/unidata/FractionalUCA.txt to match the new Unicode version 6096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ ./makeprops.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 6106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 6116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + if genrb fails to build coll/root.res with an U_INVALID_FORMAT_ERROR, 6126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org check if the UCA version in FractionalUCA.txt matches the new Unicode version 6136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (see step above) 6146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh so that genuca picks up the new case mappings and nfc.nrm: 6156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 6166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 6176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files 6196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe 6206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt 6216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j 6226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 6236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org output: 6246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ... 6256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode .icu files built to ./out/build/icudt49l 6266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt49b 6276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt49b 6286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt 6296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH ../bin/icupkg ./out/tmp/icudt49l.dat ./out/icu4j/icudt49b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt49l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt49b 6306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mv ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt49b" 6316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt49b/ 6326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 6336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data 6346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt49b/ 6356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 6366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data 6376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org make[1]: Leaving directory `/home/mscherer/svn.icu/trunk/bld/data' 6386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location, 6396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org separate from the other data files 6406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll 6416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/brkitr 6426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt49b 6436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/cnvalias.icu 6446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt49b 6456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll 6466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/brkitr 6476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J 6486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt49b 6496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files 6516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode 6526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU so far, fix test code where necessary 6546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- temporarily ignore collation issues that look like UCA/UCD mismatches, 6556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org until UCA data is updated 6566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA 6586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in 6606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/Public/UCA/6.1.0/CollationAuxiliary-<dev. version>.txt 6616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt 6626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt 6636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (note removing the underscore before "Rules") 6646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU)/source/test/testdata/CollationTest_*.txt 6656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt 6666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt) 6676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines; 6686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org probably need to keep those commented out 6696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani 6706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh: 6716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 6726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C 6736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data: 6746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (subset of instructions above for properties data refresh, except copies all coll/*) 6756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 6766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll 6776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll 6786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt49b 6796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging) 6806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then 6816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org utility/MultithreadTest/TestCollators will fail as well; 6826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix the conformance test before looking into the multi-thread test 6836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C 6856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 6866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data 6876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor 6886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install 6896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 6916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For details see the Unicode 5.2 change log below.) 6936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run icu4j-tools: com.ibm.icu.dev.tool.layout.ScriptNameBuilder. 6956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org This generates LEScripts.h, LELanguages.h, ScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp 6966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in the working directory. 6976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (It also generates ScriptRunData.cpp, which is no longer needed.) 6986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 6996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org The generated files have a current copyright date and "@draft" statement. 7006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- diff current <icu>/source/layout files vs. generated ones 7026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu4j/trunk/src$ kdiff3 ~/svn.icu/trunk/src/source/layout tools/misc/src/com/ibm/icu/dev/tool/layout 7036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org review and manually merge desired changes; 7046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix gratuitous changes, incorrect @draft and missing aliases; 7056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode-derived script codes should be "born stable" like constants in uchar.h, uscript.h etc. 7066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- if you just copy the above files, then 7076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix mixed line endings, review the diffs as above and restore changes to API tags etc.; 7086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h 7096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk 7116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar, 7126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org instead rebuild them from merged & tested ICU4C 7136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 7156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU 4.8 (no Unicode update, just new script codes) 7176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 9 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html 7196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added 2010-12-21) 7206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Afak 439 Afaka 7216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Jurc 510 Jurchen 7226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Mroo 199 Mro, Mru 7236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Nshu 499 Nüshu 7246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Shrd 319 Sharada, Śāradā 7256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Sora 398 Sora Sompeng 7266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Takr 321 Takri, Ṭākrī, Ṭāṅkrī 7276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Tang 520 Tangut 7286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Wole 480 Woleai 7296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uscript.h 7306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> com.ibm.icu.lang.UScript 7316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find USCRIPT_([^ ]+) *= ([0-9]+),(.+) 7326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final int \1 = \2;\3 7336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> genpname/SyntheticPropertyValueAliases.txt 7346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI() 7356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 7366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux) 7386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname 7396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 7406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl ~/svn.icu/trunk/src > out.txt 7416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl shows no errors, out.txt Info and Warning lines look ok 7426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild Unicode tools (at least genpname) using make 7446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- You might first need to "make install" ICU so that the tools build can pick 7456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org up the new definitions from the installed header files. 7466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname 7486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (builds both pnames.icu and propname_data.h) 7496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in 7506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/common --csource 7516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 7526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genprops 7546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genprops/genprops -d ~/svn.icu/trunk/src/source/data/in -s ~/svn.icu/trunk/src/source/data/unidata -i ~/svn.icu/trunk/dbg/data/out/build/icudt48l -u 6.0 7556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genprops/genprops -d ~/svn.icu/trunk/src/source/common --csource -s ~/svn.icu/trunk/src/source/data/unidata -i ~/svn.icu/trunk/dbg/data/out/build/icudt48l -u 6.0 7566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 7576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files 7596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe 7606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt 7616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j 7626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 7636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location, 7646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org separate from the other data files 7656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt48b 7666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt48b/pnames.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt48b 7676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt48b/uprops.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt48b 7686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J 7696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt48b 7706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* should have updated the layout engine script codes but forgot 7726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 7746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.0 update 7766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets 7786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7264 Unicode 6.0 Update 7806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 7826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 7836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 7846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (configure.in & configure: have been modified to extract the version from uchar.h) 7856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo 7866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 7886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 7906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/tools/trunk/src/unicode/c/genprops/misc$ ./ucdcopy.py ~/uni60/20100720/ucd ~/uni60/processed 7926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This now prepares both unidata and testdata files in respective output subfolders. 7936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 7946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyAliases.txt changes 7956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Script_Extensions property defined in the new ScriptExtensions.txt file 7966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org but not listed in PropertyAliases.txt; reported to unicode.org; 7976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> added to tools/trunk/src/unicode/c/genpname/SyntheticPropertyAliases.txt 7986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org scx; Script_Extensions 7996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h with new UProperty section 8006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> com.ibm.icu.lang.UProperty, parallel with uchar.h 8016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes 8036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 12 new block names: 8046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Alchemical_Symbols 8056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Bamum_Supplement 8066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Batak 8076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Brahmi 8086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CJK_Unified_Ideographs_Extension_D 8096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Emoticons 8106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Ethiopic_Extended_A 8116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Kana_Supplement 8126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Mandaic 8136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Miscellaneous_Symbols_And_Pictographs 8146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Playing_Cards 8156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Transport_And_Map_Symbols 8166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to uchar.h 8176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to UCharacter.UnicodeBlock 8186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Eclipse find UBLOCK_([^ ]+) = [0-9]+, (/.+) 8196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2 8206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Joining_Group (jg) values: 8216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Teh_Marbuta_Goal becomes the new canonical value for the old Hamza_On_Heh_Goal which becomes an alias 8226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uchar.h & UCharacter.JoiningGroup 8236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 new scripts: 8246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Batk ; Batak 8256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Brah ; Brahmi 8266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Mand ; Mandaic 8276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> remove these from SyntheticPropertyValueAliases.txt 8286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add alias USCRIPT_MANDAIC to USCRIPT_MANDAEAN 8296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> fix expectedLong names in cucdapi.c/TestUScriptCodeAPI() 8306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 8316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 13 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html 8326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added 2009-11-11..2010-07-18) 8336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Bass 259 Bassa Vah 8346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Dupl 755 Duployan shortand 8356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Elba 226 Elbasan 8366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Gran 343 Grantha 8376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Kpel 436 Kpelle 8386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Loma 437 Loma 8396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Mend 438 Mende 8406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Merc 101 Meroitic Cursive 8416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Narb 106 Old North Arabian 8426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Nbat 159 Nabataean 8436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Palm 126 Palmyrene 8446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Sind 318 Sindhi 8456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Wara 262 Warang Citi 8466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> uscript.h 8476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> com.ibm.icu.lang.UScript 8486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find USCRIPT_([^ ]+) *= ([0-9]+),(.+) 8496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace public static final int \1 = \2;\3 8506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> SyntheticPropertyValueAliases.txt 8516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI() 8526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in com.ibm.icu.dev.test.lang.TestUScript.java 8536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ISO 15924 name change 8546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Mero 100 Meroitic Hieroglyphs (was Meroitic) 8556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add new alias USCRIPT_MEROITIC_HIEROGLYPHS to USCRIPT_MEROITIC 8566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- property value alias added for Cham, was already moved out of SyntheticPropertyValueAliases.txt 8576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes 8596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new CJK block: 8606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2B740;<CJK Ideograph Extension D, First>;Lo;0;L;;;;;N;;;;; 8616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2B81D;<CJK Ideograph Extension D, Last>;Lo;0;L;;;;;N;;;;; 8626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> add to tools/trunk/src/unicode/c/gennames/gennames.c, with new ucdVersion 8636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make 8656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux) 8676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname 8686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 8696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl ~/svn.icu/trunk/src > out.txt 8706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl shows no errors, out.txt Info and Warning lines look ok 8716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild Unicode tools (at least genpname) using make 8736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- You might first need to "make install" ICU so that the tools build can pick 8746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org up the new definitions from the installed header files. 8756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname 8776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode$ c/genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in 8786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 8796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/nfkc_cf.txt 8816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- follow the instructions in nfkc_cf.txt for updating it from DerivedNormalizationProps.txt 8826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/uts46.txt 8846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download http://www.unicode.org/Public/idna/6.0.0/IdnaMappingTable.txt 8856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org to ~/svn.icu/tools/trunk/src/unicode/py 8866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- adjust idna2nrm.py to handle new disallowed_STD3_valid and disallowed_STD3_mapped values 8876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ ./idna2nrm.py 8886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ cp uts46.txt ~/svn.icu/trunk/src/source/data/unidata/norm2 8896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to 8916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar) 8926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters 8936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0: U+2260, U+226E, U+226F 8946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 8956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files 8966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ ./makeprops.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 8976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 8986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh so that genuca picks up the new nfc.nrm: 8996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 9006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools 9016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* implement new Script_Extensions property (provisional) 9036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- parser & generator: genprops & uprops.icu 9046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uscript.h, uprops.h, uchar.c, uniset_props.cpp and others, plus cintltst/cucdapi.c & intltest/usettest.cpp 9056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UScript.java, UCharacterProperty.java, UnicodeSet.java, TestUScript.java, UnicodeSetTest.java 9066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* switch ubidi.icu, ucase.icu and uprops.icu from UTrie to UTrie2 9086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- (one-time change) 9096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genbidi/gencase/genprops tools changes 9106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- re-run makeprops.sh (see above) 9116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UCharacterProperty.java, UCharacterTypeIterator.java, 9126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org UBiDiProps.java, UCaseProps.java, and several others with minor changes; 9136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org UCharacterPropertyReader.java deleted and its code folded into UCharacterProperty.java 9146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files 9166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe 9176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt 9186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j 9196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 9206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org output: 9216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ... 9226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode .icu files built to ./out/build/icudt45l 9236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt45b 9246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org echo ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt 9256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH ../bin/icupkg ./out/tmp/icudt45l.dat ./out/icu4j/icudt45b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt45l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt45b 9266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt45b 9276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/main/shared/data 9286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data 9296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location, 9306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org separate from the other data files 9316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll 9326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/brkitr 9336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt45b 9346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/cnvalias.icu 9356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt45b 9366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll 9376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/brkitr 9386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J 9396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt45b 9406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files 9426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode 9436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* un-hardcode normalization skippable (NF*_Inert) test data 9456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- removes one manual step from the Unicode upgrade, and removes dependency on one of Mark's tools 9466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy updated break iterator test files 9486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- now handled by early ucdcopy.py and 9496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org copying the uni60/processed/testdata files to ~/svn.icu/trunk/src/source/test/testdata 9506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (old instructions: 9516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org copy from (Unicode 6.0)/ucd/auxiliary/*BreakTest-6....txt 9526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org to ~/svn.icu/trunk/src/source/test/testdata) 9536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- they are not used in ICU4J 9546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA 9566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in 9586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/~book/incoming/mark/uca6.0.0/ 9596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.macchiato.com/unicode/utc/additional-uca-files 9606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/Public/UCA/6.0.0/ 9616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/~mdavis/uca/ 9626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt 9636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt 9646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update Han-implicit ranges for new CJK extensions: 9656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org swapCJK() in ucol.cpp & ImplicitCEGenerator.java 9666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genuca: allow bytes 02 for U+FFFE, new merge-sort character; 9676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org do not add it into invuca so that tailoring primary-after an ignorable works 9686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genuca: permit space between [variable top] bytes 9696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucol.cpp: treat noncharacters like unassigned rather than ignorable 9706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh: 9716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld 9726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C 9736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data: 9746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (subset of instructions above for properties data refresh, except copies all coll/*) 9756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 9766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll 9776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll 9786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt45b 9796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU)/source/test/testdata/CollationTest_*.txt 9806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt 9816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org with output from Mark's Unicode tools 9826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments) 9836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then 9846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org utility/MultithreadTest/TestCollators will fail as well; 9856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix the conformance test before looking into the multi-thread test 9866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C 9886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install 9896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data 9906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor 9916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install 9926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 9946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For details see the Unicode 5.2 change log below.) 9966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguages.h, 9986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (It also generates 9996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.) 10006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement. 10026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files. 10046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* fix mixed line endings 10056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* review the diffs and fix incorrect @draft and missing aliases; 10066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Unicode-derived script codes should be "born stable" like constants in uchar.h, uscript.h etc. 10076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h 10086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 10106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.2 update 10126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets 10146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7084 Unicode 5.2 10166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7167 verify collation bytes 10186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7235 Java test NAME_ALIAS 10196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7236 Java DerivedCoreProperties.txt test 10206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7237 Java BidiTest.txt 10216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7238 UTrie2 in core unidata 10226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7239 test for tailoring gaps 10236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7240 Java fix CollationMiscTest 10246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7243 update layout engine for Unicode 5.2 10256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 10276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 10286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 10296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in & configure 10306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update ucdVersion in gennames.c if an algorithmic range changes 10316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 10336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 10356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgpython source\tools\genprops\misc\ucdcopy.py "C:\Documents and Settings\mscherer\My Documents\unicode\ucd\5.2.0" C:\svn\icuproj\icu\trunk\source\data\unidata 10376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- includes finding files regardless of version numbers, 10386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org copying them, and performing the equivalent processing of the 10396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ucdstrip and ucdmerge tools on the desired set of files 10406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* notes on changes 10426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- PropertyAliases.txt 10436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org moved from numeric to enumerated: 10446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ccc ; Canonical_Combining_Class 10456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new string properties: 10466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org NFKC_CF ; NFKC_Casefold 10476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Name_Alias; Name_Alias 10486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new binary properties: 10496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Cased ; Cased 10506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CI ; Case_Ignorable 10516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWCF ; Changes_When_Casefolded 10526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWCM ; Changes_When_Casemapped 10536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWKCF ; Changes_When_NFKC_Casefolded 10546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWL ; Changes_When_Lowercased 10556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWT ; Changes_When_Titlecased 10566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org CWU ; Changes_When_Uppercased 10576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new CJK Unihan properties (not supported by ICU) 10586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- PropertyValueAliases.txt 10596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new block names 10606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new scripts 10616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org one script code change: 10626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Qaai ; Inherited 10636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> 10646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org sc ; Zinh ; Inherited ; Qaai 10656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new Line_Break (lb) value: 10666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org lb ; CP ; Close_Parenthesis 10676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new Joining_Group (jg) values: Farsi_Yeh, Nya 10686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org other new values: 10696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org ccc; 214; ATA ; Attached_Above 10706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- DerivedBidiClass.txt 10716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new default-R range: U+1E800 - U+1EFFF 10726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UnicodeData.txt 10736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org all of the ISO comments are gone 10746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new CJK block end: 10756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 9FC3;<CJK Ideograph, Last> -> 9FCB;<CJK Ideograph, Last> 10766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org new CJK block: 10776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2A700;<CJK Ideograph Extension C, First>;Lo;0;L;;;;;N;;;;; 10786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 2B734;<CJK Ideograph Extension C, Last>;Lo;0;L;;;;;N;;;;; 10796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 10816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl 10826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + cd \svn\icuproj\icu\trunk\source\tools\genpname 10836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 10846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl \svn\icuproj\icu\trunk > out.txt 10856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl complains with errors like the following: 10866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Error: sc:Egyp already set to Egyptian_Hieroglyphs, cannot set to Egyp at preparse.pl line 1322, <GEN6> line 34. 10876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org This is because ICU 4.0 had scripts from ISO 15924 which are now 10886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org added to Unicode 5.2, and the Perl script shows a conflict between SyntheticPropertyValueAliases.txt 10896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and PropertyValueAliases.txt. 10906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> Removed duplicate script entries from SyntheticPropertyValueAliases.txt: 10916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Egyp, Java, Lana, Mtei, Orkh, Armi, Avst, Kthi, Phli, Prti, Samr, Tavt 10926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl complains with errors about block names missing from uchar.h; add them 10936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 10946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops 10956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values 10966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 26 new blocks 10976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org copy new blocks from Blocks.txt 10986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org MS VC++ 2008 regular expression: 10996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find "^{[0-9A-F]+}\.\.{[0-9A-F]+}; {[A-Z].+}$" 11006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace with " UBLOCK_\3 = 172, /*[\1]*/" 11016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + several new script values already added in ICU 4.0 for ISO 15924 coverage 11026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (removed from SyntheticPropertyValueAliases.txt, see genpname notes above) 11036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 3 new script values added for ISO 15924 and Unicode 5.2 coverage 11046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 1 new script value added for ISO 15924 coverage (not in Unicode 5.2) 11056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added to SyntheticPropertyValueAliases.txt) 11066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) values: Farsi_Yeh, Nya 11076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Line_Break (lb) value: 11086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org lb ; CP ; Close_Parenthesis 11096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit 11116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FC3 to 9FCB 11126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org search for both 9FC3 (end) and 9FC4 (limit) (regex 9FC[34], case-insensitive) 11136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change gennames.c 11146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Compare definitions of new binary properties with what we used to use 11166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in algorithms, to see if the definitions changed. 11176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Verified that definitions for Cased and Case_Ignorable are unchanged. 11186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org The gencase tool now parses the newly public Case_Ignorable values 11196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in case the definition changes in the future. 11206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.c & uprops.h & uprops.c & genprops 11226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new numeric values that didn't exist in Unicode data before: 11236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 1/7, 1/9, 1/10, 3/10, 1/16, 3/16 11246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org the ones with denominators >9 cannot be supported by uprops.icu formatVersion 5, 11256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org therefore redesign the encoding of numeric types and values for formatVersion 6; 11266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org design for simple numbers up to at least 144 ("one gross"), 11276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org large values up to at least 10^20, 11286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and fractions with numerators -1..17 and denominators 1..16 11296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org to cover current and expected future values 11306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (e.g., more Han numeric values, Meroitic twelfths) 11316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* reimplement Hangul_Syllable_Type for new Jamo characters 11336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- the old code assumed that all Jamo characters are in the 11xx block 11346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 5.2 fills holes there and adds new Jamo characters in 11356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org A960..A97F; Hangul Jamo Extended-A 11366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and in 11376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org D7B0..D7FF; Hangul Jamo Extended-B 11386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Hangul_Syllable_Type can be trivially derived from a subset of 11396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Grapheme_Cluster_Break values 11406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data 11426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\svn\icuproj\icu\trunk\source\data>NMAKE /f makedata.mak ICUMAKE=\svn\icuproj\icu\trunk\source\data\ CFG=x86\release uni-core-data 11436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \svn\icuproj\icu\trunk\source\data\ 11456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \svn\icuproj\icu\trunk 11466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files. 11476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "brklocal.mk". Not building user-additional break iterator files. 11486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "reslocal.mk". Not building user-additional resource bundle files. 11496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "collocal.mk". Not building user-additional resource bundle files. 11506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "rbnflocal.mk". Not building user-additional resource bundle files. 11516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "trnslocal.mk". Not building user-additional transliterator files. 11526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "misclocal.mk". Not building user-additional miscellaenous files. 11536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "spreplocal.mk". Not building user-additional stringprep files. 11546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Property Names 11556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties 11566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties 11576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties 11586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization 11596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\svn\icuproj\icu\trunk\source\data\out\build\icudt43l" 11606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\svn\icuproj\icu\trunk\source\data\out\tmp" 11616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\svn\icuproj\icu\trunk\source\common 11636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and rebuild the common library 11646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** UCA 11666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update FractionalUCA.txt with new canonical closure (output from Mark's Unicode tools) 11686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt from Mark's Unicode tools 11696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/test/testdata/CollationTest_*.txt with output from Mark's Unicode tools 11706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org[ Begin obsolete instructions: 11716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Starting with UCA 5.2, we use the CollationTest_*_SHORT.txt files not the *_STUB.txt files. 11726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org - generate the source/test/testdata/CollationTest_*_STUB.txt files via source/tools/genuca/genteststub.py 11736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org on Windows: 11746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org python C:\svn\icuproj\icu\trunk\source\tools\genuca\genteststub.py CollationTest_NON_IGNORABLE_SHORT.txt CollationTest_NON_IGNORABLE_STUB.txt 11756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org python C:\svn\icuproj\icu\trunk\source\tools\genuca\genteststub.py CollationTest_SHIFTED_SHORT.txt CollationTest_SHIFTED_STUB.txt 11766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org End obsolete instructions] 11776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments) 11786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org not just the *_STUB.txt files 11796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then 11806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org utility/MultithreadTest/TestCollators will fail as well; 11816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org fix the conformance test before looking into the multi-thread test 11826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Cased & Case_Ignorable properties 11846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- via UProperty; call ucase.h functions ucase_getType() and ucase_getTypeOrIgnorable() 11856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Problem: These properties should be disjoint, but aren't 11866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UTC 2009nov decision: skip all Case_Ignorable regardless of whether they are Cased or not 11876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- change ucase.icu to be able to store any combination of Cased and Case_Ignorable 11886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Changes_When_Xyz properties 11906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- without stored data 11916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Name_Alias property 11936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add it as another name field in unames.icu 11946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make it available via u_charName() and UCharNameChoice and 11956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- consider it in u_charFromName() 11966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Break iterators 11986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 11996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update break iterator rules to new UAX versions and new property values 12006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update source/test/testdata/<boundary>Test.txt files from <unicode.org ucd>/ucd/auxiliary 12016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** new BidiTest file 12036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- review format and data 12046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy BidiTest.txt to source/test/testdata 12056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- write test code using this data 12066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix ICU code where it fails the conformance test 12076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Java 12096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- generally, find and update code corresponding to C/C++ 12106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UCharacter.UnicodeBlock constants: 12116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org a) add an _ID integer per new block, update COUNT 12126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org b) add a class instance per new block 12136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Visual Studio regex: 12146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org find UBLOCK_{[^ ]+} = [0-9]+, {/.+} 12156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org replace with public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2 12166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CHAR_NAME_ALIAS -> UCharacter.getNameAlias() and getCharFromNameAlias() 12176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- port test changes to Java 12196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 12216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For comparison, see the Unicode 5.1 update: http://bugs.icu-project.org/trac/changeset/23833) 12236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguages.h, 12256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (It also generates 12266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.) 12276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement. 12296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org-> Eric Mader wrote in email on 20090930: 12316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org "I think the tool has been modified to update @draft to @stable for 12326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org older scripts and to add @draft for new scripts. 12336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (I worked with an intern on this last year.) 12346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org You should check the output after you run it." 12356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files. 12376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* fix mixed line endings 12386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* review the diffs and fix incorrect @draft and missing aliases 12396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h 12406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp 12426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...) 12436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org-> Eric Mader wrote in email on 20090930: 12456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org "This is just a matter of making sure that all the per-script tables have 12466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org entries for any new scripts that were added. 12476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org If any new Indic characters were added, then the class tables in 12486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org IndicClassTables.cpp should be updated to reflect this. 12496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org John Emmons should know how to do this if it's required." 12506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries. 12526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Documentation 12546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Update User Guide 12556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Jamo_Short_Name, sfc->scf, binary property value aliases 12566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 12586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.1 update 12606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets 12626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org5696 Update to Unicode 5.1 12646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 12666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 12676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 12686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in & configure 12696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update ucdVersion in gennames.c if an algorithmic range changes 12706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 12726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 12746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip: 12756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedCoreProperties.txt 12766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedNormalizationProps.txt 12776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org NormalizationTest.txt 12786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org PropList.txt 12796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Scripts.txt 12806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org GraphemeBreakProperty.txt 12816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SentenceBreakProperty.txt 12826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WordBreakProperty.txt 12836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge: 12846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org EastAsianWidth.txt 12856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LineBreak.txt 12866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 12876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* my ucd2unidata.bat (needs to be updated each time with UCD and file version numbers) 12886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\BidiMirroring.txt ..\unidata\ 12896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\Blocks.txt ..\unidata\ 12906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\CaseFolding.txt ..\unidata\ 12916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\DerivedAge.txt ..\unidata\ 12926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedBidiClass.txt ..\unidata\ 12936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedJoiningGroup.txt ..\unidata\ 12946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedJoiningType.txt ..\unidata\ 12956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedNumericValues.txt ..\unidata\ 12966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\NormalizationCorrections.txt ..\unidata\ 12976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\PropertyAliases.txt ..\unidata\ 12986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\PropertyValueAliases.txt ..\unidata\ 12996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\SpecialCasing.txt ..\unidata\ 13006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\UnicodeData.txt ..\unidata\ 13016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\DerivedCoreProperties.txt > ..\unidata\DerivedCoreProperties.txt 13036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\DerivedNormalizationProps.txt > ..\unidata\DerivedNormalizationProps.txt 13046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\NormalizationTest.txt > ..\unidata\NormalizationTest.txt 13056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\PropList.txt > ..\unidata\PropList.txt 13066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\Scripts.txt > ..\unidata\Scripts.txt 13076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\GraphemeBreakProperty.txt > ..\unidata\GraphemeBreakProperty.txt 13086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\SentenceBreakProperty.txt > ..\unidata\SentenceBreakProperty.txt 13096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\WordBreakProperty.txt > ..\unidata\WordBreakProperty.txt 13106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\EastAsianWidth.txt | ucdmerge > ..\unidata\EastAsianWidth.txt 13116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\LineBreak.txt | ucdmerge > ..\unidata\LineBreak.txt 13126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 13146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl 13156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + cd \svn\icuproj\icu\uni51\source\tools\genpname 13166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 13176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl \svn\icuproj\icu\uni51 > out.txt 13186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + preparse.pl complains with errors like the following: 13196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Error: sc:Cari already set to Carian, cannot set to Cari at preparse.pl line 1308, <GEN6> line 30. 13206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org This is because ICU 3.8 had scripts from ISO 15924 which are now 13216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org added to Unicode 5.1, and the script shows a conflict between SyntheticPropertyValueAliases.txt 13226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and PropertyValueAliases.txt. 13236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> Removed duplicate script entries from SyntheticPropertyValueAliases.txt: 13246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Cari, Cham, Kali, Lepc, Lyci, Lydi, Olck, Rjng, Saur, Sund, Vaii 13256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + PropertyValueAliases.txt now explicitly contains values for boolean properties: 13266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org N/Y, No/Yes, F/T, False/True 13276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> Added N/No and Y/Yes to preparse.pl function read_PropertyValueAliases. 13286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org It will use further values from the file if present. 13296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops 13316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values 13326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 17 new blocks 13336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 11 new script values already added in ICU 3.8 for ISO 15924 coverage 13346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (removed from SyntheticPropertyValueAliases.txt) 13356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + 14 new script values added for ISO 15924 coverage (not in Unicode 5.1) 13366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org (added to SyntheticPropertyValueAliases.txt) 13376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uprops.icu (uprops.h) only provides 7 bits for script codes. 13386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org In ICU 4.0 there are USCRIPT_CODE_LIMIT=130 script codes now. 13396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org There is none above 127 yet which is the script code for an 13406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org assigned Unicode character, so ICU 4.0 uprops.icu does not store any 13416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org script code values greater than 127. 13426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org However, it does need to store the maximum script value=USCRIPT_CODE_LIMIT-1=129 13436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in a parallel bit field, and that overflows now. 13446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Also, future values >=128 would be incompatible anyway. 13456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org uprops.h is modified to move around several of the bit fields 13466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org in the properties vector words, and now uses 8 bits for the script code. 13476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Two other bit fields also grow to accommodate future growth: 13486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Block (current count: 172) grows from 8 to 9 bits, 13496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and Word_Break grows from 4 to 5 bits. 13506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- renamed property Simple_Case_Folding (sfc->scf) 13516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + nothing to be done: handled as normal alias 13526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new property JSN Jamo_Short_Name 13536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + no new API: only contributes to the Name property 13546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Grapheme_Cluster_Break (GCB) value: SM=SpacingMark 13556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) value: Burushashki_Yeh_Barree 13566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Sentence_Break (SB) values: 13576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SB ; CR ; CR 13586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SB ; EX ; Extend 13596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SB ; LF ; LF 13606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SB ; SC ; SContinue 13616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Word_Break (WB) values: 13626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; CR ; CR 13636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; Extend ; Extend 13646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; LF ; LF 13656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WB ; MB ; MidNumLet 13666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Further changes in the 2008-02-29 update: 13686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Default_Ignorable_Code_Point: The new file removes Cc, Cs, noncharacters from DICP 13696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org because they should not normally be invisible. 13706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) value Burushashki_Yeh_Barree was renamed to Burushaski_Yeh_Barree (one 'h' removed) 13716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Grapheme_Cluster_Break (GCB) value: PP=Prepend 13726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Word_Break (WB) value: NL=Newline 13736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit (see Unicode 4.1 update for comparison) 13756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FBB to 9FC3 13766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org search for both 9FBB (end) and 9FBC (limit) (regex 9FB[BC], case-insensitive) 13776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change gennames.c 13786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data 13806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\svn\icuproj\icu\uni51\source\data>NMAKE /f makedata.mak ICUMAKE=\svn\icuproj\icu\uni51\source\data\ CFG=debug uni-core-data 13816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \svn\icuproj\icu\uni51\source\data\ 13836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \svn\icuproj\icu\uni51 13846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files. 13856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "brklocal.mk". Not building user-additional break iterator files. 13866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "reslocal.mk". Not building user-additional resource bundle files. 13876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "collocal.mk". Not building user-additional resource bundle files. 13886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "rbnflocal.mk". Not building user-additional resource bundle files. 13896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "trnslocal.mk". Not building user-additional transliterator files. 13906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "misclocal.mk". Not building user-additional miscellaenous files. 13916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties 13926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties 13936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties 13946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization 13956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\svn\icuproj\icu\uni51\source\data\out\build\icudt39l" 13966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\svn\icuproj\icu\uni51\source\data\out\tmp" 13976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 13986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\svn\icuproj\icu\uni51\source\common 13996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and rebuild the common library 14006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Break iterators 14026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update break iterator rules to new UAX versions and new property values 14046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** UCA 14066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure 14086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Test suites 14106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Test that APIs using Unicode property value aliases (like UnicodeSet) 14116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org support all of the boolean values N/Y, No/Yes, F/T, False/True 14126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org -> TestBinaryValues() tests in both cintltst and intltest 14136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 14156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguage.h, 14166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (it also generates 14176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.) 14186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement. 14206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files. 14226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp 14246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...) 14256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries. 14276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Documentation 14296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Update User Guide 14306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Jamo_Short_Name, sfc->scf, binary property value aliases 14316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 14336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.0 update 14356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs 14376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org5084 RFE: Update to Unicode 5.0 14396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 14416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 14436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip: 14446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedCoreProperties.txt 14456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedNormalizationProps.txt 14466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org NormalizationTest.txt 14476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org PropList.txt 14486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org Scripts.txt 14496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org GraphemeBreakProperty.txt 14506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SentenceBreakProperty.txt 14516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WordBreakProperty.txt 14526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge: 14536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org EastAsianWidth.txt 14546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LineBreak.txt 14556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* my ucd2unidata.bat (needs to be updated each time with UCD and file version numbers) 14576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\BidiMirroring.txt ..\unidata\ 14586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\Blocks.txt ..\unidata\ 14596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\CaseFolding.txt ..\unidata\ 14606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\DerivedAge.txt ..\unidata\ 14616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedBidiClass.txt ..\unidata\ 14626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedJoiningGroup.txt ..\unidata\ 14636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedJoiningType.txt ..\unidata\ 14646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedNumericValues.txt ..\unidata\ 14656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\NormalizationCorrections.txt ..\unidata\ 14666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\PropertyAliases.txt ..\unidata\ 14676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\PropertyValueAliases.txt ..\unidata\ 14686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\SpecialCasing.txt ..\unidata\ 14696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\UnicodeData.txt ..\unidata\ 14706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\DerivedCoreProperties.txt > ..\unidata\DerivedCoreProperties.txt 14726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\DerivedNormalizationProps.txt > ..\unidata\DerivedNormalizationProps.txt 14736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\NormalizationTest.txt > ..\unidata\NormalizationTest.txt 14746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\PropList.txt > ..\unidata\PropList.txt 14756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\Scripts.txt > ..\unidata\Scripts.txt 14766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\GraphemeBreakProperty.txt > ..\unidata\GraphemeBreakProperty.txt 14776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\SentenceBreakProperty.txt > ..\unidata\SentenceBreakProperty.txt 14786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\WordBreakProperty.txt > ..\unidata\WordBreakProperty.txt 14796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\EastAsianWidth.txt | ucdmerge > ..\unidata\EastAsianWidth.txt 14806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\LineBreak.txt | ucdmerge > ..\unidata\LineBreak.txt 14816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure 14836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 14856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl 14866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + make sure that data.h is writable 14876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + perl preparse.pl \cvs\oss\icu > out.txt 14886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops 14906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values 14916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + script values already added in ICU 3.6 because all of ISO 15924 is now covered 14926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data 14946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\cvs\oss\icu\source\data>NMAKE /f makedata.mak ICUMAKE=\cvs\oss\icu\source\data\ CFG=debug uni-core-data 14956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 14966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \cvs\oss\icu\source\data\ 14976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \cvs\oss\icu 14986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files. 14996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org[etc.] 15006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties 15016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties 15026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties 15036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization 15046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\cvs\oss\icu\source\data\out\build\icudt35l" 15056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\cvs\oss\icu\source\data\out\tmp" 15066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\cvs\oss\icu\source\common 15086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and rebuild the common library 15096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 15116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 15126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 15136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in 15146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information 15166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguage.h, 15176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (it also generates 15186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.) 15196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement. 15216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files. 15236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp 15256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...) 15266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries. 15286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 15306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 4.1 update 15326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs 15346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org4332 RFE: Update to Unicode 4.1 15366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org4157 RBBI, TR29 4.1 updates 15376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 15396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 15416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip: 15426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedCoreProperties.txt 15436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org DerivedNormalizationProps.txt 15446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org NormalizationTest.txt 15456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org GraphemeBreakProperty.txt 15466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SentenceBreakProperty.txt 15476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WordBreakProperty.txt 15486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge: 15496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org EastAsianWidth.txt 15506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org LineBreak.txt 15516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* add new files to the repository 15536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org GraphemeBreakProperty.txt 15546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org SentenceBreakProperty.txt 15556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org WordBreakProperty.txt 15566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure 15586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 15606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- handle new enumerated properties in sub read_uchar 15616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl 15626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops 15646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new binary properties 15656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Pattern_Syntax 15666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Pattern_White_Space 15676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new enumerated properties 15686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Grapheme_Cluster_Break 15696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Sentence_Break 15706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Word_Break 15716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script & line break values 15726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* gencase 15746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- case-ignorable changes 15756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org see http://www.unicode.org/versions/Unicode4.1.0/#CaseMods 15766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org now: (D47a) Word_Break=MidLetter or Mn, Me, Cf, Lm, Sk 15776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 15796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 15806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 15816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in 15826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** tests 15846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- verify that u_charMirror() round-trips 15856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- test all new properties and some new values of old properties 15866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other code 15886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 15896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit 15906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FA5 to 9FBB 15916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org search for both 9FA5 (end) and 9FA6 (limit) (regex 9FA[56], case-insensitive) 15926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do not modify BOCU/BOCSU code because that would change the encoding 15936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and break binary compatibility! 15946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + similarly, do not change the GB 18030 range data (ucnvmbcs.c), 15956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org NamePrepProfile.txt 15966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + ignore trietest.c: test data is arbitrary 15976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + ignore tstnorm.cpp: test optimization, not important 15986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + ignore collation: 9FA[56] only appears in comments; swapCJK() uses the whole block up to 9FFF 15996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change line_th.txt and word_th.txt 16006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org by replacing hardcoded ranges with the new property values 16016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + do change gennames.c 16026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\data\brkitr\line_th.txt(229): \u33E0-\u33FE \u3400-\u4DB5 \u4E00-\u9FA5 \uA000-\uA48C \uA490-\uA4C6 16046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\data\brkitr\word_th.txt(23): \u33E0-\u33FE \u3400-\u4DB5 \u4E00-\u9FA5 \uA000-\uA48C \uA490-\uA4C6 16056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\tools\gennames\gennames.c(971): 0x4e00, 0x9fa5, 16066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* case mappings 16086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- compare new special casing context conditions with previous ones 16096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org see http://www.unicode.org/versions/Unicode4.1.0/#CaseMods 16106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 16126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- consider storing only the short name if it is the same as the long name 16136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other reviews 16156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UAX #29 changes (grapheme/word/sentence breaks) 16166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UAX #14 changes (line breaks) 16176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Pattern_Syntax & Pattern_White_Space 16186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- *** 16206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 4.0.1 update 16226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs 16246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3170 RFE: Update to Unicode 4.0.1 16266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3171 Add new Unicode 4.0.1 properties 16276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3520 use Unicode 4.0.1 updates for break iteration 16286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code 16306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation 16326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip: DerivedNormalizationProps.txt, NormalizationTest.txt, DerivedCoreProperties.txt 16336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge: EastAsianWidth.txt, LineBreak.txt 16346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file fixes 16366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix UnicodeData.txt general categories of Ethiopic digits Nd->No 16376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org according to PRI #26 16386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/review/resolved-pri.html#pri26 16396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- undone again because no corrigendum in sight; 16406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org instead modified tests to not check consistency on this for Unicode 4.0.1 16416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* ucdterms.txt 16436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update from http://www.unicode.org/copyright.html 16446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org formatted for plain text 16456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uprops.h & uprops.c & genprops 16476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add UBLOCK_CYRILLIC_SUPPLEMENT because the block is renamed 16486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add U_LB_INSEPARABLE due to a spelling fix 16496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + put short name comment only on line with new constant 16506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org for genpname perl script parser 16516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new binary properties 16526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + STerm 16536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + Variation_Selector 16546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname 16566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix genpname perl script so that it doesn't choke on more than 2 names per property value 16576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- perl script: correctly calculate the maximum number of fields per row 16586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uscript.h 16606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new script code Hrkt=Katakana_Or_Hiragana 16616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* gennorm.c track changes in DerivedNormalizationProps.txt 16636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- "FNC" -> "FC_NFKC" 16646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- single field "NFD_NO" -> two fields "NFD_QC; N" etc. 16656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genprops/props2.c track changes in DerivedNumericValues.txt 16676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- changed from 3 columns to 2, dropping the numeric type 16686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org + assume that the type is always numeric for Han characters, 16696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org and that only those are added in addition to what UnicodeData.txt lists 16706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers 16726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak 16736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h 16746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in 16756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** tests 16776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update test of default bidi classes according to PRI #28 16786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org /tsutil/cucdtst/TestUnicodeData 16796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org http://www.unicode.org/review/resolved-pri.html#pri28 16806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- bidi tests: change exemplar character for ES depending on Unicode version 16816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- change hardcoded expected property values where they change 16826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other code 16846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* name matching 16866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- read UCD.html 16876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* scripts 16896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- use new Hrkt=Katakana_Or_Hiragana 16906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org 16916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* ZWJ & ZWNJ 16926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- are now part of combining character sequences 16936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ 1694