16f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Copyright (C) 2004-2013, International Business Machines
26f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Corporation and others.  All Rights Reserved.
36f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*
46f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   file name:  changes.txt
56f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   encoding:   US-ASCII
66f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   tab size:   8 (not used)
76f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   indentation:4
86f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*
96f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   created on: 2004may06
106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*   created by: Markus W. Scherer
116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*
126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* change log for Unicode updates
136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.3 update
176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri249/  -- beta review
196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/uax-proposed-updates.html
206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/versions/beta-6.3.0.html#notable_issues
216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr44/tr44-11.html
226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac
246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10128: update ICU to Unicode 6.3 beta
266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10168: update ICU to Unicode 6.3 final
276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni63 at r33552 from trunk at r33551
286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni63 at r33550 from trunk at r33553
296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 10142: implement Unicode 6.3 bidi algorithm additions
316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (configure.in & configure: have been modified to extract the version from uchar.h)
366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo
376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.dev.test.lang.UCharacterTest.VERSION_
386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Run ICU4C "configure" _after_ updating the Unicode version number in uchar.h
406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  so that the makefiles see the new version number.
416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download UCD, UCA & IDNA files
476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py
486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  includes a copy of the latest IdnaMappingTable.txt (can be in some subfolder)
496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- modify preparseucd.py:
506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  parse new file BidiBrackets.txt
516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  with new properties bpb=Bidi_Paired_Bracket and bpt=Bidi_Paired_Bracket_Type
526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/src/unicode$ py/preparseucd.py ~/unidata/uni63/20130425 ~/svn.icu/uni63/src ~/svn.icutools/trunk/src
536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders.
546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines;
556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out.
566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyAliases.txt changes
586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Enumerated Property
596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bpt                      ; Bidi_Paired_Bracket_Type
606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UProperty.java & UCharacter.BidiPairedBracketType
616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> ubidi_props.h & .c & UBiDiProps.java
626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> remember to write the max value at UBIDI_MAX_VALUES_INDEX
636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uprops.cpp
646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> change ubidi.icu format version from 2.0 to 2.1
656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Miscellaneous Property
666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bpb                      ; Bidi_Paired_Bracket
676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UProperty.java
686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> ppucd.h & .cpp
696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes
716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 Bidi_Paired_Bracket_Type (bpt) values:
726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bpt; c                                ; Close
736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bpt; n                                ; None
746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bpt; o                                ; Open
756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.BidiPairedBracketType
766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> ubidi_props.h & .c & UBiDiProps.java
776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> change ubidi.icu format version from 2.0 to 2.1
786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 4 new Bidi_Class (bc) values:
796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bc ; FSI                              ; First_Strong_Isolate
806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bc ; LRI                              ; Left_To_Right_Isolate
816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bc ; RLI                              ; Right_To_Left_Isolate
826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  bc ; PDI                              ; Pop_Directional_Isolate
836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacterEnums.ECharacterDirection
846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> until the bidi code gets updated,
856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     Roozbeh suggests mapping the new bc values to ON (Other_Neutral)
866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 new Word_Break (WB) values:
876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  WB ; HL                               ; Hebrew_Letter
886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  WB ; SQ                               ; Single_Quote
896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  WB ; DQ                               ; Double_Quote
906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.WordBreak
916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> first time Word_Break numeric constants exceed 4 bits (now 17 values)
926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html
936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (added 2012-10-16)
946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Aghb  239     Caucasian Albanian
956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Mahj  314     Mahajani
966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uscript.h
976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> com.ibm.icu.lang.UScript
986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    find     USCRIPT_([^ ]+) *= ([0-9]+),(.+)
996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    replace  public static final int \1 = \2;\3
1006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> preparseucd.py _scripts_only_in_iso15924
1016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI()
1026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
1036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> update Script metadata: SCRIPT_PROPS[] in uscript_props.cpp & UScript.ScriptMetadata
1046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     (not strictly necessary for NOT_ENCODED scripts)
1056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files
1076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ export LD_LIBRARY_PATH=~/svn.icu/uni63/dbg/lib
1086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ SRC_DATA_IN=~/svn.icu/uni63/src/source/data/in
1096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ UNIDATA=~/svn.icu/uni63/src/source/data/unidata
1106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm     -s $UNIDATA/norm2 nfc.txt
1116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm    -s $UNIDATA/norm2 nfc.txt nfkc.txt
1126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt
1136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm   -s $UNIDATA/norm2 nfc.txt uts46.txt
1146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install)
1166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  so that the tools build can pick up the new definitions from the installed header files.
1176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/uni63/dbg$ echo;echo;make -j5 install > out.txt 2>&1 ; tail -n 20 out.txt
1196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make
1216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/src/unicode/c/icudefs.txt:
1236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Location (--prefix) of where ICU was installed.
1256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgset(ICU_INST_DIR /home/mscherer/svn.icu/uni63/inst)
1266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org# Location of the ICU source tree.
1276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgset(ICU_SRC_DIR /home/mscherer/svn.icu/uni63/src)
1286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/dbg/unicode/c$ cmake ../../../src/unicode/c
1306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icutools/trunk/dbg/unicode/c$ make
1316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files
1336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/dbg/unicode/c$ genprops/genprops ~/svn.icu/uni63/src
1346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icutools/trunk/dbg/unicode/c$ genuca/genuca -i ~/svn.icu/uni63/dbg/data/out/build/icudt52l ~/svn.icu/uni63/src
1356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools
1366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca again (see step above) so that it picks up the new case mappings and nfc.nrm
1376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools
1386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to
1406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar)
1416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters
1426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.3: U+2260, U+226E, U+226F
1436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.3, no test file to update
1446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files
1466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe
1476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt
1486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j
1496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
1506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  output:
1516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ...
1526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Unicode .icu files built to ./out/build/icudt52l
1536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt52b
1546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt52b
1556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt
1566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH  ../bin/icupkg ./out/tmp/icudt52l.dat ./out/icu4j/icudt52b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt52l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt52b
1576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mv ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt52b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt52b"
1586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt52b/
1596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
1606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data
1616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt52b/
1626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
1636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data
1646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    make[1]: Leaving directory `/home/mscherer/svn.icu/uni63/dbg/data'
1656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location,
1666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  separate from the other data files
1676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll
1686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/brkitr
1696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt52b
1706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/cnvalias.icu
1716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt52b
1726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll
1736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/brkitr
1746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J
1756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt52b
1766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files
1786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode
1796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA -- mostly skipped for ICU 52 / Unicode 6.3, except update coll/* files
1816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
1826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in http://www.unicode.org/Public/UCA/<beta version>/
1836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CLDR root files for ICU are in CollationAuxiliary.zip; unpack that
1846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt
1856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt
1866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (note removing the underscore before "Rules")
1876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU4C)/source/test/testdata/CollationTest_*.txt
1886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt
1896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt)
1906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines;
1916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out
1926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani
1936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca, see command line above
1946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C
1956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data:
1966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (subset of instructions above for properties data refresh, except copies all coll/*)
1976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
1986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll
1996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt52b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt52b/coll
2006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni63/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt52b
2016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging)
2026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then
2036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  utility/MultithreadTest/TestCollators will fail as well;
2046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix the conformance test before looking into the multi-thread test
2056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU, fix test code where necessary
2076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C
2096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
2106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data
2116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor
2126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni63/dbg$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install
2136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
2156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- skipped for Unicode 6.3: no new scripts
2166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk
2186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar,
2196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  instead rebuild them from merged & tested ICU4C
2206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
2226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.2 update
2246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri230/
2266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/versions/beta-6.2.0.html
2276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr44/tr44-9.html#Unicode_6.2.0
2286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri227/  Changes to Script Extensions Property Values
2296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri228/  Changing some common characters from Punctuation to Symbol
2306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/review/pri229/  Linebreaking Changes for Pictographic Symbols
2316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://www.unicode.org/reports/tr46/tr46-8.html  IDNA
2326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orghttp://unicode.org/Public/idna/6.2.0/
2336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac
2356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9515: Unicode 6.2: final ICU update
2376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9514: UCA 6.2: fix UCARules.txt
2396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 9437: update ICU to Unicode 6.2
2416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni62 at r32050 from trunk at r32041
2426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni62 at r32068 from trunk at r32066
2436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
2456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
2466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
2476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (configure.in & configure: have been modified to extract the version from uchar.h)
2486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo
2496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.dev.test.lang.UCharacterTest.VERSION_
2506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
2526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
2546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download UCD, UCA & IDNA files
2566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py
2576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  includes a copy of the latest IdnaMappingTable.txt (can be in some subfolder)
2586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- modify preparseucd.py: NamesList.txt is now in UTF-8
2596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ py/preparseucd.py ~/uni62/20120816 ~/svn.icu/uni62/src ~/svn.icu/tools/trunk/src
2606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders.
2616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines;
2626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out.
2636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes
2656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Line_Break (lb) value:
2666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  lb ; RI                               ; Regional_Indicator
2676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.LineBreak
2686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Word_Break (WB) value:
2696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  WB ; RI                               ; Regional_Indicator
2706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.WordBreak
2716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Grapheme_Cluster_Break (GCB) value:
2726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  GCB; RI                               ; Regional_Indicator
2736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.GraphemeClusterBreak
2746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 3 new numeric values
2766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  The new value -1, which was really supposed to be NaN but that would have required
2776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new UnicodeData.txt syntax, can already be represented as a "fraction" of -1/1,
2786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  but encodeNumericValue() in corepropsbuilder.cpp had to be fixed.
2796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp;12456;na=CUNEIFORM NUMERIC SIGN NIGIDAMIN;nv=-1
2806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp;12457;na=CUNEIFORM NUMERIC SIGN NIGIDAESH;nv=-1
2816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  The two new values 216000 and 432000 require an addition to the encoding of numeric values.
2826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp;12432;na=CUNEIFORM NUMERIC SIGN SHAR2 TIMES GAL PLUS DISH;nv=216000
2836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp;12433;na=CUNEIFORM NUMERIC SIGN SHAR2 TIMES GAL PLUS MIN;nv=432000
2846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uprops.h, uchar.c & UCharacterProperty.java
2856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> cucdtst.c & UCharacterTest.java
2866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files
2886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ export LD_LIBRARY_PATH=~/svn.icu/uni62/dbg/lib
2896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ SRC_DATA_IN=~/svn.icu/uni62/src/source/data/in
2906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ UNIDATA=~/svn.icu/uni62/src/source/data/unidata
2916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm     -s $UNIDATA/norm2 nfc.txt
2926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm    -s $UNIDATA/norm2 nfc.txt nfkc.txt
2936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt
2946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm   -s $UNIDATA/norm2 nfc.txt uts46.txt
2956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
2966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install)
2976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  so that the tools build can pick up the new definitions from the installed header files.
2986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make
2996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files
3016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genprops/genprops ~/svn.icu/uni62/src
3026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- in initial bootstrapping, change the UCA version
3036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in source/data/unidata/FractionalUCA.txt to match the new Unicode version
3046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genuca/genuca -i ~/svn.icu/uni62/dbg/data/out/build/icudt50l ~/svn.icu/uni62/src
3056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools
3066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + if genrb fails to build coll/root.res with an U_INVALID_FORMAT_ERROR,
3076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    check if the UCA version in FractionalUCA.txt matches the new Unicode version
3086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (see step above)
3096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca again (see step above) so that it picks up the new case mappings and nfc.nrm
3106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU (make install) & tools
3116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to
3136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar)
3146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters
3156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.2: U+2260, U+226E, U+226F
3166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.2, no test file to update
3176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files
3196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe
3206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt
3216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j
3226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
3236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  output:
3246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ...
3256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Unicode .icu files built to ./out/build/icudt50l
3266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt50b
3276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt50b
3286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt
3296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH  ../bin/icupkg ./out/tmp/icudt50l.dat ./out/icu4j/icudt50b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt50l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt50b
3306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mv ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt50b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt50b"
3316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt50b/
3326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
3336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data
3346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt50b/
3356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
3366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data
3376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    make[1]: Leaving directory `/home/mscherer/svn.icu/uni62/dbg/data'
3386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location,
3396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  separate from the other data files
3406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll
3416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/brkitr
3426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt50b
3436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/cnvalias.icu
3446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt50b
3456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll
3466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/brkitr
3476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J
3486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt50b
3496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files
3516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode
3526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA
3546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in http://www.unicode.org/Public/UCA/<beta version>/
3566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CLDR root files for ICU are in CollationAuxiliary.zip; unpack that
3576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt
3586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt
3596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (note removing the underscore before "Rules")
3606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU4C)/source/test/testdata/CollationTest_*.txt
3616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt
3626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt)
3636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines;
3646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out
3656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani
3666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run genuca, see command line above
3676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C
3686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data:
3696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (subset of instructions above for properties data refresh, except copies all coll/*)
3706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
3716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/bld$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll
3726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt50b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt50b/coll
3736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/uni62/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt50b
3746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging)
3756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then
3766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  utility/MultithreadTest/TestCollators will fail as well;
3776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix the conformance test before looking into the multi-thread test
3786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU, fix test code where necessary
3806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C
3826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
3836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data
3846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor
3856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/uni62/dbg$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install
3866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
3886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- skipped for Unicode 6.2: no new scripts
3896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk
3916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar,
3926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  instead rebuild them from merged & tested ICU4C
3936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
3956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgFuture Unicode update
3976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
3986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgTools simplified since the Unicode 6.1 update. See
3996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- http://site.icu-project.org/design/props/ppucd
4006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- http://bugs.icu-project.org/trac/wiki/Markus/ReviewTicket8972
4016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Unicode version numbers
4036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- icutools/unicode/makedefs.sh was deleted, so one fewer place for version & path updates
4046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
4066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdcopy.py, idna2nrm.py and genpname/preparse.pl replaced by preparseucd.py:
4076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ py/preparseucd.py ~/uni61/20120118 ~/svn.icu/trunk/src ~/svn.icu/tools/trunk/src
4086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This writes files (especially ppucd.txt) to the ICU4C unidata and testdata subfolders.
4096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines;
4106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out.
4116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes
4136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Script codes that are in ISO 15924 but not in Unicode are now listed in
4146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  preparseucd.py, in the _scripts_only_in_iso15924 variable.
4156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  If there are new ISO codes, then add them.
4166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  If Unicode adds some of them, then remove them from the .py variable.
4176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes
4196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- No more manual changes for CJK ranges for algorithmic names;
4206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  those are now written to ppucd.txt and genprops reads them from there.
4216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files (makeprops.sh was deleted)
4236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genprops/genprops ~/svn.icu/trunk/src
4246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* no more manual updates of source/data/unidata/norm2/nfkc_cf.txt
4266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- it is now generated by preparseucd.py
4276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* no more separate idna2nrm.py run and manual copying to generate source/data/unidata/norm2/uts46.txt
4296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- it is now generated by preparseucd.py
4306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make sure that the Unicode data folder passed into preparseucd.py
4316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  includes a copy of http://www.unicode.org/Public/idna/6.1.0/IdnaMappingTable.txt
4326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (can be in some subfolder)
4336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate normalization data files
4356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ export LD_LIBRARY_PATH=~/svn.icu/trunk/dbg/lib
4366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ SRC_DATA_IN=~/svn.icu/trunk/src/source/data/in
4376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ UNIDATA=~/svn.icu/trunk/src/source/data/unidata
4386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfc.nrm     -s $UNIDATA/norm2 nfc.txt
4396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc.nrm    -s $UNIDATA/norm2 nfc.txt nfkc.txt
4406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/nfkc_cf.nrm -s $UNIDATA/norm2 nfc.txt nfkc.txt nfkc_cf.txt
4416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ bin/gennorm2 -o $SRC_DATA_IN/uts46.nrm   -s $UNIDATA/norm2 nfc.txt uts46.txt
4426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install)
4446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make
4456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* new way to call genuca (makeuca.sh was deleted)
4476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/dbg/unicode$ c/genuca/genuca -i ~/svn.icu/trunk/dbg/data/out/build/icudt49l ~/svn.icu/trunk/src
4486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
4506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.1 update
4526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** ICU Trac
4546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8995 final update to Unicode 6.1
4566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8994 regenerate source/layout/CanonData.cpp
4576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8961 support Unicode "Age" value *names*
4596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8963 support multiple character name aliases & types
4606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ticket 8827 "update ICU to Unicode 6.1"
4626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- C++ branches/markus/uni61 at r30864 from trunk at r30843
4636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Java branches/markus/uni61 at r30865 from trunk at r30863
4646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
4666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
4676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
4686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (configure.in & configure: have been modified to extract the version from uchar.h)
4696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo
4706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- icutools/unicode/makedefs.sh
4716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + also review & update other definitions in that file,
4726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    e.g. the ICU version in this path: BLD_DATA_FILES=$ICU_BLD/data/out/build/icudt49l
4736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
4756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
4776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/tools/trunk/src/unicode/c/genprops/misc$ ./ucdcopy.py ~/uni61/20111205/ucd ~/uni61/processed
4796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This prepares both unidata and testdata files in respective output subfolders.
4806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Check test file diffs for previously commented-out, known-failing data lines;
4816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out.
4826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
4836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes
4846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 11 new block names:
4856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Arabic_Extended_A
4866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Arabic_Mathematical_Alphabetic_Symbols
4876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Chakma
4886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Meetei_Mayek_Extensions
4896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Meroitic_Cursive
4906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Meroitic_Hieroglyphs
4916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Miao
4926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Sharada
4936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Sora_Sompeng
4946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Sundanese_Supplement
4956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Takri
4966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to uchar.h
4976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to UCharacter.UnicodeBlock IDs
4986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Eclipse find     UBLOCK_([^ ]+) = ([0-9]+), (/.+)
4996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org            replace  public static final int \1_ID = \2; \3
5006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to UCharacter.UnicodeBlock objects
5016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Eclipse find     UBLOCK_([^ ]+) = [0-9]+, (/.+)
5026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org            replace  public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2
5036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 1 new Joining_Group (jg) value:
5046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Rohingya_Yeh
5056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.JoiningGroup
5066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new Line_Break (lb) values:
5076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  CJ=Conditional_Japanese_Starter
5086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  HL=Hebrew_Letter
5096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.LineBreak
5106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 7 new scripts:
5116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Cakm      ; Chakma
5126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Merc      ; Meroitic_Cursive
5136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Mero      ; Meroitic_Hieroglyphs
5146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Plrd      ; Miao
5156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Shrd      ; Sharada
5166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Sora      ; Sora_Sompeng
5176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Takr      ; Takri
5186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> remove these from SyntheticPropertyValueAliases.txt
5196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> fix expectedLong names in cucdapi.c/TestUScriptCodeAPI()
5206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
5216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html
5226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (added 2011-06-21)
5236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Khoj        322     Khojki
5246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Tirh        326     Tirhuta
5256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    and another one added 2011-12-09
5266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Hluw        080     Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs)
5276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uscript.h
5286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> com.ibm.icu.lang.UScript
5296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    find     USCRIPT_([^ ]+) *= ([0-9]+),(.+)
5306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    replace  public static final int \1 = \2;\3
5316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> SyntheticPropertyValueAliases.txt
5326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI()
5336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
5346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes
5366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- the last Unihan code point changes from U+9FCB to U+9FCC
5376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  search for both 9FCB (end) and 9FCC (limit) (regex 9FC[BC], case-insensitive)
5386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change gennames.c
5396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change swapCJK() in ucol.cpp & ImplicitCEGenerator.java
5406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* DerivedBidiClass.txt changes
5426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new default-AL blocks:
5436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#     Arabic Extended-A: U+08A0  -  U+08FF  (was default-R)
5446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#     Arabic Mathematical Alphabetic Symbols:
5456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#                       U+1EE00  - U+1EEFF  (was default-R)
5466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 2 new default-R blocks:
5476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#     Meroitic Hieroglyphs:
5486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#                        U+10980 - U+1099F
5496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org#     Meroitic Cursive:  U+109A0 - U+109FF
5506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> should be picked up by the explicit data in the file
5516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* NameAliases.txt changes
5536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- from
5546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # Each line has two fields
5556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # First field: Code point
5566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # Second field: Alias
5576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- to
5586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # Each line has three fields, as described here:
5596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    #
5606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # First field:  Code point
5616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # Second field: Alias
5626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    # Third field:  Type
5636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Also, the file previously allowed multiple aliases but only now does it
5646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  actually provide multiple, even multiple of the same type. For example,
5656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    FEFF;BYTE ORDER MARK;alternate
5666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    FEFF;BOM;abbreviation
5676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    FEFF;ZWNBSP;abbreviation
5686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This breaks our gennames parser, unames.icu data structure, and API.
5696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Fix gennames to only pick up "correction" aliases.
5706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  New ticket #8963 for further changes.
5716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux)
5736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname
5746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
5756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl ~/svn.icu/trunk/src > out.txt
5766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl shows no errors, out.txt Info and Warning lines look ok
5776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install)
5796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  so that the tools build can pick up the new definitions from the installed header files.
5806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools (at least genpname) using CMake+make
5816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname
5836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (builds both pnames.icu and propname_data.h)
5846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in
5856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/common --csource
5866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build ICU (make install)
5886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make
5896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/nfkc_cf.txt
5916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- follow the instructions in nfkc_cf.txt for updating it from DerivedNormalizationProps.txt
5926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
5936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/uts46.txt
5946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download http://www.unicode.org/Public/idna/6.1.0/IdnaMappingTable.txt
5956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  to ~/svn.icu/tools/trunk/src/unicode/py
5966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- adjust idna2nrm.py to remove "; NV8": For UTS #46, we do not care about "not valid in IDNA2008".
5976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ ./idna2nrm.py
5986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ cp uts46.txt ~/svn.icu/trunk/src/source/data/unidata/norm2
5996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to
6016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar)
6026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters
6036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0..6.1: U+2260, U+226E, U+226F
6046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- nothing new in 6.1, no test file to update
6056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files
6076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- in initial bootstrapping, change the UCA version
6086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in source/data/unidata/FractionalUCA.txt to match the new Unicode version
6096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ ./makeprops.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
6106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
6116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + if genrb fails to build coll/root.res with an U_INVALID_FORMAT_ERROR,
6126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    check if the UCA version in FractionalUCA.txt matches the new Unicode version
6136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (see step above)
6146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh so that genuca picks up the new case mappings and nfc.nrm:
6156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
6166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
6176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files
6196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe
6206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt
6216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j
6226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
6236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  output:
6246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ...
6256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Unicode .icu files built to ./out/build/icudt49l
6266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt49b
6276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt49b
6286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    echo pnames.icu ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt
6296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH  ../bin/icupkg ./out/tmp/icudt49l.dat ./out/icu4j/icudt49b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt49l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt49b
6306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mv ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/zoneinfo64.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/metaZones.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/timezoneTypes.res" ./out/icu4j/"com/ibm/icu/impl/data/icudt49b/windowsZones.res" "./out/icu4j/tzdata/com/ibm/icu/impl/data/icudt49b"
6316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt49b/
6326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
6336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data
6346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icutzdata.jar -C ./out/icu4j/tzdata com/ibm/icu/impl/data/icudt49b/
6356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
6366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icutzdata.jar /tmp/icu4j/main/shared/data
6376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    make[1]: Leaving directory `/home/mscherer/svn.icu/trunk/bld/data'
6386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location,
6396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  separate from the other data files
6406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll
6416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/brkitr
6426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt49b
6436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/cnvalias.icu
6446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt49b
6456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll
6466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/brkitr
6476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J
6486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt49b
6496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files
6516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode
6526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* test ICU so far, fix test code where necessary
6546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- temporarily ignore collation issues that look like UCA/UCD mismatches,
6556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  until UCA data is updated
6566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA
6586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in
6606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    http://www.unicode.org/Public/UCA/6.1.0/CollationAuxiliary-<dev. version>.txt
6616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt
6626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt
6636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (note removing the underscore before "Rules")
6646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU)/source/test/testdata/CollationTest_*.txt
6656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt
6666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  with output from Mark's Unicode tools (..._CLDR_..._SHORT.txt)
6676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check test file diffs for previously commented-out, known-failing data lines;
6686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  probably need to keep those commented out
6696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- check FractionalUCA.txt for manual changes of lead bytes from IMPLICIT to Hani
6706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh:
6716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
6726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C
6736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data:
6746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (subset of instructions above for properties data refresh, except copies all coll/*)
6756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
6766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld$ mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll
6776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt49b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt49b/coll
6786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt49b
6796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments, useful for debugging)
6806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then
6816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  utility/MultithreadTest/TestCollators will fail as well;
6826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix the conformance test before looking into the multi-thread test
6836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C
6856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
6866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data
6876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor
6886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install
6896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
6916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For details see the Unicode 5.2 change log below.)
6936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run icu4j-tools: com.ibm.icu.dev.tool.layout.ScriptNameBuilder.
6956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  This generates LEScripts.h, LELanguages.h, ScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp
6966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in the working directory.
6976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (It also generates ScriptRunData.cpp, which is no longer needed.)
6986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
6996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  The generated files have a current copyright date and "@draft" statement.
7006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- diff current <icu>/source/layout files vs. generated ones
7026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu4j/trunk/src$ kdiff3 ~/svn.icu/trunk/src/source/layout tools/misc/src/com/ibm/icu/dev/tool/layout
7036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  review and manually merge desired changes;
7046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix gratuitous changes, incorrect @draft and missing aliases;
7056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Unicode-derived script codes should be "born stable" like constants in uchar.h, uscript.h etc.
7066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- if you just copy the above files, then
7076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix mixed line endings, review the diffs as above and restore changes to API tags etc.;
7086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h
7096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** merge the Unicode update branches back onto the trunk
7116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- do not merge the icudata.jar and testdata.jar,
7126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  instead rebuild them from merged & tested ICU4C
7136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
7156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU 4.8 (no Unicode update, just new script codes)
7176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* 9 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html
7196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (added 2010-12-21)
7206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Afak    439     Afaka
7216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Jurc    510     Jurchen
7226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Mroo    199     Mro, Mru
7236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Nshu    499     Nüshu
7246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Shrd    319     Sharada, Śāradā
7256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Sora    398     Sora Sompeng
7266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Takr    321     Takri, Ṭākrī, Ṭāṅkrī
7276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Tang    520     Tangut
7286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Wole    480     Woleai
7296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uscript.h
7306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> com.ibm.icu.lang.UScript
7316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    find     USCRIPT_([^ ]+) *= ([0-9]+),(.+)
7326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    replace  public static final int \1 = \2;\3
7336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> genpname/SyntheticPropertyValueAliases.txt
7346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI()
7356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
7366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux)
7386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname
7396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
7406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl ~/svn.icu/trunk/src > out.txt
7416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl shows no errors, out.txt Info and Warning lines look ok
7426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild Unicode tools (at least genpname) using make
7446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- You might first need to "make install" ICU so that the tools build can pick
7456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  up the new definitions from the installed header files.
7466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname
7486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (builds both pnames.icu and propname_data.h)
7496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in
7506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genpname/genpname -v -d ~/svn.icu/trunk/src/source/common --csource
7516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
7526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genprops
7546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genprops/genprops -d ~/svn.icu/trunk/src/source/data/in -s ~/svn.icu/trunk/src/source/data/unidata -i ~/svn.icu/trunk/dbg/data/out/build/icudt48l -u 6.0
7556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode/c$ genprops/genprops -d ~/svn.icu/trunk/src/source/common --csource -s ~/svn.icu/trunk/src/source/data/unidata -i ~/svn.icu/trunk/dbg/data/out/build/icudt48l -u 6.0
7566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
7576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files
7596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe
7606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt
7616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j
7626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/dbg$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
7636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location,
7646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  separate from the other data files
7656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt48b
7666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt48b/pnames.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt48b
7676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/dbg/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt48b/uprops.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt48b
7686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J
7696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/dbg/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt48b
7706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* should have updated the layout engine script codes but forgot
7726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
7746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 6.0 update
7766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets
7786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7264 Unicode 6.0 Update
7806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
7826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
7836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
7846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (configure.in & configure: have been modified to extract the version from uchar.h)
7856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- com.ibm.icu.util.VersionInfo
7866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
7886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
7906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org~/svn.icu/tools/trunk/src/unicode/c/genprops/misc$ ./ucdcopy.py ~/uni60/20100720/ucd ~/uni60/processed
7926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- This now prepares both unidata and testdata files in respective output subfolders.
7936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
7946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyAliases.txt changes
7956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Script_Extensions property defined in the new ScriptExtensions.txt file
7966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  but not listed in PropertyAliases.txt; reported to unicode.org;
7976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> added to tools/trunk/src/unicode/c/genpname/SyntheticPropertyAliases.txt
7986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    scx; Script_Extensions
7996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h with new UProperty section
8006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> com.ibm.icu.lang.UProperty, parallel with uchar.h
8016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* PropertyValueAliases.txt changes
8036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 12 new block names:
8046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Alchemical_Symbols
8056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Bamum_Supplement
8066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Batak
8076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Brahmi
8086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  CJK_Unified_Ideographs_Extension_D
8096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Emoticons
8106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Ethiopic_Extended_A
8116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Kana_Supplement
8126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Mandaic
8136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Miscellaneous_Symbols_And_Pictographs
8146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Playing_Cards
8156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Transport_And_Map_Symbols
8166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to uchar.h
8176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to UCharacter.UnicodeBlock
8186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Eclipse find     UBLOCK_([^ ]+) = [0-9]+, (/.+)
8196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org            replace  public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2
8206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Joining_Group (jg) values:
8216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Teh_Marbuta_Goal becomes the new canonical value for the old Hamza_On_Heh_Goal which becomes an alias
8226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uchar.h & UCharacter.JoiningGroup
8236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 3 new scripts:
8246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Batk      ; Batak
8256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Brah      ; Brahmi
8266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sc ; Mand      ; Mandaic
8276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> remove these from SyntheticPropertyValueAliases.txt
8286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add alias USCRIPT_MANDAIC to USCRIPT_MANDAEAN
8296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> fix expectedLong names in cucdapi.c/TestUScriptCodeAPI()
8306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
8316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- 13 new script codes from ISO 15924 http://www.unicode.org/iso15924/codechanges.html
8326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (added 2009-11-11..2010-07-18)
8336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Bass        259     Bassa Vah
8346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Dupl        755     Duployan shortand
8356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Elba        226     Elbasan
8366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Gran        343     Grantha
8376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Kpel        436     Kpelle
8386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Loma        437     Loma
8396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Mend        438     Mende
8406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Merc        101     Meroitic Cursive
8416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Narb        106     Old North Arabian
8426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Nbat        159     Nabataean
8436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Palm        126     Palmyrene
8446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Sind        318     Sindhi
8456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Wara        262     Warang Citi
8466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> uscript.h
8476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> com.ibm.icu.lang.UScript
8486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    find     USCRIPT_([^ ]+) *= ([0-9]+),(.+)
8496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    replace  public static final int \1 = \2;\3
8506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> SyntheticPropertyValueAliases.txt
8516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to expectedLong and expectedShort names in cintltst/cucdapi.c/TestUScriptCodeAPI()
8526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      and in com.ibm.icu.dev.test.lang.TestUScript.java
8536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ISO 15924 name change
8546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Mero        100     Meroitic Hieroglyphs (was Meroitic)
8556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add new alias USCRIPT_MEROITIC_HIEROGLYPHS to USCRIPT_MEROITIC
8566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- property value alias added for Cham, was already moved out of SyntheticPropertyValueAliases.txt
8576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UnicodeData.txt changes
8596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new CJK block:
8606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  2B740;<CJK Ideograph Extension D, First>;Lo;0;L;;;;;N;;;;;
8616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  2B81D;<CJK Ideograph Extension D, Last>;Lo;0;L;;;;;N;;;;;
8626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> add to tools/trunk/src/unicode/c/gennames/gennames.c, with new ucdVersion
8636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode tools using CMake+make
8656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname/preparse.pl (on Linux)
8676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + cd ~/svn.icu/tools/trunk/src/unicode/c/genpname
8686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
8696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl ~/svn.icu/trunk/src > out.txt
8706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl shows no errors, out.txt Info and Warning lines look ok
8716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild Unicode tools (at least genpname) using make
8736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- You might first need to "make install" ICU so that the tools build can pick
8746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  up the new definitions from the installed header files.
8756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* run genpname
8776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/bld/unicode$ c/genpname/genpname -v -d ~/svn.icu/trunk/src/source/data/in
8786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
8796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/nfkc_cf.txt
8816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- follow the instructions in nfkc_cf.txt for updating it from DerivedNormalizationProps.txt
8826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update source/data/unidata/norm2/uts46.txt
8846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- download http://www.unicode.org/Public/idna/6.0.0/IdnaMappingTable.txt
8856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  to ~/svn.icu/tools/trunk/src/unicode/py
8866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- adjust idna2nrm.py to handle new disallowed_STD3_valid and disallowed_STD3_mapped values
8876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ ./idna2nrm.py
8886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode/py$ cp uts46.txt ~/svn.icu/trunk/src/source/data/unidata/norm2
8896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update uts46test.cpp and UTS46Test.java if there are new characters that are equivalent to
8916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  sequences with non-LDH ASCII (that is, their decompositions contain '=' or similar)
8926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- grep IdnaMappingTable.txt or uts46.txt for "disallowed_STD3_valid" on non-ASCII characters
8936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 6.0: U+2260, U+226E, U+226F
8946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
8956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* generate core properties data files
8966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/tools/trunk/src/unicode$ ./makeprops.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
8976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
8986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh so that genuca picks up the new nfc.nrm:
8996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
9006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU & tools
9016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* implement new Script_Extensions property (provisional)
9036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- parser & generator: genprops & uprops.icu
9046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uscript.h, uprops.h, uchar.c, uniset_props.cpp and others, plus cintltst/cucdapi.c & intltest/usettest.cpp
9056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UScript.java, UCharacterProperty.java, UnicodeSet.java, TestUScript.java, UnicodeSetTest.java
9066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* switch ubidi.icu, ucase.icu and uprops.icu from UTrie to UTrie2
9086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- (one-time change)
9096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genbidi/gencase/genprops tools changes
9106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- re-run makeprops.sh (see above)
9116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UCharacterProperty.java, UCharacterTypeIterator.java,
9126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  UBiDiProps.java, UCaseProps.java, and several others with minor changes;
9136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  UCharacterPropertyReader.java deleted and its code folded into UCharacterProperty.java
9146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update Java data files
9166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh just the UCD-related files, just to be safe
9176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- see (ICU4C)/source/data/icu4j-readme.txt
9186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- mkdir /tmp/icu4j
9196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
9206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  output:
9216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ...
9226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Unicode .icu files built to ./out/build/icudt45l
9236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p ./out/icu4j/com/ibm/icu/impl/data/icudt45b
9246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    echo ubidi.icu ucase.icu uprops.icu > ./out/icu4j/add.txt
9256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LD_LIBRARY_PATH=../lib:../stubdata:../tools/ctestfw:$LD_LIBRARY_PATH  ../bin/icupkg ./out/tmp/icudt45l.dat ./out/icu4j/icudt45b.dat -a ./out/icu4j/add.txt -s ./out/build/icudt45l -x '*' -tb -d ./out/icu4j/com/ibm/icu/impl/data/icudt45b
9266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    jar cf ./out/icu4j/icudata.jar -C ./out/icu4j com/ibm/icu/impl/data/icudt45b
9276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/main/shared/data
9286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    cp ./out/icu4j/icudata.jar /tmp/icu4j/main/shared/data
9296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the big-endian Unicode data files to another location,
9306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  separate from the other data files
9316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll
9326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/brkitr
9336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt45b
9346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ rm /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/cnvalias.icu
9356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/*.nrm /tmp/icu4j/com/ibm/icu/impl/data/icudt45b
9366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/coll/*.icu /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll
9376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/brkitr/* /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/brkitr
9386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J
9396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt45b
9406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* refresh Java test .txt files
9426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy new .txt files into ICU4J's main/tests/core/src/com/ibm/icu/dev/data/unicode
9436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* un-hardcode normalization skippable (NF*_Inert) test data
9456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- removes one manual step from the Unicode upgrade, and removes dependency on one of Mark's tools
9466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy updated break iterator test files
9486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- now handled by early ucdcopy.py and
9496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  copying the uni60/processed/testdata files to ~/svn.icu/trunk/src/source/test/testdata
9506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (old instructions:
9516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org   copy from (Unicode 6.0)/ucd/auxiliary/*BreakTest-6....txt
9526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org   to ~/svn.icu/trunk/src/source/test/testdata)
9536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- they are not used in ICU4J
9546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* UCA
9566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- get output from Mark's tools; look in
9586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    http://www.unicode.org/~book/incoming/mark/uca6.0.0/
9596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    http://www.macchiato.com/unicode/utc/additional-uca-files
9606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    http://www.unicode.org/Public/UCA/6.0.0/
9616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    http://www.unicode.org/~mdavis/uca/
9626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/FractionalUCA.txt with FractionalUCA_SHORT.txt
9636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt
9646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update Han-implicit ranges for new CJK extensions:
9656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  swapCJK() in ucol.cpp & ImplicitCEGenerator.java
9666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genuca: allow bytes 02 for U+FFFE, new merge-sort character;
9676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  do not add it into invuca so that tailoring primary-after an ignorable works
9686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- genuca: permit space between [variable top] bytes
9696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucol.cpp: treat noncharacters like unassigned rather than ignorable
9706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run makeuca.sh:
9716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  ~/svn.icu/tools/trunk/src/unicode$ ./makeuca.sh ~/svn.icu/trunk/src ~/svn.icu/trunk/bld
9726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- rebuild ICU4C
9736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- refresh ICU4J collation data:
9746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (subset of instructions above for properties data refresh, except copies all coll/*)
9756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
9766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    mkdir -p /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll
9776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ cp com/ibm/icu/impl/data/icudt45b/coll/* /tmp/icu4j/com/ibm/icu/impl/data/icudt45b/coll
9786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ~/svn.icu/trunk/bld/data/out/icu4j$ jar uf ~/svn.icu4j/trunk/src/main/shared/data/icudata.jar -C /tmp/icu4j com/ibm/icu/impl/data/icudt45b
9796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update (ICU)/source/test/testdata/CollationTest_*.txt
9806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and (ICU4J)/main/tests/collate/src/com/ibm/icu/dev/data/CollationTest_*.txt
9816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  with output from Mark's Unicode tools
9826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments)
9836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then
9846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  utility/MultithreadTest/TestCollators will fail as well;
9856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix the conformance test before looking into the multi-thread test
9866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* When refreshing all of ICU4J data from ICU4C
9886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=/tmp/icu4j icu4j-data-install
9896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- cp /tmp/icu4j/main/shared/data/icudata.jar ~/svn.icu4j/trunk/src/main/shared/data
9906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgor
9916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ~/svn.icu/trunk/bld$ make ICU4J_ROOT=~/svn.icu4j/trunk/src icu4j-data-install
9926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
9946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For details see the Unicode 5.2 change log below.)
9966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
9976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguages.h,
9986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (It also generates
9996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.)
10006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement.
10026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files.
10046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* fix mixed line endings
10056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* review the diffs and fix incorrect @draft and missing aliases;
10066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Unicode-derived script codes should be "born stable" like constants in uchar.h, uscript.h etc.
10076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h
10086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
10106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.2 update
10126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets
10146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7084 Unicode 5.2
10166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7167 verify collation bytes
10186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7235 Java test NAME_ALIAS
10196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7236 Java DerivedCoreProperties.txt test
10206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7237 Java BidiTest.txt
10216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7238 UTrie2 in core unidata
10226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7239 test for tailoring gaps
10236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7240 Java fix CollationMiscTest
10246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org7243 update layout engine for Unicode 5.2
10256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
10276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
10286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
10296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in & configure
10306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update ucdVersion in gennames.c if an algorithmic range changes
10316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
10336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
10356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgpython source\tools\genprops\misc\ucdcopy.py "C:\Documents and Settings\mscherer\My Documents\unicode\ucd\5.2.0" C:\svn\icuproj\icu\trunk\source\data\unidata
10376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- includes finding files regardless of version numbers,
10386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  copying them, and performing the equivalent processing of the
10396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  ucdstrip and ucdmerge tools on the desired set of files
10406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* notes on changes
10426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- PropertyAliases.txt
10436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  moved from numeric to enumerated:
10446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ccc       ; Canonical_Combining_Class
10456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new string properties:
10466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    NFKC_CF   ; NFKC_Casefold
10476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Name_Alias; Name_Alias
10486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new binary properties:
10496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Cased     ; Cased
10506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CI        ; Case_Ignorable
10516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWCF      ; Changes_When_Casefolded
10526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWCM      ; Changes_When_Casemapped
10536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWKCF     ; Changes_When_NFKC_Casefolded
10546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWL       ; Changes_When_Lowercased
10556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWT       ; Changes_When_Titlecased
10566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    CWU       ; Changes_When_Uppercased
10576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new CJK Unihan properties (not supported by ICU)
10586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- PropertyValueAliases.txt
10596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new block names
10606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new scripts
10616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  one script code change:
10626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    sc ; Qaai      ; Inherited
10636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ->
10646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    sc ; Zinh      ; Inherited                        ; Qaai
10656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new Line_Break (lb) value:
10666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    lb ; CP        ; Close_Parenthesis
10676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new Joining_Group (jg) values: Farsi_Yeh, Nya
10686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  other new values:
10696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    ccc; 214; ATA  ; Attached_Above
10706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- DerivedBidiClass.txt
10716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new default-R range: U+1E800 - U+1EFFF
10726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UnicodeData.txt
10736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  all of the ISO comments are gone
10746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new CJK block end:
10756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    9FC3;<CJK Ideograph, Last> -> 9FCB;<CJK Ideograph, Last>
10766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  new CJK block:
10776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    2A700;<CJK Ideograph Extension C, First>;Lo;0;L;;;;;N;;;;;
10786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    2B734;<CJK Ideograph Extension C, Last>;Lo;0;L;;;;;N;;;;;
10796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
10816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl
10826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + cd \svn\icuproj\icu\trunk\source\tools\genpname
10836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
10846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl \svn\icuproj\icu\trunk > out.txt
10856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl complains with errors like the following:
10866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      Error: sc:Egyp already set to Egyptian_Hieroglyphs, cannot set to Egyp at preparse.pl line 1322, <GEN6> line 34.
10876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    This is because ICU 4.0 had scripts from ISO 15924 which are now
10886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    added to Unicode 5.2, and the Perl script shows a conflict between SyntheticPropertyValueAliases.txt
10896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    and PropertyValueAliases.txt.
10906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    -> Removed duplicate script entries from SyntheticPropertyValueAliases.txt:
10916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org       Egyp, Java, Lana, Mtei, Orkh, Armi, Avst, Kthi, Phli, Prti, Samr, Tavt
10926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl complains with errors about block names missing from uchar.h; add them
10936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
10946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops
10956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values
10966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 26 new blocks
10976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    copy new blocks from Blocks.txt
10986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    MS VC++ 2008 regular expression:
10996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      find "^{[0-9A-F]+}\.\.{[0-9A-F]+}; {[A-Z].+}$"
11006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      replace with "    UBLOCK_\3 = 172, /*[\1]*/"
11016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + several new script values already added in ICU 4.0 for ISO 15924 coverage
11026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (removed from SyntheticPropertyValueAliases.txt, see genpname notes above)
11036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 3 new script values added for ISO 15924 and Unicode 5.2 coverage
11046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 1 new script value added for ISO 15924 coverage (not in Unicode 5.2)
11056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (added to SyntheticPropertyValueAliases.txt)
11066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) values: Farsi_Yeh, Nya
11076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Line_Break (lb) value:
11086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    lb ; CP        ; Close_Parenthesis
11096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit
11116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FC3 to 9FCB
11126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  search for both 9FC3 (end) and 9FC4 (limit) (regex 9FC[34], case-insensitive)
11136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change gennames.c
11146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Compare definitions of new binary properties with what we used to use
11166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in algorithms, to see if the definitions changed.
11176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Verified that definitions for Cased and Case_Ignorable are unchanged.
11186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  The gencase tool now parses the newly public Case_Ignorable values
11196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in case the definition changes in the future.
11206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.c & uprops.h & uprops.c & genprops
11226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new numeric values that didn't exist in Unicode data before:
11236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    1/7, 1/9, 1/10, 3/10, 1/16, 3/16
11246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  the ones with denominators >9 cannot be supported by uprops.icu formatVersion 5,
11256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  therefore redesign the encoding of numeric types and values for formatVersion 6;
11266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  design for simple numbers up to at least 144 ("one gross"),
11276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  large values up to at least 10^20,
11286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and fractions with numerators -1..17 and denominators 1..16
11296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  to cover current and expected future values
11306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  (e.g., more Han numeric values, Meroitic twelfths)
11316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* reimplement Hangul_Syllable_Type for new Jamo characters
11336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- the old code assumed that all Jamo characters are in the 11xx block
11346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unicode 5.2 fills holes there and adds new Jamo characters in
11356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    A960..A97F; Hangul Jamo Extended-A
11366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and in
11376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    D7B0..D7FF; Hangul Jamo Extended-B
11386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Hangul_Syllable_Type can be trivially derived from a subset of
11396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Grapheme_Cluster_Break values
11406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data
11426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\svn\icuproj\icu\trunk\source\data>NMAKE /f makedata.mak ICUMAKE=\svn\icuproj\icu\trunk\source\data\ CFG=x86\release uni-core-data
11436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \svn\icuproj\icu\trunk\source\data\
11456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \svn\icuproj\icu\trunk
11466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files.
11476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "brklocal.mk". Not building user-additional break iterator files.
11486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "reslocal.mk". Not building user-additional resource bundle files.
11496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "collocal.mk". Not building user-additional resource bundle files.
11506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "rbnflocal.mk". Not building user-additional resource bundle files.
11516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "trnslocal.mk". Not building user-additional transliterator files.
11526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "misclocal.mk". Not building user-additional miscellaenous files.
11536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "spreplocal.mk". Not building user-additional stringprep files.
11546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Property Names
11556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties
11566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties
11576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties
11586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization
11596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\svn\icuproj\icu\trunk\source\data\out\build\icudt43l"
11606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\svn\icuproj\icu\trunk\source\data\out\tmp"
11616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\svn\icuproj\icu\trunk\source\common
11636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and rebuild the common library
11646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** UCA
11666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update FractionalUCA.txt with new canonical closure (output from Mark's Unicode tools)
11686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/data/unidata/UCARules.txt with UCA_Rules_SHORT.txt from Mark's Unicode tools
11696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update source/test/testdata/CollationTest_*.txt with output from Mark's Unicode tools
11706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org[ Begin obsolete instructions:
11716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Starting with UCA 5.2, we use the CollationTest_*_SHORT.txt files not the *_STUB.txt files.
11726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    - generate the source/test/testdata/CollationTest_*_STUB.txt files via source/tools/genuca/genteststub.py
11736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      on Windows:
11746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org        python C:\svn\icuproj\icu\trunk\source\tools\genuca\genteststub.py CollationTest_NON_IGNORABLE_SHORT.txt CollationTest_NON_IGNORABLE_STUB.txt
11756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org        python C:\svn\icuproj\icu\trunk\source\tools\genuca\genteststub.py CollationTest_SHIFTED_SHORT.txt CollationTest_SHIFTED_STUB.txt
11766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  End obsolete instructions]
11776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run all tests with the *_SHORT.txt or the full files (the full ones have comments)
11786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  not just the *_STUB.txt files
11796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- note on intltest: if collate/UCAConformanceTest fails, then
11806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  utility/MultithreadTest/TestCollators will fail as well;
11816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  fix the conformance test before looking into the multi-thread test
11826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Cased & Case_Ignorable properties
11846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- via UProperty; call ucase.h functions ucase_getType() and ucase_getTypeOrIgnorable()
11856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Problem: These properties should be disjoint, but aren't
11866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UTC 2009nov decision: skip all Case_Ignorable regardless of whether they are Cased or not
11876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- change ucase.icu to be able to store any combination of Cased and Case_Ignorable
11886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Changes_When_Xyz properties
11906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- without stored data
11916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Implement Name_Alias property
11936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add it as another name field in unames.icu
11946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- make it available via u_charName() and UCharNameChoice and
11956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- consider it in u_charFromName()
11966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Break iterators
11986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
11996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update break iterator rules to new UAX versions and new property values
12006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update source/test/testdata/<boundary>Test.txt files from <unicode.org ucd>/ucd/auxiliary
12016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** new BidiTest file
12036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- review format and data
12046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy BidiTest.txt to source/test/testdata
12056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- write test code using this data
12066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix ICU code where it fails the conformance test
12076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Java
12096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- generally, find and update code corresponding to C/C++
12106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UCharacter.UnicodeBlock constants:
12116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  a) add an _ID integer per new block, update COUNT
12126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  b) add a class instance per new block
12136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     Visual Studio regex:
12146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org        find            UBLOCK_{[^ ]+} = [0-9]+, {/.+}
12156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org        replace with    public static final UnicodeBlock \1 = new UnicodeBlock("\1", \1_ID); \2
12166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- CHAR_NAME_ALIAS -> UCharacter.getNameAlias() and getCharFromNameAlias()
12176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- port test changes to Java
12196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
12216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org(For comparison, see the Unicode 5.1 update: http://bugs.icu-project.org/trac/changeset/23833)
12236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguages.h,
12256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (It also generates
12266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.)
12276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement.
12296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org-> Eric Mader wrote in email on 20090930:
12316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    "I think the tool has been modified to update @draft to @stable for
12326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     older scripts and to add @draft for new scripts.
12336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     (I worked with an intern on this last year.)
12346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     You should check the output after you run it."
12356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files.
12376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* fix mixed line endings
12386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* review the diffs and fix incorrect @draft and missing aliases
12396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* manually re-add the "Indic script xyz v.2" tags in ScriptAndLanguageTags.h
12406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp
12426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...)
12436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org-> Eric Mader wrote in email on 20090930:
12456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    "This is just a matter of making sure that all the per-script tables have
12466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     entries for any new scripts that were added.
12476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     If any new Indic characters were added, then the class tables in
12486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     IndicClassTables.cpp should be updated to reflect this.
12496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org     John Emmons should know how to do this if it's required."
12506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries.
12526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Documentation
12546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Update User Guide
12556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Jamo_Short_Name, sfc->scf, binary property value aliases
12566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
12586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.1 update
12606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related ICU Trac tickets
12626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org5696 Update to Unicode 5.1
12646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
12666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
12676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
12686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in & configure
12696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update ucdVersion in gennames.c if an algorithmic range changes
12706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
12726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
12746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip:
12756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedCoreProperties.txt
12766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedNormalizationProps.txt
12776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    NormalizationTest.txt
12786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    PropList.txt
12796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Scripts.txt
12806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    GraphemeBreakProperty.txt
12816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SentenceBreakProperty.txt
12826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WordBreakProperty.txt
12836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge:
12846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    EastAsianWidth.txt
12856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LineBreak.txt
12866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
12876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* my ucd2unidata.bat (needs to be updated each time with UCD and file version numbers)
12886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\BidiMirroring.txt ..\unidata\
12896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\Blocks.txt ..\unidata\
12906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\CaseFolding.txt ..\unidata\
12916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\DerivedAge.txt ..\unidata\
12926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedBidiClass.txt ..\unidata\
12936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedJoiningGroup.txt ..\unidata\
12946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedJoiningType.txt ..\unidata\
12956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\extracted\DerivedNumericValues.txt ..\unidata\
12966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\NormalizationCorrections.txt ..\unidata\
12976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\PropertyAliases.txt ..\unidata\
12986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\PropertyValueAliases.txt ..\unidata\
12996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\SpecialCasing.txt ..\unidata\
13006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.1.0\ucd\UnicodeData.txt ..\unidata\
13016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\DerivedCoreProperties.txt > ..\unidata\DerivedCoreProperties.txt
13036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\DerivedNormalizationProps.txt > ..\unidata\DerivedNormalizationProps.txt
13046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\NormalizationTest.txt > ..\unidata\NormalizationTest.txt
13056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\PropList.txt > ..\unidata\PropList.txt
13066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\Scripts.txt > ..\unidata\Scripts.txt
13076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\GraphemeBreakProperty.txt > ..\unidata\GraphemeBreakProperty.txt
13086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\SentenceBreakProperty.txt > ..\unidata\SentenceBreakProperty.txt
13096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\auxiliary\WordBreakProperty.txt > ..\unidata\WordBreakProperty.txt
13106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\EastAsianWidth.txt | ucdmerge > ..\unidata\EastAsianWidth.txt
13116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.1.0\ucd\LineBreak.txt | ucdmerge > ..\unidata\LineBreak.txt
13126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
13146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl
13156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + cd \svn\icuproj\icu\uni51\source\tools\genpname
13166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
13176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl \svn\icuproj\icu\uni51 > out.txt
13186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + preparse.pl complains with errors like the following:
13196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      Error: sc:Cari already set to Carian, cannot set to Cari at preparse.pl line 1308, <GEN6> line 30.
13206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    This is because ICU 3.8 had scripts from ISO 15924 which are now
13216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    added to Unicode 5.1, and the script shows a conflict between SyntheticPropertyValueAliases.txt
13226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    and PropertyValueAliases.txt.
13236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    -> Removed duplicate script entries from SyntheticPropertyValueAliases.txt:
13246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org       Cari, Cham, Kali, Lepc, Lyci, Lydi, Olck, Rjng, Saur, Sund, Vaii
13256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + PropertyValueAliases.txt now explicitly contains values for boolean properties:
13266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org      N/Y, No/Yes, F/T, False/True
13276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    -> Added N/No and Y/Yes to preparse.pl function read_PropertyValueAliases.
13286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org       It will use further values from the file if present.
13296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops
13316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values
13326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 17 new blocks
13336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 11 new script values already added in ICU 3.8 for ISO 15924 coverage
13346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (removed from SyntheticPropertyValueAliases.txt)
13356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + 14 new script values added for ISO 15924 coverage (not in Unicode 5.1)
13366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    (added to SyntheticPropertyValueAliases.txt)
13376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uprops.icu (uprops.h) only provides 7 bits for script codes.
13386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  In ICU 4.0 there are USCRIPT_CODE_LIMIT=130 script codes now.
13396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  There is none above 127 yet which is the script code for an
13406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  assigned Unicode character, so ICU 4.0 uprops.icu does not store any
13416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  script code values greater than 127.
13426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  However, it does need to store the maximum script value=USCRIPT_CODE_LIMIT-1=129
13436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in a parallel bit field, and that overflows now.
13446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Also, future values >=128 would be incompatible anyway.
13456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  uprops.h is modified to move around several of the bit fields
13466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  in the properties vector words, and now uses 8 bits for the script code.
13476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Two other bit fields also grow to accommodate future growth:
13486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  Block (current count: 172) grows from 8 to 9 bits,
13496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and Word_Break grows from 4 to 5 bits.
13506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- renamed property Simple_Case_Folding (sfc->scf)
13516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + nothing to be done: handled as normal alias
13526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new property JSN Jamo_Short_Name
13536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + no new API: only contributes to the Name property
13546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Grapheme_Cluster_Break (GCB) value: SM=SpacingMark
13556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) value: Burushashki_Yeh_Barree
13566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Sentence_Break (SB) values:
13576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SB ; CR        ; CR
13586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SB ; EX        ; Extend
13596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SB ; LF        ; LF
13606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SB ; SC        ; SContinue
13616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Word_Break (WB) values:
13626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WB ; CR        ; CR
13636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WB ; Extend    ; Extend
13646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WB ; LF        ; LF
13656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WB ; MB        ; MidNumLet
13666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Further changes in the 2008-02-29 update:
13686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Default_Ignorable_Code_Point: The new file removes Cc, Cs, noncharacters from DICP
13696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  because they should not normally be invisible.
13706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Joining Group (JG) value Burushashki_Yeh_Barree was renamed to Burushaski_Yeh_Barree (one 'h' removed)
13716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Grapheme_Cluster_Break (GCB) value: PP=Prepend
13726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new Word_Break (WB) value: NL=Newline
13736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit (see Unicode 4.1 update for comparison)
13756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FBB to 9FC3
13766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  search for both 9FBB (end) and 9FBC (limit) (regex 9FB[BC], case-insensitive)
13776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change gennames.c
13786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data
13806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\svn\icuproj\icu\uni51\source\data>NMAKE /f makedata.mak ICUMAKE=\svn\icuproj\icu\uni51\source\data\ CFG=debug uni-core-data
13816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \svn\icuproj\icu\uni51\source\data\
13836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \svn\icuproj\icu\uni51
13846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files.
13856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "brklocal.mk". Not building user-additional break iterator files.
13866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "reslocal.mk". Not building user-additional resource bundle files.
13876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "collocal.mk". Not building user-additional resource bundle files.
13886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "rbnflocal.mk". Not building user-additional resource bundle files.
13896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "trnslocal.mk". Not building user-additional transliterator files.
13906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "misclocal.mk". Not building user-additional miscellaenous files.
13916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties
13926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties
13936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties
13946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization
13956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\svn\icuproj\icu\uni51\source\data\out\build\icudt39l"
13966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\svn\icuproj\icu\uni51\source\data\out\tmp"
13976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
13986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\svn\icuproj\icu\uni51\source\common
13996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and rebuild the common library
14006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Break iterators
14026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Update break iterator rules to new UAX versions and new property values
14046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** UCA
14066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure
14086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Test suites
14106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Test that APIs using Unicode property value aliases (like UnicodeSet)
14116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  support all of the boolean values N/Y, No/Yes, F/T, False/True
14126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  -> TestBinaryValues() tests in both cintltst and intltest
14136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
14156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguage.h,
14166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (it also generates
14176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.)
14186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement.
14206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files.
14226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp
14246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...)
14256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries.
14276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Documentation
14296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Update User Guide
14306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Jamo_Short_Name, sfc->scf, binary property value aliases
14316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
14336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 5.0 update
14356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs
14376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org5084 RFE: Update to Unicode 5.0
14396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
14416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
14436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip:
14446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedCoreProperties.txt
14456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedNormalizationProps.txt
14466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    NormalizationTest.txt
14476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    PropList.txt
14486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    Scripts.txt
14496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    GraphemeBreakProperty.txt
14506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SentenceBreakProperty.txt
14516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WordBreakProperty.txt
14526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge:
14536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    EastAsianWidth.txt
14546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LineBreak.txt
14556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* my ucd2unidata.bat (needs to be updated each time with UCD and file version numbers)
14576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\BidiMirroring.txt ..\unidata\
14586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\Blocks.txt ..\unidata\
14596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\CaseFolding.txt ..\unidata\
14606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\DerivedAge.txt ..\unidata\
14616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedBidiClass.txt ..\unidata\
14626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedJoiningGroup.txt ..\unidata\
14636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedJoiningType.txt ..\unidata\
14646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\extracted\DerivedNumericValues.txt ..\unidata\
14656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\NormalizationCorrections.txt ..\unidata\
14666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\PropertyAliases.txt ..\unidata\
14676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\PropertyValueAliases.txt ..\unidata\
14686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\SpecialCasing.txt ..\unidata\
14696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgcopy 5.0.0\ucd\UnicodeData.txt ..\unidata\
14706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\DerivedCoreProperties.txt > ..\unidata\DerivedCoreProperties.txt
14726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\DerivedNormalizationProps.txt > ..\unidata\DerivedNormalizationProps.txt
14736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\NormalizationTest.txt > ..\unidata\NormalizationTest.txt
14746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\PropList.txt > ..\unidata\PropList.txt
14756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\Scripts.txt > ..\unidata\Scripts.txt
14766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\GraphemeBreakProperty.txt > ..\unidata\GraphemeBreakProperty.txt
14776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\SentenceBreakProperty.txt > ..\unidata\SentenceBreakProperty.txt
14786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\auxiliary\WordBreakProperty.txt > ..\unidata\WordBreakProperty.txt
14796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\EastAsianWidth.txt | ucdmerge > ..\unidata\EastAsianWidth.txt
14806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgucdstrip < 5.0.0\ucd\LineBreak.txt | ucdmerge > ..\unidata\LineBreak.txt
14816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure
14836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
14856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl
14866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + make sure that data.h is writable
14876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + perl preparse.pl \cvs\oss\icu > out.txt
14886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops
14906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script values
14916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + script values already added in ICU 3.6 because all of ISO 15924 is now covered
14926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* build Unicode data source code for hardcoding core data
14946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgC:\cvs\oss\icu\source\data>NMAKE /f makedata.mak ICUMAKE=\cvs\oss\icu\source\data\ CFG=debug uni-core-data
14956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
14966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU data make path is \cvs\oss\icu\source\data\
14976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgICU root path is \cvs\oss\icu
14986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgInformation: cannot find "ucmlocal.mk". Not building user-additional converter files.
14996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org[etc.]
15006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Character Properties
15016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Case Mapping Properties
15026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode BiDi/Shaping Properties
15036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgCreating data file for Unicode Normalization
15046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .icu files built to "\cvs\oss\icu\source\data\out\build\icudt35l"
15056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode .c source files built to "\cvs\oss\icu\source\data\out\tmp"
15066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- copy the .c source files to C:\cvs\oss\icu\source\common
15086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  and rebuild the common library
15096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
15116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
15126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
15136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in
15146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** LayoutEngine script information
15166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* Run ICU4J com.ibm.icu.dev.tool.layout.ScriptNameBuilder. This generates LEScripts.h, LELanguage.h,
15176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptAndLanguageTags.h and ScriptAndLanguageTags.cpp in the working directory. (it also generates
15186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgScriptRunData.cpp, which is no longer needed.)
15196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgThe generated files have a current copyright date and "@draft" statement.
15216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* copy the above files into <icu>/source/layout, replacing the old files.
15236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgAdd new default entries to the indicClassTables array in <icu>/source/layout/IndicClassTables.cpp
15256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.organd the complexTable array in <icu>/source/layoutex/ParagraphLayout.cpp. (This step should be automated...)
15266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* rebuild the layout and layoutex libraries.
15286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
15306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 4.1 update
15326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs
15346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org4332 RFE: Update to Unicode 4.1
15366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org4157 RBBI, TR29 4.1 updates
15376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
15396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
15416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip:
15426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedCoreProperties.txt
15436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    DerivedNormalizationProps.txt
15446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    NormalizationTest.txt
15456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    GraphemeBreakProperty.txt
15466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SentenceBreakProperty.txt
15476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WordBreakProperty.txt
15486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge:
15496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    EastAsianWidth.txt
15506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    LineBreak.txt
15516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* add new files to the repository
15536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    GraphemeBreakProperty.txt
15546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    SentenceBreakProperty.txt
15556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    WordBreakProperty.txt
15566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* update FractionalUCA.txt and UCARules.txt with new canonical closure
15586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
15606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- handle new enumerated properties in sub read_uchar
15616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- run preparse.pl
15626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uscript.h & uprops.h & uprops.c & genprops
15646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new binary properties
15656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Pattern_Syntax
15666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Pattern_White_Space
15676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new enumerated properties
15686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Grapheme_Cluster_Break
15696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Sentence_Break
15706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Word_Break
15716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new block & script & line break values
15726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* gencase
15746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- case-ignorable changes
15756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  see http://www.unicode.org/versions/Unicode4.1.0/#CaseMods
15766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  now: (D47a) Word_Break=MidLetter or Mn, Me, Cf, Lm, Sk
15776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
15796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
15806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
15816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in
15826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** tests
15846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- verify that u_charMirror() round-trips
15856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- test all new properties and some new values of old properties
15866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other code
15886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
15896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* hardcoded Unihan range end/limit
15906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Unihan range end moves from 9FA5 to 9FBB
15916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  search for both 9FA5 (end) and 9FA6 (limit) (regex 9FA[56], case-insensitive)
15926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do not modify BOCU/BOCSU code because that would change the encoding
15936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    and break binary compatibility!
15946f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + similarly, do not change the GB 18030 range data (ucnvmbcs.c),
15956f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    NamePrepProfile.txt
15966f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + ignore trietest.c: test data is arbitrary
15976f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + ignore tstnorm.cpp: test optimization, not important
15986f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + ignore collation: 9FA[56] only appears in comments; swapCJK() uses the whole block up to 9FFF
15996f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change line_th.txt and word_th.txt
16006f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    by replacing hardcoded ranges with the new property values
16016f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + do change gennames.c
16026f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16036f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\data\brkitr\line_th.txt(229):        \u33E0-\u33FE \u3400-\u4DB5 \u4E00-\u9FA5 \uA000-\uA48C \uA490-\uA4C6
16046f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\data\brkitr\word_th.txt(23):        \u33E0-\u33FE \u3400-\u4DB5 \u4E00-\u9FA5 \uA000-\uA48C \uA490-\uA4C6
16056f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgsource\tools\gennames\gennames.c(971):        0x4e00, 0x9fa5,
16066f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16076f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* case mappings
16086f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- compare new special casing context conditions with previous ones
16096f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  see http://www.unicode.org/versions/Unicode4.1.0/#CaseMods
16106f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16116f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
16126f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- consider storing only the short name if it is the same as the long name
16136f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16146f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other reviews
16156f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UAX #29 changes (grapheme/word/sentence breaks)
16166f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- UAX #14 changes (line breaks)
16176f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- Pattern_Syntax & Pattern_White_Space
16186f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16196f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org---------------------------------------------------------------------------- ***
16206f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16216f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.orgUnicode 4.0.1 update
16226f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16236f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** related Jitterbugs
16246f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16256f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3170 RFE: Update to Unicode 4.0.1
16266f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3171 Add new Unicode 4.0.1 properties
16276f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org3520 use Unicode 4.0.1 updates for break iteration
16286f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16296f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** data files & enums & parser code
16306f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16316f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file preparation
16326f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip: DerivedNormalizationProps.txt, NormalizationTest.txt, DerivedCoreProperties.txt
16336f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- ucdstrip and ucdmerge: EastAsianWidth.txt, LineBreak.txt
16346f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16356f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* file fixes
16366f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix UnicodeData.txt general categories of Ethiopic digits Nd->No
16376f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  according to PRI #26
16386f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  http://www.unicode.org/review/resolved-pri.html#pri26
16396f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- undone again because no corrigendum in sight;
16406f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  instead modified tests to not check consistency on this for Unicode 4.0.1
16416f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16426f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* ucdterms.txt
16436f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update from http://www.unicode.org/copyright.html
16446f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  formatted for plain text
16456f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16466f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uchar.h & uprops.h & uprops.c & genprops
16476f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add UBLOCK_CYRILLIC_SUPPLEMENT because the block is renamed
16486f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- add U_LB_INSEPARABLE due to a spelling fix
16496f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + put short name comment only on line with new constant
16506f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    for genpname perl script parser
16516f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new binary properties
16526f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + STerm
16536f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + Variation_Selector
16546f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16556f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genpname
16566f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- fix genpname perl script so that it doesn't choke on more than 2 names per property value
16576f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- perl script: correctly calculate the maximum number of fields per row
16586f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16596f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* uscript.h
16606f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- new script code Hrkt=Katakana_Or_Hiragana
16616f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16626f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* gennorm.c track changes in DerivedNormalizationProps.txt
16636f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- "FNC" -> "FC_NFKC"
16646f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- single field "NFD_NO" -> two fields "NFD_QC; N" etc.
16656f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16666f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* genprops/props2.c track changes in DerivedNumericValues.txt
16676f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- changed from 3 columns to 2, dropping the numeric type
16686f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  + assume that the type is always numeric for Han characters,
16696f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org    and that only those are added in addition to what UnicodeData.txt lists
16706f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16716f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** Unicode version numbers
16726f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- makedata.mak
16736f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- uchar.h
16746f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- configure.in
16756f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16766f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** tests
16776f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- update test of default bidi classes according to PRI #28
16786f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  /tsutil/cucdtst/TestUnicodeData
16796f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org  http://www.unicode.org/review/resolved-pri.html#pri28
16806f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- bidi tests: change exemplar character for ES depending on Unicode version
16816f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- change hardcoded expected property values where they change
16826f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16836f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org*** other code
16846f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16856f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* name matching
16866f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- read UCD.html
16876f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16886f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* scripts
16896f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- use new Hrkt=Katakana_Or_Hiragana
16906f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org
16916f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org* ZWJ & ZWNJ
16926f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- are now part of combining character sequences
16936f31ac30b9092fd02a8c97e5216cf53f3e4fae4jshin@chromium.org- break iteration used to assume that LB classes did not overlap; now they do for ZWJ & ZWNJ
1694