History log of /external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
Revision Date Author Comments (<<< Hide modified files) (Show modified files >>>)
e8b5d64039614ecce472eda1a7a603736da25d86 29-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Do NOT allow reph formation on placeholders

Only allow it on DOTTED CIRCLE. No effect on test numbers.

Test: U+0930,094D,00A0
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
cf78dd483cbe1759a8ecb731879e041a53ba9bb3 27-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic/myanmar] Rename OT_NBSP to OT_PLACEHOLDER
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
ddbdfcbf1c10eed0a7b81b29fee99f1bd22113e5 23-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Simplify grammar

No functional change.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
4e9b1f662b23966e67c548b86afeff2bec9b0eb7 23-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Always start new syllable for Avagraha

In fact, the previous grammar was ambigious. No functional
change.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9f9bd9bf31161660214b8b39a78cdafbb79db1be 23-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Rename avagraha cluster to symbol cluster

In anticipation of adding more characters to that class of clusters.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
e848bfae7c975a6fae434daf8e3db4d69914df9f 23-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Recategorize U+A8E0..A8F1 as OT_VD

Up to two of them come after all OT_A characters.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
c519536c34c842304da558dd4a9e3844fc261b20 23-May-2014 Behdad Esfahbod <behdad@behdad.org> [indic] Allow up to three tone marks

According to Roozbeh, there are valid combinations in Unicode
proposals for up to three. Previously we were allowing up to two.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
a74f0de225c403998212e2618dcf9452bc5b590d 24-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Fix CM2, really

Followup from 6e613f3365bf4e9fd778758c53e7de00c64beca1.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
c16012e9019ec59c2200a3cc29b8a37ea70eda70 18-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Add Javanese support!

Seems to be working just fine!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
1a7de1ba9876b0554c758acbc6459366d9d98a5d 16-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Improve Avagraha support in machine
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
3756efaf4e14ec3b5b1def700a1b5985f162372b 16-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Misc harmless fixes!

First, we were abusing OT_VD instead of OT_A. Fix that
but moving OT_A in the grammar where it belongs (which
is different from what the spec says).

Also, only allow medial consonants after all other
consonants. This doesn't affect any current character.

Finally, fix Halant attachment in presence of medial
consonants. Again, this currently doesn't affect any
sequence.

I lied. There's Gurmukhi U+0A75 which is Consonant_Medial.
Uniscribe allows one of those in each of these positions:
before matras, after matras and before syllable modifiers,
and after syllable modifiers! We currently just allow
unlimited numbers of it, before matras.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
5c558877da5db8c734ba072f01e5e4797876619c 16-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Allow up to two syllable modifiers

Bug 70509 - Candrabindu+Visarga doesn't work in Devanagari
https://bugs.freedesktop.org/show_bug.cgi?id=70509

We categorize both bindus and visarga as syllable-modifiers.
OT spec doesn't actually say what characters go in the syllable
modifier category, and allows one. We just allow up to two now.

Test case: U+0930,U+0941,U+0901,U+0903

Uniscribe currently doesn't support that and produces a
dotted circle.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
3c7b3641cfb00f2c4dcc0768b9854e4f4410d15f 15-Oct-2013 Behdad Esfahbod <behdad@behdad.org> [indic] Handle Avagraha

It can come either at the end(ish!) of the syllable, or independently.
When independent, it accepts a few bits and pieces.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
e7562f53fe6a506d2c6d59d6688e4fa468bba462 14-Feb-2013 Behdad Esfahbod <behdad@behdad.org> Fix compile warnings for ragel-generated machines
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
596740db04e7c1dadae0d8be6e401089fcaffc2d 22-Dec-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Insert dottedcircle after a lone Malayalam dot-reph
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
0beb66e3a61ae8bb1fa66e54b1ff1abb2f8711e9 06-Dec-2012 Behdad Esfahbod <behdad@behdad.org> Fix warnings
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9cac1338c4bc3e9034cbfa104291ed0329debefe 13-Nov-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow Consonant_Medial's after Consonant's

Mostly affects Myanmar, but also Tai Tham, Javanese, and Cham. The
latter three are untested (no fonts!).
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
166b5cf7ec2d37fb54a909c437ccdec1edaf37aa 07-Sep-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Find syllables before any features are applied

With FreeSerif, it seems that the 'ccmp' feature does ligature
substituttions. That was then causing syllable match failures. We now
find syllables before any features have been applied.

Test sequence: U+0D9A,U+0DCA,U+200D,U+0DBB,U+0DCF
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
f0b8ed1b6dd9f1d2b9084c101a6fc5dee0cc22a8 05-Sep-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow "H,ZWJ,M"

Uniscribe accepts a Halant,ZWJ before matras. Allow that.

BENGALI down from 295 to 291
DEVANAGARI down from 69 to 57
GUJARATI down from 19 to 17
KANNADA down from 871 to 867
MALAYALAM down from 340 to 337
TELUGU down from 20 to 16

Currently at:

BENGALI: 353897 out of 354188 tests passed. 291 failed (0.0821598%)
DEVANAGARI: 707337 out of 707394 tests passed. 57 failed (0.00805774%)
GUJARATI: 366440 out of 366457 tests passed. 17 failed (0.00463902%)
GURMUKHI: 60704 out of 60747 tests passed. 43 failed (0.0707854%)
KANNADA: 951046 out of 951913 tests passed. 867 failed (0.0910798%)
KHMER: 299077 out of 299124 tests passed. 47 failed (0.0157125%)
LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%)
MALAYALAM: 1047997 out of 1048334 tests passed. 337 failed (0.0321462%)
ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%)
SINHALA: 271666 out of 271847 tests passed. 181 failed (0.0665816%)
TAMIL: 1091754 out of 1091754 tests passed. 0 failed (0%)
TELUGU: 970557 out of 970573 tests passed. 16 failed (0.00164851%)
TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
4ed717ef61813fa16cf74f2874848e9feb81568f 05-Sep-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Relax grammar

Now that we insert dotted-circle, tests break more easily when our indic
machine breaks.

In particular, a few Devanagari tests were having sequences like
"C,H,ZWJ,N", and because of the ZWJ the Nukta does NOT get reordered to
before the Halant as the grammar used to expect... Fixup.

Another case is as simple as "C,ZWJ,SM".

Fixes 10 out of 79 failures:

DEVANAGARI: 707325 out of 707394 tests passed. 69 failed (0.00975411%)
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
aa7141efe49991a1160489106984e95163fe2ab8 05-Sep-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Fix Khmer syllable-final coeng-consonant

Brings down Khmer failures from 162 to 47.

KHMER: 299077 out of 299124 tests passed. 47 failed (0.0157125%)

Also rebaselined some of the test files that had only-inherited lines.
Removing those, the stats are:

BENGALI: 353893 out of 354188 tests passed. 295 failed (0.0832891%)
DEVANAGARI: 707315 out of 707394 tests passed. 79 failed (0.0111678%)
GUJARATI: 366438 out of 366457 tests passed. 19 failed (0.00518478%)
GURMUKHI: 60704 out of 60747 tests passed. 43 failed (0.0707854%)
KANNADA: 951042 out of 951913 tests passed. 871 failed (0.0915%)
KHMER: 299077 out of 299124 tests passed. 47 failed (0.0157125%)
LAO: 53611 out of 53644 tests passed. 33 failed (0.0615167%)
MALAYALAM: 1047994 out of 1048334 tests passed. 340 failed (0.0324324%)
ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%)
SINHALA: 271666 out of 271847 tests passed. 181 failed (0.0665816%)
TAMIL: 1091754 out of 1091754 tests passed. 0 failed (0%)
TELUGU: 970553 out of 970573 tests passed. 20 failed (0.00206064%)
TIBETAN: 208469 out of 208469 tests passed. 0 failed (0%)

Still some regressions, but some of the more egregious cases are
addressed.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
b85800f9de8976a7418ef9df467d3080c6ab0199 01-Sep-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Implement dotted-circle insertion for broken clusters

No panic, we reeally insert dotted circle when it's absolutely broken.

Fixes most of the dotted-circle cases against Uniscribe. (for Devanagari
fixes 80% of them, for Khmer 70%; the rest look like Uniscribe being
really bogus...)

I had to make a decision. Apparently Uniscribe adds one dotted circle
to each broken character. I tried that, but that goes wrong easily with
split matras. So I made it add only one dotted circle to an entire
broken syllable tail. As in: "if there was a dotted circle here, this
would have formed a correct cluster." That works better for split
stuff, and I like it more.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
327d14ef188396006d54af976506ab6f8bb2869a 31-Aug-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Start adding dotted-circle instrastructure
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
85fc6c483f6d734febbe39270e84701a651f01f1 02-Aug-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Move more stuff to the shape_plan

Almost done. Need to add per-script static tables.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
8bb5deba9630d35878eb6edb4643ecfabf99f15f 02-Aug-2012 Behdad Esfahbod <behdad@behdad.org> [OT] Pipe shape_plan down to pause_callbacks
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
dff0ece11d61978c04e839501f179a5c3077f340 24-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Limit matras to 4 per syllable

Also limit joiners.

This limits our syllable length to a constant, and is
closer to what Uniscribe does anyway.

Two Devanagari tests regressed, but who cares about tests with 20
joiners in a row?! Devanagari at 57 (0.00821766%) now.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
73d71cc527d28fd5519c5d965c272ea1fb149a0e 24-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] End Vowel-based syllable at ZWJ

One Devanagari test regressed, plus 10 Malayalam (at 1545 now).

Fixed 120 Sinhala failures. Now at 208 (0.0765136%).
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9fa052733eb93a3ce1205f63ff8f74cb295cbe99 24-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Limit syllables to at most five consonants

Seems to be about what Uniscribe does. Not exactly. But close enough.
More consonants will start a new cluster.

A few scripts went way down in failures. In particular:

- Devanagari failures went down from 490 to 56.
- Telugu went down from 113 to 49.

Other scripts went down slightly or didn't change. New numbers:

BENGALI: 353908 out of 354285 tests passed. 377 failed (0.106412%)
DEVANAGARI: 693572 out of 693628 tests passed. 56 failed (0.00807349%)
GUJARATI: 366485 out of 366506 tests passed. 21 failed (0.00572978%)
GURMUKHI: 60750 out of 60809 tests passed. 59 failed (0.0970251%)
KANNADA: 950730 out of 951913 tests passed. 1183 failed (0.124276%)
KHMER: 298613 out of 299124 tests passed. 511 failed (0.170832%)
MALAYALAM: 1046881 out of 1048416 tests passed. 1535 failed (0.146411%)
ORIYA: 42320 out of 42329 tests passed. 9 failed (0.021262%)
SINHALA: 271333 out of 271847 tests passed. 514 failed (0.189077%)
TAMIL: 1091837 out of 1091837 tests passed. 0 failed (0%)
TELUGU: 970524 out of 970573 tests passed. 49 failed (0.00504856%)

Some of the remaining Telugu and Devanagari issues seem to be Uniscribe
eating Anusvara when placed before a non-joiner. Ouch!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
5791f329159c9863317e2b507514c29321be31a7 20-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow a ZWNJ after SM's

Malayalam failures go way down. Other scripts benefitted slightly too.
Sinhala had one or two test regressions, but...
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9e4f94a72cea6d65a6a7ba5a47db92e00dbfbb91 20-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Break syllables at Halant,ZWNJ

That's really what Uniscribe does, and explains a lot of pecularities of
Halant,ZWNJ before the base.

Sent Telugu from 1% failures to 0.03%. Improved Kannada and Malayalam
slightly. Fixed half of Bengali, and did NOT break anything!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
422ecd2d3c198a36d07d409341cb82ea57c7ad6b 19-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Accept a forced Rakar sequence at the end of syllable

In Sinhala, Rakar is formed by Al-Lakuna,ZWJ,Ra. If you put that at the
end of a Consonant,Matra syllable, you get a dotted-circle from
Uniscribe. Apparently adding a ZWJ before the Al-Lakuna "fixes" that.
And people have been encoding that sequence... So, allow a forced
"ZWJ,Virama,ZWJ,Ra" sequence at the of syllables.

Fixes some 100 or more of Sinhala failures. Now at 622 only (0.23%).
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
6fc1732003d71cf90d37247482772c3da884687f 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow joiners on both sides of Halant at the same time

The sequence <ZWJ,Al-Lakuna,ZWJ> is used in Sinhala to explicitly ask
for Rakar. Fixes two-thousand Sinhala tests. Not many left.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
552d19b7a11f7dff888587fce4d56d9f8e47e819 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Treat Register Shifters like Nukta

Really this time.

Fixes another 18 Khmer tests.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
dcb527242b1eca4db1e190a7802f9cd132aaf46e 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow joiners before matras

Fixes 1 more Devanagari test!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
391cc0331749e263bdfe83a8f5f6d76f2360ee7a 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow halant group in Vowel and placeholder syllables

Fixes 2 out of 560 Devanagari failures. AND:
Fixes 1 out of 2 Tamil failures.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
ca4e3d3eab7b738c2b8e2a81696a28bca1b81495 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Streamline halant/joiner in grammar
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
418d00dffddd95a1f27e9be15752d494c627d45e 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Minor
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
4c3691d2a32ca7e54a54f7c08098fd96fa7af39e 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Hopefully minor!

Refactoring Indic machin. No semantic change.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
db8981f1e0e8625714568c6d0f11f0b317b11d0a 18-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Position Khmer Robat

It's a visual Repha.

Still not positioning logical Repha as occurs in Malayalam.

Another 200 Khmer failures fixed. 547 to go. That's better than
Devanagari!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
25bc489498ef7d0beb8fe9ab663e3f0b2f52c9c2 17-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Better categorize Register Shifters and Khmer Various signs

Down another 500 or so Khmer failures!
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
34b57149065d96f7528aaccaa7654e956ce27e93 17-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Treat Khmer Register Shifters more like Nuktas

Except that there may be a ZWNJ before a Register Shifter.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
11e2a601b19861b05dbb2051d2d078c3cfd75b29 17-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Minor
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
c50ed71e9a3df1844f564de66d54b46a696c1356 17-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Recategorize Khmer coeng sign as a separate category OT_Coeng

Amend the syllable structure to allow a final subscripted consonant
(Coeng+C) and a final subscripted independent vowel (Coeng+V).
Fixes another 2k of Khmer failures.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
deb521dee4fdca8c2124cfb39a205e6269d4a70d 17-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Add a separate Coeng class

No characters recategorized yet. No semantic change.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
7d09c98a1fff97127e48eae48d380dc9fcff288e 16-Jul-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Recognizer Register Shifter marks

Fixes another 6% of the Khmer failures.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
a98d0ab18624501ee60551304f2715361ac643da 13-Jul-2012 Behdad Esfahbod <behdad@behdad.org> Make sure HB_BEGIN_DECLS / HB_END_DECLS is only used in public headers

So we can use them to switch default visibility to internal if desired,
and use these to make only declared symbols public.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
27aba594c90b4444c35273a38f5fedc8e09d9a88 24-May-2012 Behdad Esfahbod <behdad@behdad.org> Minor
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
18c06e189bd078affbb84c3bb5bb80687a227c5e 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Add Uniscribe bug feature for dotted circle

For dotted-circle independent clusters, Uniscribe does no Reph shaping
for the exact sequence Ra+Halant+25CC. Which also is the only possible
sequence with 25CC at the end.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9c09928989316e2befe00d52ed66e055637ccd36 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow multiple Consonants in Vowel/NBSP syllables

Uniscribe allows multiple Halant+Consonant after a Vowel.
Tests:
↦ * U+0905,U+094D,U+092B,U+094D,930,94d,930
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
8c0aa486f31e9b6cbb31ce295573b53b0a214124 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Allow two Nuktas per consonant

Uniscribe allows up to two nuktas per consonant and one per matra. It does so
indepent of whether the consonant already has a nukta in it. Tests:

* U+0916,U+093C,U+0941
* U+0959,U+093C,U+0941
* U+0916,U+093C,U+093C,U+0941
* U+0959,U+093C,U+093C,U+0941
* U+0916,U+093C,U+093C,U+093C,U+0941
* U+0959,U+093C,U+093C,U+093C,U+0941
* 915,93c,93c,,94d,U+0916,U+093C,U+093C,U+093e,93c,93c
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
3399a06e7033651ee926448737bdb18e553c1796 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Fix U+0952 and similar classification to match Uniscribe

See comments.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
ff24d1081af08a887895975285d7e38f5d07bc37 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Don't use syllable serial value 0
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
4be46bade26faf13f7b4d447e9cc608e183955dc 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Fix state machine to backtrack
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
cee7187447b76b22e1bb6136d137b35ac49c3a5d 11-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Move syllable tracking from Indic to generic layer

This is to incorporate it into GSUB/GPOS processing.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
86e5dd386a7989701da476db89be268e4ac1e219 09-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Don't give up syllable parsing upon junk
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
ef24cc8c8e2478a6352c340f4611a617646de4cc 09-May-2012 Behdad Esfahbod <behdad@behdad.org> [Indic] Towards multi-cluster syllables and final reordering
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
9ceca3aeb14cc096f5f87660cf7351bc35073084 17-Apr-2012 Behdad Esfahbod <behdad@behdad.org> Fix ragel regexp in vowel-based syllable

As reported by datao zhang on the mailing list.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
b870afcd1b436614af95db6dc297e54c8f03f0cd 17-Apr-2012 Behdad Esfahbod <behdad@behdad.org> Rewrite ragel expression to better match the one on MS spec

https://www.microsoft.com/typography/otfntdev/devanot/shaping.aspx
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
d4cc44716c1e098f8abbc0e495404598026ef242 08-Apr-2012 Behdad Esfahbod <behdad@behdad.org> Move code around, in prep for Thai/Lao shaper
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
461b9b6347e4f58589f5be82c40a2df61da2c715 02-Mar-2012 Behdad Esfahbod <behdad@behdad.org> Fix cluster formation in Indic

Makes number of failures against Uniscribe with hi_IN dictionary from
OO.o to go down from 6334 to 4290. Not bad for a one-line change!

Mozilla Bug 729626 - ASAN: heap-buffer-overflow HTML
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
743807a3ce1b2229e5307a8aea074a7544623d8d 29-Jul-2011 Behdad Esfahbod <behdad@behdad.org> [Indic] Apply Indic features

Find the base consonant and apply basic Indic features accordingly.
Nothing complete, but does something for now. Specifically:
no Ra handling right now, and no ZWJ/ZWNJ.

Number of failing shape-complex tests goes from 174 down to 125.

Next: reorder matras.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
76f76812ac7cca8ac6935952a2360d5e151480fa 08-Jul-2011 Behdad Esfahbod <behdad@behdad.org> Shuffle code around, remove shape_plan from complex shapers
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
d69d5ceaa0ad30e8d4b9783507c59c6d4221de4f 04-Jul-2011 Behdad Esfahbod <behdad@behdad.org> [Indic] Well, at least finding syllables works now :)

Still not much there.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
c7fe56a1d5d3e969b6ec51cd9ecd471706a19568 25-Jun-2011 Behdad Esfahbod <behdad@behdad.org> [Indic] Some of the basic features are global; Mark them so
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl
867361c3ad39629a8d5b7dc48d558a1c19e37d43 18-Jun-2011 Behdad Esfahbod <behdad@behdad.org> [indic] Add syllable recognition state machine

Using an incredible tool called Ragel.
/external/harfbuzz_ng/src/hb-ot-shape-complex-indic-machine.rl