History log of /packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
Revision Date Author Comments (<<< Hide modified files) (Show modified files >>>)
d03e065b861bd1cdb90bc70f18b7d7a3815b7a4d 18-Oct-2012 Jean Chalard <jchalard@google.com> Don't consider non-words for split word suggestion

Bug: 7368683
Change-Id: Iecd8348be788e5a749dafabdf7eddc4a14d901e0
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
1cd7ca991961937c1a84572a6cafa3eaf5181be4 14-Sep-2012 Keisuke Kuroyanagi <ksk@google.com> Using isSkippableChar instead of '\'' and '-'.

Change-Id: I295a72b683b2ebc28a006baa66baf7e1b97100eb
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
7914e907b5d31ec4b2034a94e393129833770531 07-Sep-2012 Ken Wakasa <kwakasa@google.com> Add comments for meaningless lines

Change-Id: I20ebe246f6980fd3054c2c7e98b5150eaabbcc77
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
18ebba3a665906e17c77f532723a2ebf5f3d2f38 06-Sep-2012 Jean Chalard <jchalard@google.com> Fix one-off bugs reported by Valgrind

Bug: 7108990
Change-Id: I40ba30f50a26b65bcac905fc005ad6bb9cb034cc
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
f2789819bd005b5b0581e8439601b5501306327d 03-Sep-2012 Ken Wakasa <kwakasa@google.com> Cosmetic fixes and a bug fix in UnigramDictionary::testCharGroupForContinuedLikeness().

This change has actually been extracted from a change work in progress I4fe423834b8131fb122251892c98228a6e08ba25

Change-Id: I52568fa09da2ea22be7f8bfe9676b7cd73c31fa4
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
72b1c9394105b6fbc0d8c6ff00f3574ee37a9aaa 31-Aug-2012 Jean Chalard <jchalard@google.com> Reinstate the shortcut-only attribute

Also add the blacklist attribute

Bug: 7005742
Bug: 2704000
Change-Id: Icbe60bdf25bfb098d9e3f20870be30d6aef07c9d
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
687a244703a02323ebd64433cbaead5def499861 23-Aug-2012 Satoshi Kataoka <satok@google.com> Step 38-A Cleanup touch path

Change-Id: I9bfac98b9ac10f3812290f21673c4002241eae01
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
8af8c15c6ba370ef3c0e9083e5229b4080ea288f 17-Aug-2012 Jean Chalard <jchalard@google.com> Allow one-character suggestions.

Bug: 6906525
Change-Id: I62117653ac1281f825afc4ebb30f7e786902b750
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
b14fc88e482e53ba6852c8d5da5d9826c68d041f 10-Aug-2012 Jean Chalard <jchalard@google.com> Tag the whitelisted entries in native code.

Since this is already used in Java land, this actually does
activate the whitelist path, and the code is now fully
functional. We still have to remove the old whitelist resource
and to compile the dictionary that includes the whitelist.

Bug: 6906525
Change-Id: Iacde5313e303b9ed792940efaf6bcfa4ee1317bd
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
926ef06a1ef8a0f8e01baa26c23172b93012d2b1 10-Aug-2012 Jean Chalard <jchalard@google.com> Pass the type from native code all the way to Java.

Bug: 6906525
Change-Id: I057390d47a223450e22d8338509e22c28fc0d5f6
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
77e8e81ad95cfc1eb8f8407fc872674b8d08bbe9 02-Aug-2012 Ken Wakasa <kwakasa@google.com> Header cleanup. Moved a couple of functions from .h to .cpp.

Change-Id: Ifd12a7632f75395bd0ef5e394d5c2abd6cbe28c6
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
195605084ed156b58f0bae002f121d98c1ace867 31-Jul-2012 Jean Chalard <jchalard@google.com> Move flags belonging to BinaryFormat to the right place.

These masks and flags are constants that are an integral part
of the format. They belong in BinaryFormat and have nothing to
do in UnigramDictionary.
This needs I6751dda4 to not break the build

Bug: 6429243
Change-Id: Ic1c842b3245f7fdc25aa8d1459c5bb07b262e265
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
f1008c550168e50f930ea1e043000b395ce0f129 31-Jul-2012 Ken Wakasa <kwakasa@google.com> Cleanup headers

Change-Id: I9124366b2000cad54966fb6936bb4691bbf115a3
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
5460ea389d83722ac98abaef8a2bb9900fb571e7 30-Jul-2012 Ken Wakasa <kwakasa@google.com> Misc cleanups

Change-Id: I28308f12c3064299acefc346b72279036c3726a7
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
0bbb917d12358e0264796e75dea888f244761b64 25-Jul-2012 Ken Wakasa <kwakasa@google.com> Cosmetic fixes and style fixes

Change-Id: I69c42ff945cdf0d5205c6ca61d6861a0479492dc
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
6931df9c17aaeb04288f937cabf956c1b9eb0cc9 11-Jul-2012 Jean Chalard <jchalard@google.com> Pass an array to output suggestion types (A119a)

This needs the matching A119b change to not break the build.
The array is passed, but not used yet.

Bug: 6166228
Change-Id: Ia91d658461d989ee8c94e9b31bb06f4a36f4c5b6
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
e9a86e2cdb58dd8d5601138294521e966d164520 28-Jun-2012 Jean Chalard <jchalard@google.com> Search bigrams for the lower case version of the word (A46)

...if there aren't any for the exact case version.

Bug: 6752830
Change-Id: I2737148b01ba04a64febe009ceb2ef53c265d224
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
b1ed1d47acb366706df0ccc77711df92eff4f51e 15-Jun-2012 satok <satok@google.com> Make native bigram dictionary const

Change-Id: Id883f2d69483c9234877ad42446a582258e1cf91
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
1bc038c5e40bd0fa7a44331a569abe3ae88f0152 14-Jun-2012 satok <satok@google.com> Move correction state to stack memory

*Before
(0) 13.18 (0.01%)
(1) 93025.41 (62.06%)
(2) 10.75 (0.01%)
(3) 10.50 (0.01%)
(4) 117.50 (0.08%)
(5) 55678.98 (37.14%)
(6) 9.09 (0.01%)
(20) 883.84 (0.59%)
Total 149898.24 (sum of others 149749.25)

*After
(0) 17.41 (0.01%)
(1) 92673.41 (61.95%)
(2) 10.62 (0.01%)
(3) 10.37 (0.01%)
(4) 120.96 (0.08%)
(5) 55741.18 (37.26%)
(6) 11.01 (0.01%)
(20) 862.72 (0.58%)
Total 149595.52 (sum of others 149447.68)

Change-Id: Ia5a25a544fc388e4dab1e08d8f78d5117b249cf3
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
6bc051d29171a4c7ff9e19ad7363adb1b567bc78 08-Jun-2012 Satoshi Kataoka <satok@google.com> Refactor step3: const unigram_dictionary

- make all functions in unigram_dictionary const

Change-Id: I94a64865aa793435e6c81890e68d999f6374074e
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
4a3db7057f77dc85311fb1f94934b5a004ab613e 08-Jun-2012 Satoshi Kataoka <satok@google.com> refactor native step 2

- proximity_info is now stateless

Change-Id: I62725bfe05b161fa8fc050ea6b50867e10a354e2
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
3e8c58f68d53e6cc9dbf59201c7bdfb8ad04a1cd 05-Jun-2012 Satoshi Kataoka <satok@google.com> Separate state from proximity_info step1

Bug: 6548943

Change-Id: I7b4fbe20615a28151a74875be43b9f4a20e0dce9
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
02963c40326af954f4ea351e4f00fc83e2356d68 31-May-2012 Satoshi Kataoka <satok@google.com> am fb3fdf15: Merge "Enhance the safety net in multiple word suggestions" into jb-dev

* commit 'fb3fdf15040cb67c311047dea0144fa5aa071185':
Enhance the safety net in multiple word suggestions
67e3cc8488aeae6fbeec139a07c08491b0fc40e8 31-May-2012 Satoshi Kataoka <satok@google.com> Enhance the safety net in multiple word suggestions

Bug: 6576793

Change-Id: I97100b482d3cf4e1b417c197b5d1e971a72b998d
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
d8096b1a12ce1f6f53a2b269956043d77276f007 30-May-2012 Satoshi Kataoka <satok@google.com> am 6cbe204f: Fix the performance issue on suggesting aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

* commit '6cbe204fce109fab652da15f4c8ea1ae35fca3e7':
Fix the performance issue on suggesting aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
6cbe204fce109fab652da15f4c8ea1ae35fca3e7 30-May-2012 Satoshi Kataoka <satok@google.com> Fix the performance issue on suggesting aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

Bug: 6576793

Change-Id: I46f56654cd25dc28668ad75ac71e0e3beb8cdcf3
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
40ab25cab0e723f34dc2f1442f414761149b2338 29-May-2012 Satoshi Kataoka <satok@google.com> am 51705efc: Merge "Add a JNI to get the frequency" into jb-dev

* commit '51705efc96c1c555cf9e0f557ea8cdc1c1c97781':
Add a JNI to get the frequency
2f854e170c9fde47cae804145f90d164cdb5ceb8 29-May-2012 Satoshi Kataoka <satok@google.com> Add a JNI to get the frequency

Bug: 4192129
Change-Id: I3f220f5a10114d4eb23956148076cf76220bda0f
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
074e8c920683d77fae191b34185193c896e3aa00 23-May-2012 satok <satok@google.com> am a0ac31fc: Fix the issue on multiple words suggestion

* commit 'a0ac31fcaa01c21592a6e7af243c14dada65cf3e':
Fix the issue on multiple words suggestion
a0ac31fcaa01c21592a6e7af243c14dada65cf3e 23-May-2012 satok <satok@google.com> Fix the issue on multiple words suggestion

Bug: 6509844
Change-Id: I823074a2b29befc3e60c63699ab4dc7719105c63
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
cd7c41352f575e411705b3ff07f5e7c31c94db7a 17-May-2012 Jean-Baptiste Queru <jbq@google.com> Fix build

Change-Id: I799811aa3afb59bba2e4086a063f5da03669bba3
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
3b088a2f365a9ce06f58243c83cb961ea2920b7e 16-May-2012 Ken Wakasa <kwakasa@google.com> Add missing includes.

Change-Id: Ic7199045d0cffb208871f52cc167194013351d32
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
0028ed3627ff4f37a62a80f3b2c857e373cd5090 16-May-2012 satok <satok@google.com> Use "float" instead of "double"

Change-Id: I93ed4d88ede4058f081dd8d634b00dfff4e96d07
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
db1939dbaa1de59eaf5693e2c89b02b323e9aac8 15-May-2012 satok <satok@google.com> Reorder suggestions result according to auto correction threshold

Bug: 5413904
Change-Id: I3aa3a8109ba45d2129b58d8242866fd3dd3473cb
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
49ba135fdedb3c6b33ec915e91ecad682b7655b8 07-May-2012 Jean Chalard <jchalard@google.com> Perform the actual bigram frequency lookup.

This still returns the unigram frequency, because the values stored
for bigrams in the dictionary are not ready to be returned in-place
instead of unigram values. Aside from this, the code is complete.

Bug: 6313806
Change-Id: If7bb7b644730782277f0f6663334c170b7fe13fb
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
8950ce6c44706467bb386570ae236a2b8b983666 07-May-2012 Jean Chalard <jchalard@google.com> Replace the bigram list position with the map and filter

Passing the position will not allow us a reasonable lookup
time. Replace this with a map and bloom filter for very fast
lookup.

Bug: 6313806
Change-Id: I3a61c0001cbc987c1c3c7b8df635d4590a370144
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
4d9b202c4009352352ae98f8b13fe6330d102ba6 23-Apr-2012 Jean Chalard <jchalard@google.com> Pass the bigram list position from the top level

The position itself is still a const int = 0 until we have the previous
word passed to the function. This basically does the plumbing.

Bug: 6313806
Change-Id: Ib58995f334fe93e3ff5704d7c79f332017f101ac
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
171d1809ffc724de4fb793f481d592644e3d141e 23-Apr-2012 Jean Chalard <jchalard@google.com> Add methods to inverse compute the probability.

For now the probability is just returned with the same
value it had, but this is some ground work that needs to be
done anyway.

Bug: 6313806
Change-Id: I9bb8b96b294109771208ade558c9ad56932d2f8e
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
522a04ea5b249d0af556647d2abcad57e5b99b4f 23-Apr-2012 Jean Chalard <jchalard@google.com> Pass words as int[] to the native code.

We need to get the bigrams during the call to getSuggestions for
bug#6313806. We already give an int[] to getSuggestions and we
wanted to get rid of char[]'s anyway because it doesn't work with
surrogate pairs, so here we go.

Bug: 6313806
Change-Id: I56ce99f1db6b3302cdf42f0527343bded837091e
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
338d3ec725a952cbe603ac8b2d49c337463f4093 06-Apr-2012 Jean Chalard <jchalard@google.com> Replace the flags in getSuggestions with a boolean.

Change-Id: I0ec44df1979cb1dc21017ea290d2151a2af0e7cd

Conflicts:

java/src/com/android/inputmethod/latin/Suggest.java
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
aa8df599146e9599b872398c067a2ee27079b659 06-Apr-2012 Jean Chalard <jchalard@google.com> Enable using the flags read from the binary file.

Change-Id: Ib420c3e174ccc1a80c4b6fd066de3b7a2b6fb290
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
cd274b146961c95abf1436caa8ec482650432b6e 06-Apr-2012 Jean Chalard <jchalard@google.com> Save the flags in a member in the unigram dictionary.

Change-Id: Ic8fad9110db6b97f98ace27af0f347b4e69de8c8
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
e81ac8baa0dc0e8d671c813b93100070c23b9a1d 06-Apr-2012 Jean Chalard <jchalard@google.com> Add a method to get the flags from a binary dictionary.

This method is not used yet

Change-Id: Ic15d3d423aff2c83c712bc0aa56571d30755e663
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
5b0761e6a94227d6ef788f589fb6edcd44ed791f 06-Apr-2012 Jean Chalard <jchalard@google.com> Remove write-only stuff

Change-Id: I5ac8ab64c77a298502b3d063ea70db9b4da41716
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
9a933a742d2a3ffdfb955705ad086035bc27db60 27-Mar-2012 Jean Chalard <jchalard@google.com> Read shortcuts as strings in the dictionary.

This has no impact on performance.
Before:
(0) 9.61 (0.01%)
(1) 57514.58 (56.70%)
(2) 10.55 (0.01%)
(3) 10.79 (0.01%)
(4) 133.20 (0.13%)
(5) 43553.87 (42.94%)
(6) 10.03 (0.01%)
(20) 47.20 (0.05%)
Total 101431.47 (sum of others 101289.84)

After:
(0) 10.52 (0.01%)
(1) 56311.16 (56.66%)
(2) 13.40 (0.01%)
(3) 10.98 (0.01%)
(4) 136.72 (0.14%)
(5) 42707.92 (42.97%)
(6) 9.79 (0.01%)
(20) 51.35 (0.05%)
Total 99390.76 (sum of others 99251.84)

The difference is not significant with regard to measure imprecision

Change-Id: I2e4f1ef7a5e99082e67dd27f56cf4fc432bb48fa
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp
3ef3e24a12ed72204f7a6f2e4b2df8ce7d243746 30-Mar-2012 Ken Wakasa <kwakasa@google.com> Move the "src" directory as a preparation for Ib4a47342 and I66f6c5b9

Change-Id: I3ab65059f6e356530484bfd0bba26a634a4cba65
/packages/inputmethods/LatinIME/native/jni/src/unigram_dictionary.cpp