training data 0.00382936
different data 0.003490102
chinese data 0.003489223
unlabeled data 0.003476033
labeled data 0.003443325
test data 0.003384542
ing data 0.003298521
annotated data 0.003195737
data source 0.00309272
projected data 0.0030924710000000003
data sources 0.003072891
data this 0.002978899
unannotated data 0.002975118
jected data 0.002975063
informative data 0.002974542
beled data 0.002968059
language model 0.002468234
chinese word 0.0023621429999999997
english word 0.002340199
learning model 0.002272751
joint model 0.002093203
translation model 0.002088777
word sentence 0.002061282
word alignment 0.001997419
word coverage 0.001952627
maximum word 0.0019499589999999998
word sense 0.0019458779999999998
linear model 0.001926783
training corpus 0.0019252879999999998
bayes model 0.0019022140000000002
word alignments 0.0018957709999999999
word align 0.001865963
poor word 0.001860353
input word 0.001860142
word orders 0.001839902
tagging model 0.0018345800000000002
final model 0.001826781
trigram model 0.0018242970000000001
ment model 0.0017991250000000002
trained model 0.0017949020000000001
pervised model 0.0017905870000000002
ital model 0.001783518
training algorithm 0.0017171579999999999
training set 0.0016869429999999998
chinese corpus 0.001585151
chinese words 0.001577701
learning method 0.001571046
other words 0.001566789
chinese pos 0.001563427
english words 0.001555757
pos tag 0.001548716
english parser 0.00153457
model 0.00153431
other approach 0.0014941300000000002
similar language 0.0014804100000000001
same pos 0.001474016
learning approach 0.00147034
parallel corpus 0.001467815
same tag 0.0014421640000000001
test sentences 0.001425903
training examples 0.0014078799999999998
notated training 0.0013678599999999998
training tanno 0.001362849
training exam 0.001362026
other methods 0.001358485
learning methods 0.001334695
tag set 0.001332095
pos tags 0.001310626
large corpus 0.001305488
language pairs 0.0012961420000000001
ing models 0.0012918629999999999
annotated corpus 0.001291665
natural language 0.001283937
language applications 0.001273414
backoff language 0.001271339
language processing 0.001265618
language side 0.001260222
language resources 0.001253454
supervised learning 0.001252475
english noun 0.0012466719999999999
small corpus 0.001242816
test set 0.001242125
unknown words 0.001231857
language pair 0.001230077
pos tagger 0.001213133
language modeling 0.001211007
foreign language 0.001196243
dissimilar language 0.0011831810000000002
english tagger 0.001174048
discounting method 0.001164241
english nlp 0.001159478
treebank pos 0.0011464140000000001
chinese treebank 0.001129273
learning process 0.0011230060000000002
basic pos 0.001119608
training 0.00111328
different views 0.00110227
noisy corpus 0.001097154
pos tagging 0.001090554
quality english 0.001086823
