english sentences 0.0029987399999999997
english sentence 0.0028538599999999997
english data 0.00264151
english corpus 0.002439212
word features 0.002263356
english text 0.002263237
parallel english 0.002094134
many english 0.002088125
annotated english 0.002042807
english classifier 0.002033618
german sentences 0.001994741
word alignment 0.001967203
english side 0.001957483
english discourse 0.001935901
english speakers 0.001927697
monolingual english 0.001912784
word class 0.001912638
english novels 0.001905
modern english 0.001897856
english utterances 0.0018892170000000001
english units 0.001886161
contemporary english 0.001883049
german sentence 0.001849861
many sentences 0.0018136049999999998
word align 0.0017909529999999999
context sentences 0.0017897899999999999
random sentences 0.0017355209999999999
labeled sentences 0.001716561
complete sentences 0.0016912399999999999
sentence alignment 0.0016893029999999999
individual sentences 0.001680996
english 0.00163663
target language 0.001636172
surrounding sentences 0.0016090599999999998
vidual sentences 0.0016090599999999998
partial sentences 0.0016090599999999998
training data 0.0015871120000000001
source language 0.001569443
person information 0.001546909
language pair 0.001546098
local words 0.0015451800000000002
statistical model 0.001531251
sentence con 0.00151852
current sentence 0.0015152409999999999
sequence model 0.001513499
previous sentence 0.001513454
language processing 0.001477655
sentence splitter 0.001464206
sentence aligner 0.001464206
sentence boundary 0.001464206
global information 0.001460261
natural language 0.0014504890000000001
formulaic language 0.001435679
german corpus 0.001435213
model accuracy 0.001420814
social information 0.001409997
sentences 0.00136211
bayes model 0.00131372
effective model 0.001291362
data selection 0.00128728
segmentation information 0.001276014
parallel corpus 0.001260086
data preparation 0.0012517000000000001
sentence 0.00121723
language 0.00118527
words 0.00118452
feature set 0.0011636939999999998
different languages 0.00116145
same time 0.0011507840000000002
other examples 0.0010823579999999998
such languages 0.001074145
comparable corpus 0.001060212
training set 0.001059466
model 0.00103089
meaningful features 0.001017595
idiosyncratic features 0.001017595
initial position 0.001012574
information 0.00101202
translation science 9.796100000000001E-4
different websites 9.74631E-4
glish text 9.73659E-4
german pronouns 9.69035E-4
chine translation 9.66247E-4
archaic feature 9.634839999999999E-4
german side 9.534840000000001E-4
labeled training 9.36683E-4
single german 9.13772E-4
statistical analysis 9.07869E-4
german novels 9.01001E-4
second person 8.971719999999999E-4
global constraints 8.91403E-4
disambiguation rules 8.90628E-4
such nov 8.90411E-4
text categorization 8.8937E-4
many languages 8.878989999999999E-4
clustering methods 8.82954E-4
regression models 8.78406E-4
text categoriza 8.762170000000001E-4
annotation errors 8.73788E-4
bayes models 8.72072E-4
