extraction model 0.002777801
penalty model 0.002611335
gap model 0.002576525
tic model 0.002544187
robust model 0.002521735
model limitation 0.002515787
model 0.00227734
alignment algorithm 0.002193596
sequence alignment 0.002138794
local alignment 0.002039799
word accord 0.001920346
quence alignment 0.001888908
pairwise alignment 0.001861056
alignment mechanism 0.001851922
alignment exten 0.001846818
common words 0.00168136
function words 0.001647895
component words 0.001610397
alignment 0.0016086
certain words 0.00149327
test corpus 0.001479087
content words 0.001412453
unmatched words 0.001406056
nent words 0.001404255
test set 0.001334392
corpus size 0.001277074
mwe extraction 0.001275062
semantic information 0.001257931
sequence set 0.001250061
same set 0.001176439
words 0.00116431
standard set 0.001130273
corpus support 0.001124479
corpus sizes 0.001117645
similarity score 0.00110796
further mwe 0.001105296
mwe loss 0.001096735
human translation 0.001089374
information extraction 0.001086631
mwe detection 0.001084609
candidate mwe 0.001083036
new approach 0.001077783
mwe numbers 0.001072429
flexible mwe 0.001066326
pattern extraction 0.001061984
mathematical method 0.001055737
relative probability 0.001048924
common patterns 0.001037135
machine translation 0.001032032
candidate set 0.001028302
standard test 0.001024931
mwe identification 0.00102399
mwe min 0.00101947
mwe repeats 0.001013021
mwe identifica 0.001013021
other phrase 9.9925E-4
consistent set 9.6693E-4
original data 9.64579E-4
sistent set 9.62807E-4
extraction task 9.47612E-4
variable patterns 9.47612E-4
open test 9.43708E-4
lcs approach 9.39307E-4
similarity scores 9.20163E-4
mutual information 9.15557E-4
linguistic information 9.14501E-4
detailed pattern 9.13071E-4
advanced evaluation 9.04982E-4
statistical methods 8.94682E-4
our approach 8.90355E-4
pattern extension 8.90292E-4
tag sequence 8.82627E-4
lexical reference 8.82277E-4
frequency data 8.75335E-4
average sentence 8.74626E-4
multiple sequence 8.73924E-4
candidate pattern 8.69958E-4
phrase detection 8.67253E-4
length information 8.67238E-4
pattern recognition 8.66069E-4
global sequence 8.65366E-4
corpus 8.64562E-4
frequent information 8.63554E-4
pos information 8.62019E-4
close test 8.60971E-4
closed test 8.57636E-4
base phrase 8.57485E-4
lexical units 8.56182E-4
data sparseness 8.54708E-4
programming algorithm 8.54662E-4
flexible pattern 8.53248E-4
noise pattern 8.51956E-4
sentence length 8.48325E-4
noun phrase 8.33673E-4
evaluation criterion 8.3069E-4
sum score 8.27982E-4
natural language 8.25243E-4
same context 8.22803E-4
many criteria 8.18341E-4
penalty function 8.1758E-4
