language model 0.00519213
maxent model 0.004375443
bigram model 0.004354663999999999
baseline model 0.004333888
pcfg model 0.00420518
bayes model 0.004178849
basic model 0.004161706
unigram model 0.004130328
gram model 0.004124344
constituent model 0.004110634
line model 0.004092669
erative model 0.004090432
model outper 0.00408602
model 0.00383047
language models 0.002182757
training data 0.0021100299999999997
bigram language 0.001885854
function words 0.001790852
unigram language 0.001661518
gram language 0.001655534
language processing 0.001647794
natural language 0.001627671
original data 0.001551219
training set 0.0015455299999999998
training input 0.001414146
syntactic information 0.0014041919999999999
training document 0.001388778
language 0.00136166
syntactic features 0.001350334
large corpus 0.001330748
training documents 0.0013289299999999999
baseline models 0.001324515
training datasets 0.001311646
lexical information 0.001267036
little training 0.001256491
tive training 0.001221962
sparse training 0.001209651
sufficient training 0.0012047949999999998
insufficient training 0.001203584
base corpus 0.001197391
stop words 0.001192386
discriminative models 0.001190137
performance difference 0.0011716460000000001
different baseline 0.001168458
test set 0.0011334119999999999
complete models 0.001129138
topic information 0.00111262
guage models 0.001106754
baseline method 0.0011059350000000002
poor performance 0.001102765
background corpus 0.001094514
rior performance 0.00109124
generative models 0.001084384
trigram models 0.001077851
brown corpus 0.001075132
wsj corpus 0.001065746
other methods 0.001061576
small number 0.001055884
probabilistic context 0.001044702
extraneous information 0.001030254
different datasets 0.0010269160000000001
test document 9.7666E-4
different authors 9.76263E-4
syntactic writing 9.64509E-4
dataset maxent 9.52823E-4
same topic 9.52603E-4
training 9.4977E-4
syntactic level 9.35969E-4
syntactic infor 9.31468E-4
different areas 9.208840000000001E-4
test documents 9.16812E-4
function 9.12778E-4
several datasets 9.080620000000001E-4
syntactic structure 8.97777E-4
same background 8.975230000000001E-4
current state 8.792629999999999E-4
current approach 8.7873E-4
words 8.78074E-4
binomial variables 8.77731E-4
total number 8.691279999999999E-4
maxent classifier 8.479550000000001E-4
several news 8.46648E-4
penn treebank 8.4293E-4
several approaches 8.4161E-4
limited number 8.41401E-4
performance 8.35764E-4
unequal number 8.297479999999999E-4
ing poetry 8.22824E-4
models 8.21097E-4
pcfg approach 8.17295E-4
free grammar 8.1583E-4
test sets 8.12304E-4
corpus 8.10177E-4
baseline mod 8.076210000000001E-4
original author 7.99897E-4
test samples 7.97089E-4
opennlp sentence 7.83057E-4
sentence segmenter 7.83057E-4
basic approach 7.738210000000001E-4
important documents 7.733740000000001E-4
