large corpus 0.002501064
training corpus 0.0024017920000000002
automatic corpus 0.002325766
test corpus 0.0022826120000000003
corpus con 0.002279764
size corpus 0.0022656810000000003
corpus size 0.0022656810000000003
corpus precision 0.002250083
manual corpus 0.002240932
annotated corpus 0.002211001
corpus figure 0.002207114
tagged corpus 0.002188667
corpus description 0.002175518
matic corpus 0.002152013
tomatic corpus 0.0021454110000000003
corpus 0.0018906
list learning 0.0015205370000000002
list web 0.001401023
language processing 0.001312359
such sentences 0.00126891
learning method 0.001254107
web search 0.00125154
learning features 0.001237966
different domains 0.001213777
learning approach 0.0012074590000000001
different processes 0.001193294
machine learning 0.001191121
natural language 0.001184789
words person 0.0011766469999999998
web documents 0.0011744799999999999
linguistic information 0.001150988
infinite language 0.001126719
korean word 0.001126536
same size 0.001125182
large training 0.001121656
unknown words 0.001104355
supervised learning 0.00110434
functional words 0.00108478
engine web 0.0010819129999999999
learning meth 0.00106937
learning sys 0.00106937
small word 0.001064651
test data 0.001063675
word window 0.001038919
sentence level 0.0010361230000000001
functional word 0.001023413
other contexts 0.0010200119999999998
web sites 0.001017414
word ambiguity 0.001006649
corresponding web 0.00100486
context information 0.001002527
decision list 9.93991E-4
sentence separation 9.88563E-4
web page 9.77528E-4
url list 9.71256E-4
sion list 9.619140000000001E-4
cision list 9.619140000000001E-4
web robot 9.55773E-4
web doc 9.54635E-4
lect web 9.538179999999999E-4
mous web 9.505189999999999E-4
web siderations 9.505189999999999E-4
training automatic 9.46358E-4
search engine 9.43699E-4
previous section 9.429200000000001E-4
limited data 9.41824E-4
sentences features 9.377529999999999E-4
common noun 9.36461E-4
data sparseness 9.359119999999999E-4
sentence refinement 9.35584E-4
ner system 9.34725E-4
text refinement 9.3301E-4
sentence boundary 9.25116E-4
human intervention 9.10076E-4
experimental results 9.05298E-4
contextual information 8.909440000000001E-4
robot sentence 8.893410000000001E-4
sentence separator 8.837270000000001E-4
organization training 8.822200000000001E-4
name alias 8.735209999999999E-4
language 8.69759E-4
son names 8.66056E-4
proper noun 8.6293E-4
manual training 8.61524E-4
several heuristics 8.53211E-4
search engines 8.22163E-4
learning 8.14391E-4
internet search 8.124079999999999E-4
context features 8.08689E-4
words 8.04968E-4
satisfiable performance 7.871460000000001E-4
korean use 7.64888E-4
automatic cor 7.58152E-4
single nouns 7.48243E-4
compound noun 7.47498E-4
pound noun 7.46434E-4
processing schemes 7.451980000000001E-4
automatic generation 7.43474E-4
test manual 7.42344E-4
automatic acquisition 7.39188E-4
