word clustering 0.00306128
unsupervised word 0.002688763
word clusters 0.002600366
lexical word 0.002588392
word categories 0.002522923
word system 0.0025144440000000002
word classes 0.002411549
frequent word 0.002395972
word nature 0.002379518
flat word 0.002335067
similar words 0.002304896
word assumption 0.002267088
duce word 0.002262481
word groupings 0.002260835
word mappings 0.002259969
word alignment 0.002257995
word cool 0.002257995
frequent words 0.002247452
head words 0.002228439
ambiguous words 0.002169623
unknown words 0.0021546639999999997
neighboring words 0.002149214
unique words 0.0021418369999999997
groups words 0.002125272
lar words 0.002120557
rare words 0.002114407
lion words 0.002109694
unsupervised tags 0.001925833
dependency model 0.0019128320000000002
gold tags 0.0018973319999999998
similarity model 0.001880282
words 0.00186551
different tags 0.001834445
gold tag 0.0017479319999999998
possible tags 0.0017396170000000002
clustering algorithm 0.001725868
automata model 0.001700341
supervised tags 0.001671856
data set 0.00164319
monosemous tags 0.0016358219999999999
training data 0.001588924
new tag 0.00158519
flat tags 0.001572137
tags scores 0.001549304
common tags 0.001547828
speech tags 0.001537565
tags our 0.001518651
induced tags 0.0015124259999999999
manual tags 0.001505264
clustering results 0.001499246
monosemous tag 0.0014864219999999998
similarity clustering 0.001479162
model 0.00144837
monosemous clustering 0.001431972
tag distributions 0.0014042389999999998
clustering algorithms 0.0013951039999999999
clustering assignments 0.001392401
clustering schemes 0.001379185
clustering techniques 0.001377269
flat clustering 0.001368287
hard clustering 0.001366857
multiple clustering 0.001359392
polysemous clustering 0.001349573
hierarchical clustering 0.00132357
training set 0.001316092
unlabeled data 0.001310589
unsupervised training 0.001305646
classic clustering 0.001303436
soft clustering 0.001293165
data sets 0.001281843
frequency information 0.001281748
ing models 0.001264573
enough data 0.001252578
different gold 0.001229577
training corpus 0.00122883
first set 0.0012210329999999998
other languages 0.001220901
data the 0.0012190810000000001
hmm models 0.001217129
unsupervised learning 0.001195966
test set 0.001190423
sequence models 0.0011856829999999999
unsupervised categories 0.001183626
final models 0.001173824
lexicalized models 0.001151181
similarity cluster 0.001147541
own models 0.001145133
markov models 0.001143805
unsupervised dependency 0.001139195
unsupervised results 0.001126729
label set 0.001115402
gold dependency 0.001110694
same way 0.001103981
later models 0.001093773
pler models 0.001093107
lexical categories 0.001083255
syntactic categories 0.001072701
ing gold 0.00106302
standard tagging 0.001062966
such sequence 0.001061502
