wikipedia data 0.00317246
english wikipedia 0.00294962
wikipedia language 0.002907061
wikipedia article 0.002860084
wikipedia articles 0.002807413
wikipedia revision 0.002660948
local wikipedia 0.002470523
wikipedia wiki 0.002431164
individual wikipedia 0.002425703
wikipedia dump 0.00242052
wikipedia growth 0.002407795
wikipedia access 0.002404272
wikipedia object 0.002382657
encyclopedia wikipedia 0.002382584
wikipedia snap 0.002368306
official wikipedia 0.002367325
wikipedia snapshots 0.002352887
wikipedia apis 0.002347906
glish wikipedia 0.002346774
wikipedia miner 0.002346765
wikipedia mirror 0.002346086
established wikipedia 0.002346086
wikipedia policies 0.002346086
wikipedia website 0.002346086
wikipedia administrators 0.002346086
wikipedia sum 0.002346086
wikipedia 0.0021169
word sense 0.001748511
revision data 0.001599608
training data 0.001592342
article text 0.00158427
sense disambiguation 0.001579823
disambiguation category 0.001463812
available data 0.001424494
ing data 0.001404866
revision text 0.001385134
jwpl data 0.001383896
history data 0.001341868
data transfer 0.001321866
live data 0.001320466
meta data 0.001303734
vision data 0.00129142
text revisions 0.001291168
data volume 0.001284745
redundant data 0.001284745
new article 0.001265272
main category 0.001210092
article revisions 0.001193266
language processing 0.0011265
article content 0.001117704
full text 0.001102988
article edits 0.001101242
language versions 0.001097307
recent english 0.001093098
value language 0.001092957
specific language 0.001087047
text files 0.001083141
text categorization 0.001078438
first sentence 0.001077215
plain text 0.001075391
text summarization 0.001071738
eleven text 0.001071738
english maincategory 0.001069152
single article 0.001065719
language version 0.001054531
category membership 0.001053473
natural language 0.001052036
same time 0.001051059
specific article 0.00104007
other operations 0.001029196
other components 0.001027781
article automobile 0.001009178
full article 0.001005086
individual articles 9.99316E-4
other conditions 9.96716E-4
other possibilities 9.92066E-4
diff algorithm 9.90232E-4
page query 9.89142E-4
article sections 9.74679E-4
article stores 9.74379E-4
research results 9.70493E-4
first use 9.67615E-4
sense 9.397E-4
revision storage 9.36721E-4
revision pair 9.30645E-4
meaningful articles 9.28691E-4
revision database 9.25133E-4
new resource 9.2397E-4
knowledge source 9.23402E-4
spam articles 9.23324E-4
deleted articles 9.20408E-4
second sentence 9.15871E-4
string search 9.13874E-4
previous revision 9.11499E-4
database user 8.91233E-4
disambiguation fromtimestamp 8.90839E-4
nlp research 8.72357E-4
complete set 8.65148E-4
additional user 8.62816E-4
user comment 8.55304E-4
