szte.io
Class CorpusStat
java.lang.Object
szte.io.CorpusStat
public class CorpusStat
extends java.lang.Object
CorpusStat computes basic statistics (corpus size, number of labels, average number of labels per document, etc.) for multi-label corpora accessed through the DocumentSet interface.
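The kind of statistics described above can be illustrated with a minimal, self-contained sketch. It does not use the szte.io API: the methods of DocumentSet are not shown on this page, so the corpus is modelled here as a hypothetical Map from document ID to that document's labels. The class name CorpusStatSketch and its method names are assumptions made for illustration only.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for illustration: a corpus is a map from document ID
// to the list of labels assigned to that document. This is NOT the
// szte.io.DocumentSet interface, whose methods are not documented here.
public class CorpusStatSketch {

    // Total number of (document, label) assignments in the corpus.
    public static int totalAssignments(Map<String, List<String>> corpus) {
        int total = 0;
        for (List<String> labels : corpus.values()) {
            total += labels.size();
        }
        return total;
    }

    // Set of distinct labels used anywhere in the corpus.
    public static Set<String> labelSet(Map<String, List<String>> corpus) {
        Set<String> labels = new TreeSet<>();
        for (List<String> docLabels : corpus.values()) {
            labels.addAll(docLabels);
        }
        return labels;
    }

    // Average number of labels per document; 0.0 for an empty corpus.
    public static double avgLabelsPerDoc(Map<String, List<String>> corpus) {
        if (corpus.isEmpty()) {
            return 0.0;
        }
        return (double) totalAssignments(corpus) / corpus.size();
    }

    public static void main(String[] args) {
        Map<String, List<String>> corpus = new LinkedHashMap<>();
        corpus.put("doc1", Arrays.asList("sports", "politics"));
        corpus.put("doc2", Arrays.asList("sports"));
        corpus.put("doc3", Arrays.asList("economy", "politics"));

        System.out.println("documents:   " + corpus.size());
        System.out.println("labels:      " + labelSet(corpus).size());
        System.out.println("assignments: " + totalAssignments(corpus));
        System.out.println("avg labels:  " + avgLabelsPerDoc(corpus));
    }
}
```

With a real DocumentSet, the corresponding values would presumably come from the static methods listed below, such as getTotalAssignmentNum and getLabelSet.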
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
CorpusStat
public CorpusStat()
getTotalTokenNum
public static int getTotalTokenNum(DocumentSet docset)
getTotalAssignmentNum
public static int getTotalAssignmentNum(DocumentSet docset)
getLabelSet
public static java.util.Set<java.lang.String> getLabelSet(DocumentSet docset)
printStatistics
public static void printStatistics(DocumentSet docset)
printStatistics
public static void printStatistics(DocumentSet docseta,
DocumentSet docsetb)
main
public static void main(java.lang.String[] args)