Each line in the the supervised dataset files is a set of tab seprated values with the following order:

Label -> head_1 -> tail_1 -> head_2 -> tail_2

Each line in the the unsupervised dataset files is a set of tab seprated values with the following order:

Label -> head_1 -> tail_1 -> head_2 -> tail_2 -> head_3 -> tail_3
