This directory contains 3 sub-directories:

* posts_xml: These are posts in XML format.

* Sentences: These are the sentences of each post split into sentences. Each file has the following format:

source_discussant <tab> target_discussant <tab> thread_id <tab> post_id <tab> sentence <newline>

* Labels: These are the subgroup labels of discussants. Each file has the following format

discussant <tab> subgroup_id