This was removed due to size constraints. See https://github.com/DFKI-NLP/sam/tree/main/experiments/prediction/filtered for the original data.