Appendix C: Guidelines for Scoring Mismatches Between System Responses and Answer Key