Chiyao Zhou


2026

This paper describes our system for the SemEval-2026 Task 12 on Abductive Event Reasoning (AER). We systematically address the "over-selection" hallucination pathology in Instruction-tuned Large Language Models (LLMs), where models erroneously align distractors with semantic priors rather than retrieved evidence. Our framework utilizes a 32-billion parameter Qwen2.5 foundational model adapted via Low-Rank Adaptation (LoRA) and evaluated under a Zero-shot Chain-of-Thought (CoT) setting. To mitigate epistemic noise, we propose a Precision-Oriented Decoding (POD) strategy that couples low-temperature sampling (T=0.45) with scaled majority voting (K=9). Following a three-stage empirical evolution—from baseline diagnosis to precision optimization and ensemble analysis—our system achieved a score of 0.802 on the official test set. Our findings demonstrate that in causal reasoning tasks with strict penalization for incorrect predictions, epistemic noise suppression is strictly superior to heuristic recall compensation.