Ishaan Karan


2026

We present a Span-Consistency Network (SCN) for conspiracy marker extraction in English social media text. The task requires identifying character-level spans for five marker types (Actor, Action, Effect, Evidence, and Victim) under overlap-based Macro F1 evaluation. Standard token-level classifiers often produce fragmented spans, ignore inter-marker dependencies, and struggle with severe class imbalance.Our approach addresses these challenges through three components. First, a Span Consistency Layer (SCL) propagates span-level confidence signals to encourage coherent boundary formation. Second, Cross-Marker Attention (CMA) models co-occurrence patterns between marker types via a learned correlation matrix. Third, we introduce Span Count Regularization (SCR), a total-variation-based constraint that aligns soft token probabilities with the expected number of discrete spans, mitigating prediction collapse under threshold decoding.Built on DeBERTa-v3-large and trained with a recall-biased Tversky loss, our system is ensembled across five stratified folds. It achieved a Macro F1 of 0.24 on the official test set, placing second among participating teams. Ablation studies show that SCR plays a critical role in maintaining span structure, particularly for low-frequency and long-span markers.
Search
Co-authors
    Venues
    Fix author