Wei Han
Other people with similar names: Wei Han
Unverified author pages with similar names: Wei Han
2026
RADS: Reinforcement Learning-Based Sample Selection Improves Transfer Learning in Low-resource and Imbalanced Clinical Settings
Wei Han | David Martinez Iraola | Anna Khanina | Lawrence Cavedon | Karin Verspoor
Findings of the Association for Computational Linguistics: ACL 2026
Wei Han | David Martinez Iraola | Anna Khanina | Lawrence Cavedon | Karin Verspoor
Findings of the Association for Computational Linguistics: ACL 2026
A common strategy in transfer learning is few shot fine-tuning, but its success is highly dependent on the quality of samples selected as training examples. Active learning methods such as uncertainty sampling and diversity sampling can select useful samples. However, under extremely low-resource and class-imbalanced conditions, they often favor outliers rather than truly informative samples, resulting in degraded performance. In this paper, we introduce RADS (Reinforcement Domain Adaptive Sampling), a robust sample selection strategy using reinforcement learning (RL) to identify the most informative samples. Experimental evaluations on several real world clinical datasets show our sample selection strategy enhances model transferability while maintaining robust performance under extreme class imbalance compared to traditional methods. Our code is open-sourced on GitHub.