SubmissionNumber#=%=#24
FinalPaperTitle#=%=#PreClinIE: An Annotated Corpus for Information Extraction in Preclinical Studies
ShortPaperTitle#=%=#
NumberOfPages#=%=#14
CopyrightSigned#=%=#Simona Emilova Doneva
JobTitle#==#
Organization#==#
Abstract#==#Animal research, sometimes referred to as preclinical research, plays a vital role in bridging the gap between basic science and clinical applications. However, the rapid increase in publications and the complexity of reported findings make it increasingly difficult for researchers to extract and assess relevant information. While automation through natural language processing (NLP) holds great potential for addressing this challenge, progress is hindered by the absence of high-quality, comprehensive annotated resources specific to preclinical studies. To fill this gap, we introduce PreClinIE, a fully open, manually annotated dataset. The corpus consists of abstracts and methods sections from 725 publications, annotated for study rigor indicators (e.g., random allocation) and other study characteristics (e.g., species). We describe the data collection and annotation process, outlining the challenges of working with preclinical literature. By providing this resource, we aim to accelerate the development of NLP tools that enhance literature mining in preclinical research.
Author{1}{Firstname}#=%=#Simona Emilova
Author{1}{Lastname}#=%=#Doneva
Author{1}{Username}#=%=#sdoneva
Author{1}{Email}#=%=#donevasimona@gmail.com
Author{1}{Affiliation}#=%=#University of Zurich
Author{2}{Firstname}#=%=#Hanna
Author{2}{Lastname}#=%=#Hubarava
Author{2}{Username}#=%=#hanna_hubarava
Author{2}{Email}#=%=#hanna.hubarava@gmail.com
Author{2}{Affiliation}#=%=#University of Zurich
Author{3}{Firstname}#=%=#Pia Andrea
Author{3}{Lastname}#=%=#Härvelid
Author{3}{Username}#=%=#piahaervelid
Author{3}{Email}#=%=#pia.haervelid@hotmail.com
Author{3}{Affiliation}#=%=#University of Zurich, Switzerland
Author{4}{Firstname}#=%=#Wolfgang Emanuel
Author{4}{Lastname}#=%=#Zürrer
Author{4}{Email}#=%=#wolfgangemanuel.zuerrer@uzh.ch
Author{4}{Affiliation}#=%=#University of Zurich
Author{5}{Firstname}#=%=#Julia V.
Author{5}{Lastname}#=%=#Bugajska
Author{5}{Username}#=%=#jbugajska
Author{5}{Email}#=%=#julia.bugajska@gmail.com
Author{5}{Affiliation}#=%=#University of Zurich
Author{6}{Firstname}#=%=#Bernard Friedrich
Author{6}{Lastname}#=%=#Hild
Author{6}{Email}#=%=#bernardfriedrich.hild@uzh.ch
Author{6}{Affiliation}#=%=#University of Zurich
Author{7}{Firstname}#=%=#David
Author{7}{Lastname}#=%=#Brüschweiler
Author{7}{Email}#=%=#david.brueschweiler@bluewin.ch
Author{7}{Affiliation}#=%=#University of Zurich, Switzerland
Author{8}{Firstname}#=%=#Gerold
Author{8}{Lastname}#=%=#Schneider
Author{8}{Username}#=%=#gschneid
Author{8}{Email}#=%=#gschneid@ifi.uzh.ch
Author{8}{Affiliation}#=%=#University of Zurich
Author{9}{Firstname}#=%=#Tilia
Author{9}{Lastname}#=%=#Ellendorff
Author{9}{Username}#=%=#tilia
Author{9}{Email}#=%=#tilia.ellendorff@uzh.ch
Author{9}{Affiliation}#=%=#University of Zurich
Author{10}{Firstname}#=%=#Benjamin Victor
Author{10}{Lastname}#=%=#Ineichen
Author{10}{Username}#=%=#benjaminvictorineichen
Author{10}{Email}#=%=#benjamin.ineichen@uzh.ch
Author{10}{Affiliation}#=%=#University of Zurich, Center for Reproducible Science
========== èéáğö