Won Kim
2026
BioTopicXplor: A Web Tool for Interactive Exploration of PubMed Literature through Reproducible Topics.
Lana Yeganova | Donald Comeau | Won Kim | Natalie Xie | Shubo Tian | W John Wilbur | Zhiyong Lu
BioNLP 2026
The rapid growth of biomedical literature presents a major challenge for organizing knowledge and identifying emerging research trends. While PubMed provides effective access to relevant articles, it does not support understanding the conceptual structure of document collections. Existing tools rely on predefined features, ontologies, or parameter-sensitive clustering methods, limiting their ability to uncover fine-grained, data-driven topics in a reproducible manner. We present BioTopicXplor, an on-demand web server for interactive exploration of biomedical literature derived from arbitrary PubMed queries. The system integrates ConvexTopics, a convex optimization-based topic modeling framework that guarantees convergence to a global optimum and eliminates the need for predefined parameters. This enables the generation of reproducible and fine-grained topic structures across large document collections. Given a PubMed query, BioTopicXplor retrieves relevant articles, performs topic discovery, and organizes the resulting subtopics into a hierarchical structure of higher-level themes. To enhance interpretability, the system incorporates large language models to generate concise, literature-grounded summaries and descriptive titles for each topic, with links to supporting evidence. We demonstrate the utility of BioTopicXplor through a case study on anti-aging research, where the system reveals meaningful thematic structures and supports knowledge discovery.
2024
Guidelines for the Annotation of Deliberate Linguistic Metaphor
Stefanie Dipper | Adam Roussel | Alexandra Wiemann | Won Kim | Tra-My Nguyen
Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024)
This paper presents guidelines for the annotation of deliberate linguistic metaphor. Expressions that contribute to the same metaphorical image are annotated as a chain along with a semantically contrasting expression of the target domain, which helps to make the domain contrast inherent to metaphor more explicit. So far, a corpus of ten TEDx talks with a total of ca. 20k tokens has been annotated according to these guidelines. 1.35% of the tokens are deliberate metaphorical expressions according to our guidelines, which shows that our guidelines successfully identify a significantly higher proportion of deliberate metaphorical expressions than previous studies.
2018
SingleCite: Towards an improved Single Citation Search in PubMed
Lana Yeganova | Donald C Comeau | Won Kim | W John Wilbur | Zhiyong Lu
Proceedings of the BioNLP 2018 workshop
A search that is targeted at finding a specific document in a database is called a single citation search. Single citation searches are particularly important for scholarly databases, such as PubMed, because users are frequently searching for a specific publication. In this work we describe SingleCite, a single citation matching system designed to facilitate users' searches for a specific document. We report on the progress that has been achieved towards building that functionality.
2016
PubTermVariants: biomedical term variants and their use for PubMed search
Lana Yeganova | Won Kim | Sun Kim | Rezarta Islamaj Doğan | Wanli Liu | Donald C Comeau | Zhiyong Lu | W John Wilbur
Proceedings of the 15th Workshop on Biomedical Natural Language Processing
2012
Classifying Gene Sentences in Biomedical Literature by Combining High-Precision Gene Identifiers
Sun Kim | Won Kim | Don Comeau | W. John Wilbur
BioNLP: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
2011
Text Mining Techniques for Leveraging Positively Labeled Data
Lana Yeganova | Donald C. Comeau | Won Kim | W. John Wilbur
Proceedings of BioNLP 2011 Workshop