Fahad Khan

Other people with similar names: Fahad Khan


2026

In this submission we introduce a workflow/pipeline for creating TEI editions of legacy dictionaries using a parser based on a context-free grammar (CFG). We do this by describing a project which we are currently carrying out and which aims to create a digital edition of an Old English dictionary, Clark-Hall’s "A Concise Anglo-Saxon Dictionary" using this approach. We begin the article by motivating our CFG-based approach, discussing its advantages and disadvantages, and comparing to it other approaches. We argue that this approach is suitable to certain kinds of dictionaries, such as Clark Hall’s. We then describe the microstructure of the dictionary itself with a view both to justifying the kinds of rules which we subsequently describe and to outlining the kinds of resources to which we believe our approach is best suited. We then describe the CFG parser itself and give an account of our experiments in parsing the dictionary. Finally, we outline the enrichment of the parsed dictionary with RDFa and the benefits it has for the published data.
In this paper we present a new framework for analysis, documenting and publishing resources about the recent linguistic phenomenon of algospeak. This proposed framework features the use of two semantic artifacts (both of which we make available as SKOS semantic artifacts in RDF), and a cross-lingual lexicon of algospeak terms which follows a schema intended to facilitate the comparison of algospeak across languages and cultural contexts. Our article also features a discussion of the use of algospeak in two non-anglophone contexts (Italian and Arabic) which resulted from a period of data collection which the authors undertook as preparation for the creation of our framework and the categories which underlie it.
In the context of evolving European and national policies for research infrastructure governance, this paper presents the contribution of a national consortium for language resources and technology to the construction of a national infrastructure for FAIR and interoperable language and cultural data within a broader Humanities and Heritage Open Science initiative. As the national node of a European research infrastructure for language resources, the consortium contributes to translating FAIR and Open Science principles into practice by integrating technical, methodological, and training dimensions. Its activities combine several coordinated components: FAIRification workflows and ontology-based metadata mediation to enhance semantic interoperability across infrastructures; the refactoring and exposure of services through a federated API gateway; and the implementation of a Linguistic Linked Open Data (LLOD) pilot for the validation, transformation, and publication of interoperable RDF datasets. A national training ecosystem — comprising a training platform and a FAIR learning library — supports capacity building and the creation of FAIR-by-design learning materials. Finally, a permanent research observatory monitors community practices and needs, providing evidence-based insights for the continuous improvement of services and training provision. Together, these components demonstrate a coherent strategy for implementing FAIR and Open Science at the national level, while ensuring alignment with major European and national initiatives in the SSH data ecosystem.