Abstract
In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better understanding of multi-media document and its structure which ultimately could result better cross-media information extraction. We also describe our proposed algorithm that segment document-based on the document modeling approach we described in this paper.- Anthology ID:
- L08-1312
- Volume:
- Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
- Month:
- May
- Year:
- 2008
- Address:
- Marrakech, Morocco
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/702_paper.pdf
- DOI:
- Cite (ACL):
- Lei Xia and José Iria. 2008. An Approach to Modeling Heterogeneous Resources for Information Extraction. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
- Cite (Informal):
- An Approach to Modeling Heterogeneous Resources for Information Extraction (Xia & Iria, LREC 2008)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/702_paper.pdf