Measuring Beginner Friendliness of Japanese Web Pages explaining Academic Concepts by Integrating Neural Image Feature and Text Features

Hayato Shiokawa; Kota Kawaguchi; Bingcai Han; Takehito Utsuro; Yasuhide Kawada; Masaharu Yoshioka; Noriko Kando

doi:10.18653/v1/W18-3721

Measuring Beginner Friendliness of Japanese Web Pages explaining Academic Concepts by Integrating Neural Image Feature and Text Features

Hayato Shiokawa, Kota Kawaguchi, Bingcai Han, Takehito Utsuro, Yasuhide Kawada, Masaharu Yoshioka, Noriko Kando

Abstract

Search engine is an important tool of modern academic study, but the results are lack of measurement of beginner friendliness. In order to improve the efficiency of using search engine for academic study, it is necessary to invent a technique of measuring the beginner friendliness of a Web page explaining academic concepts and to build an automatic measurement system. This paper studies how to integrate heterogeneous features such as a neural image feature generated from the image of the Web page by a variant of CNN (convolutional neural network) as well as text features extracted from the body text of the HTML file of the Web page. Integration is performed through the framework of the SVM classifier learning. Evaluation results show that heterogeneous features perform better than each individual type of features.

Anthology ID:: W18-3721
Volume:: Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications
Month:: July
Year:: 2018
Address:: Melbourne, Australia
Editors:: Yuen-Hsien Tseng, Hsin-Hsi Chen, Vincent Ng, Mamoru Komachi
Venue:: NLP-TEA
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 143–151
Language:
URL:: https://aclanthology.org/W18-3721
DOI:: 10.18653/v1/W18-3721
Bibkey:
Cite (ACL):: Hayato Shiokawa, Kota Kawaguchi, Bingcai Han, Takehito Utsuro, Yasuhide Kawada, Masaharu Yoshioka, and Noriko Kando. 2018. Measuring Beginner Friendliness of Japanese Web Pages explaining Academic Concepts by Integrating Neural Image Feature and Text Features. In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, pages 143–151, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Measuring Beginner Friendliness of Japanese Web Pages explaining Academic Concepts by Integrating Neural Image Feature and Text Features (Shiokawa et al., NLP-TEA 2018)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-4/W18-3721.pdf

PDF Search