Conference Proceedings

Content Discrimination of Comics Based on Users’ Reviews

Abstract

The objective of our research is to develop a search system to support the exploration of comics. To achieve this objective, it is necessary to obtain content information from comics, such as the world setting and characters. Consequently, we propose a method that obtains content information about comics from their reviews on the Web. Specifically, the proposed method determines a set of keywords that reflect the content of target comics in an appropriate manner and visualizes the relationships between comics by connecting them using their common keywords. Term frequency-inverted document frequency (TF-IDF) and latent Dirichlet allocation (LDA) algorithms are used to determine these keywords. TF-IDF determines keywords that reflect explicit information in the reviews, while LDA determines keywords that reflect implicit underlying topics in the comics. The results of two evaluations conducted to confirm the performance of these algorithms demonstrate that TFIDF helps to obtain meaningful keyword sets, while LDA currently produces relatively unclear keyword sets.

Information

Book title

Proc. 3rd Asian Conference on Information Systems

Pages

79-85

Date of issue

2014/12/01

Citation

Ryo Yamashita, Mitsunori Matsushita. Content Discrimination of Comics Based on Users’ Reviews, Proc. 3rd Asian Conference on Information Systems, pp.79-85, 2014.