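Constructing a Corpus of Linguistic Expressions on the Values and Changes of Time Series Information: Toward Information Compilation of Trend Information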

Tsuneaki Kato; Mitsunori Matsushita; Noriko Kando

doi:10.1527/tjsai.25.637

Abstract

Statistical information, such as gasoline prices or the approval ratings of political parties, that has time as one of its domains and numerical values as its range is called time series information. By extracting time series information from documents on a topic of interest and compiling it, one can obtain the trend of that topic. The trend, in turn, can be used to understand those documents and to access the information in them. Mainly to support the design and implementation of such an information compilation system, we have constructed a corpus by extracting linguistic expressions that describe the values and behaviors of time series information from documents, in this case newspaper articles, and coding their contents into a formal framework. This paper explains the coding schema used for the corpus construction and shows its appropriateness. It also reports findings obtained through the corpus construction, such as a classification of syntactic patterns and the vocabulary used to describe the behaviors of time series information.

Information

Journal

Transactions of the Japanese Society for Artificial Intelligence (人工知能学会論文誌)

Volume

25

Pages

637-650

Date of issue

2010/08/26

DOI

10.1527/tjsai.25.637

Citation

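Tsuneaki Kato, Mitsunori Matsushita, Noriko Kando. Constructing a Corpus of Linguistic Expressions on the Values and Changes of Time Series Information: Toward Information Compilation of Trend Information (original title: 時系列情報の値と変化に関する言語表現コーパスの構築 ―動向情報の情報編纂に向けて―), Transactions of the Japanese Society for Artificial Intelligence, Vol.25, No.5, pp.637-650, 2010.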