Abstract
Statistical information such as gasoline prices and approval ratings of political parties, that has time as one
of its domains and numerical values as its range is called time series information. You can obtain the trend of a
topic you are interested in by extracting time series information from documents on the topic and compiling it. You,
in turn, use the trend for the understanding of and information access from those documents. In order to support
mainly to design and implement such a information compilation system, we have constructed a corpus, by extracting
linguistic expressions describing the values and the behaviors of time series information from documents, in this case
newspaper articles, and coding those contents into a formal framework. This paper explains the coding schema used
for this corpus construction, and shows its appropriateness. This paper also includes findings obtained through the
corpus construction, such as some classification of syntactic patterns and the vocabulary for describing the behaviors
of time series information.
Information
Book title
人工知能学会論文誌
Volume
25
Pages
637-650
Date of issue
2010/08/26
DOI
10.1527/tjsai.25.637
Citation
加藤 恒昭, 松下 光範, 神門 典子. 時系列情報の値と変化に関する言語表現コーパスの構築 ―動向情報の情報編纂に向けて―, 人工知能学会論文誌, Vol.25, No.5, pp.637-650, 2010.