Estimating spatiotemporal focus of documents using entropy with PMI
Abstract
Many text documents are spatiotemporal in nature, i.e. contents of a document can be mapped to a specific time period or location. For example, a news article about the French Revolution can be mapped to year 1789 as time and France as place. Identifying this time period and location associated with the document can be useful for various downstream applications such as document reasoning or spatiotemporal information retrieval. In this paper, temporal entropy with pointwise mutual information (PMI) is proposed to estimate the temporal focus of a document. PMI is used to measure the association of words with time expressions. Moreover, a word’s temporal entropy is considered as a weight to its association with a time point and a single time point with the highest overall score is chosen as the focus time of a document. The proposed method is generic in the sense that it can also be applied for spatial focus estimation of documents. In the case of spatial entropy with PMI, PMI is used to calculate the association between words and place entities. The effectiveness of our proposed methods for spatiotemporal focus estimation is evaluated on diverse datasets of text documents. The experimental evaluation confirms the superiority of our proposed temporal and spatial focus estimation methods.
Source
Turkish Journal of Electrical Engineering and Computer SciencesVolume
28Issue
2Related items
Showing items related by title, author, creator and subject.
-
İlaç Taşıma Sistemleri Olarak Nanopartiküller Kullanılarak Pasif ve Aktif Tümör Hedeflemelerinin Karşılaştırmalı İncelenmesi
Dağlıoğlu, Cenk (2018)Nanopartikül-aracılı ilaç hedefleme kanser araştırmalarının aktif bir alanı olup, tümör dokusuna özgün antikanser etkinliği artırmada çok önemli bir potansiyele sahiptir. Bu çalışmada, hedefleme verimlilik oranlarının ... -
Isıl konfor sıcaklıklarına bağlı olarak bir konutun enerji performansının değerlendirmesi: İzmir örneği
Türkiye’de enerji tüketiminin yaklaşık %34’ü binalarda ve bunun %85 kadarı da ısıtma ve soğutma amaçlı kullanılmaktadır. Binalarda bulunan HVAC sistemlerinin işletme özellikleri, hem binanın ısıl konforunu hem de enerji ... -
Determination of triacylglycerol composition of Ayvalık and Memecik olive oils during storage by chemometric methods
Köseoğlu, Oya; Sevim, Didar; Özdemir, Durmuş (2017)The aim of present investigation is to discriminate two important Turkish olive cultivars (Ayvalık and Memecik) by studying their triacylglycerol (TAG) compositions during storage (15 months) taken from different orchard ...