Volume 8 Issue 6 December 2010


Extraction of Characteristic Description for Analyzing News Agencies

Shin Ishida, Qiang Ma, Masatoshi Yoshikawa

https://doi.org/

Abstract News agencies report news from different viewpoints and with different writing styles and these differences often ap pear in their article descriptions. We propose a method to extract characteristic descriptions on certain entities (per sons, locations, organizations, etc.) in news agency articles. For a given entity, a description is one tuple (called an SVO tuple) consisting of that entity and... Read More


On Arabic Texts Compression and Searching

Hassen Sallay

https://doi.org/

Abstract With the dramatic increasing of electronic Arabic content, the text compression techniques will play a major role in several domains and applications such as search engines, data archiving, searching and retrieval from huge databases. Mainly the combination of compression and indexing techniques allows the interesting possibility to work directly on the compressed textual fi les or databases, which results saving... Read More


Framework For Mixed Entity Resolving System Using Unsupervised Clustering

Byung-Won On , Ingyu Lee

https://doi.org/

Abstract During web search, confusion can happen due to homonym when users use non-unique values as a search term of an entity. Especially, when parts of names of an entity were used as its identifi er, we call a mixed entity resolution problem whose goal is to clear out the erroneous entities. For example, if only last name is used as... Read More


Augmenting Digital Libraries with Web-Based Visualizations

Peter Bergström, Darren C. Atkinson

https://doi.org/

Abstract Digital libraries in their current form are bounded by their ineffcient webpage-based user interface paradigm, and even the most knowledgeable researchers can get lost in the large amount of published material available. A paper and its immediate references are displayed on a single webpage. Unfortunately, it is not readily apparent where the paper belongs in the greater context of a... Read More


Automatic Segmentation, Aggregation and Indexing of Multimodal News Information from Television and the Internet

Maurizio Montagnuolo, Alberto Messina, Roberto Borgotallo

https://doi.org/

Abstract The global diffusion of the Internet has enabled the distribution of informative content through dynamic media such as RSS feeds and video blogs. At the same time, the decreasing cost of electronic devices has increased the pervasive availability of the same informative content in the form of digital audiovisual data. This article presents a system for the large-scale unsupervised acquisition,... Read More


Mining the Blogosphere to Generate Cuisine Hotspot Maps

Chia-Chun Shih, Ting-Chun Peng, Wei-Shen Lai

https://doi.org/

Abstract Choosing a restaurant is one of the most frequent decisions faced in modern daily life; however, it is diffi cult for consumers to choose between food/restaurant by reading large amounts of reviews. This study attempts to generate cuisine hotspot maps through blog content mining to help consumers make restaurant decisions by specialties. The main obstacle in doing this involves recognizing... Read More