Volume 5 Issue 5 October 2007

Unsupervised Learning Aided by Clustering and Local-Global Hierarchical Analysis in Knowledge Exploration

Yihao Zhang, Mehmet A. Orgun, Weiqiang Lin

https://doi.org/

Abstract Unsupervised learning plays an important role in the knowlede exploration discovery. The basic task of unsupervised learning is to find latent variablesor relationships in a given dataset wihout any assumed regularities or patterns. In this paper we apply two advanced models, clustering analysis and hierarchial analysis to accomplish unsupervised learning. K-Means clustering presents its strength in large scale clustering. The... Read More


Pre-processing and Path Normalization of a Web Graph used as a Social Network

C.Bhanu Teja, S.Mitra, A.Bagchi, A.K.Bandyopadhyay

https://doi.org/

Abstract Many efforts have been made for compression of web graphs. Most of these methods are suitable for search engines and are centered around encoding links and URLs efficiently. The purpose is to handle a large set of web pages in the main memory against any web based search. The authors of the present paper are interested in studying web graph... Read More


The Infocious Web Search Engine: Improving Web Searching Through Linguistic Analysis

Alexandros Ntoulas, Gerald Chao, Junghoo Cho

https://doi.org/

Abstract In this paper we present the Infocious Web search engine [23], which currently indexes more than 2 billion pages collected from the Web. The main goal of Infocious is to enhance the way that people find relevant information on the Web by resolving ambiguities present in natural language text. Towards this goal, Infocious performs linguistic analysis to the content of... Read More


Information vs. Robustness in Rank Aggregation: Models, Algorithms and a Statistical Framework for Evaluation

Sibel Adal, Brandeis Hill, Malik Magdon-Ismail

https://doi.org/

Abstract The rank aggregation problem has been studied extensively in recent years with a focus on how to combine several different rankers to obtain a consensus aggregate ranker. We study the rank aggregation problem from a different perspective: how the individual input rankers impact the performance of the aggregate ranker. We develop a general statistical framework based on a model of... Read More


Personalized Searching by Learning WordNet-based User Profiles

Giovanni Semeraro

https://doi.org/

Abstract The amount of information available on the Web and in Digital Libraries is increasing over time. In this context, the role of user modeling and personalized information access is becoming crucial: Users need a personalized support in sifting through large amounts of retrieved information according to their interests. Information filtering and retrieval systems relying on this idea adapt their behavior... Read More


A Comparative Study on Key Phrase Extraction Methods in Automatic Web Site Summarization

Yongzheng Zhang, Evangelos Milios, Nur Zincir-Heywood

https://doi.org/

Abstract Web Site Summarization is the process of automatically generating a concise and informative summary for a given Web site. It has gained more and more attention in recent years as effective summarization could lead to enhanced Web information retrieval systems such as searching for Web sites. Extraction-based approaches to Web site summarization rely on the extraction of the most significant... Read More