Volume 6, Issue 3, June 2008


Web Information Extraction Using Web-specific Features

Jinlin Chen, Ping Zhong

https://doi.org/

Abstract Several problems exist with traditional HMM-based approaches to Web information extraction (IE) because they do not consider Web-specific features. To address this issue, we present a Generalized Hidden Markov Model (GHMM) that extends HMMs by making use of Web-specific information for Web IE. In the GHMM-based approach, Web content blocks instead of terms are used as basic...


A Recommendation Model Based on Site Semantics and Usage Mining

Sofia Stamou, Lefteris Kozanidis, Paraskevi Tzekou, Nikos Zotos

https://doi.org/

Abstract The explosive growth of online data and the diversity of goals that may be pursued over the web have significantly increased the monetary value of web traffic. To tap into this accelerating market, web site operators try to increase their traffic by customizing their sites to the needs of specific users. Web site customization involves three great challenges: (i)...


A Smart Card Based Remote User Authentication Scheme

Mohammed Misbahuddin, P. Premchand, A. Govardhan

https://doi.org/

Abstract Password-based authentication schemes are commonly used to authenticate remote users. Many schemes have been proposed, both with and without smart cards, but each has its own merits and demerits. This paper analyzes the security of an enhanced Dynamic ID-based remote user authentication scheme and shows that the enhanced scheme has major security weaknesses. The paper also presents a...


An Actor-Like Data Model for a Parallel DBMS

W.K Hidouchi, D.Ezegour

https://doi.org/

Abstract In this paper we present the new concept of “actor databases” (DB-Act), under study at the INI institute (Act21 project), where we are developing a parallel main-memory database system based on an actor-like data model. To achieve data distribution, we use a Scalable Distributed Data Structure (SDDS) called “distributed Compact Trie Hashing” (CTH*), currently in development at the...


Hybrid Clustering Approach for Term Partitioning in Document Data Sets

K. Thammi Reddy, M. Shashi, L. Prathap Reddy

https://doi.org/

Abstract Information retrieval is a major research area because of the accumulation of huge amounts of information in digital form. Various information retrieval techniques are based on the fact that the terms contained in a document, along with their frequency of occurrence, signify the semantics of the document. Recent attempts to find the relevant document for a context represent documents in a...


Learning Transfer Rules for Machine Translation from Parallel Corpora

Werner Winiwarter, University of Vienna

https://doi.org/

Abstract In this paper we present JETCAT, a Japanese-English transfer-based machine translation system. Our main research contribution is that the transfer rules are not handcrafted but are learnt automatically from a parallel corpus. The system has been implemented in Amzi! Prolog, which offers scalability for large rule bases, full Unicode support for Japanese characters, and several APIs for the seamless integration...