Volume 17 Number 5 October 2019


Optimization of Topic Recognition Model for News Texts Based on LDA

Hongbin Wang, Jianxiong Wang, Yafei Zhang, Meng Wang, Cunli Mao

https://doi.org/10.6025/jdim/2019/17/5/257-269

Abstract Latent Dirichlet Allocation (LDA) is the technique most commonly used in topic modeling methods, but it requires the number of topics generated by LDA to be specified for topic recognition modeling. Except the main iterative methods based on perplexity and nonparametric methods, recent research has no simple way to select the optimal number of topics in the model. Aiming... Read More


An Agent-Based Approach for Extracting Business Association Rules from Centralized Databases Systems

Nadjib Mesbahi, Merouane Zoubeidi, Abdelhak Merizig, Okba Kazar

https://doi.org/10.6025/jdim/2019/17/5/270-288

Abstract Today, enterprises use a variety of applications to manage day-by-day business activities using a large centralized database. Since a huge amount of data stored in this centralized database produced by the daily use of several systems, it is important to integrate decision- making tools to analyse and interpret these business data. For this purpose, Data Mining is a powerful technology that promote information and knowledge extraction from large... Read More


Subjective Sentiment Analysis for Arabic Newswire Comments

Sadik Bessou, Rania Aberkane

https://doi.org/10.6025/jdim/2019/17/5/289-295

Abstract This paper presents an approach based on supervised machine learning methods to discriminate between positive, negative and neutral Arabic reviews in online newswire. The corpus is labeled for subjectivity and sentiment analysis (SSA) at the sentence-level. The model uses both count and TF-IDF representations and apply six machine learning algorithms; Multinomial Naïve Bayes, Support Vector Machines (SVM), Random Forest, Logistic Regression, Multi-layer perceptron and k-nearest neighbors using uni-grams, bi-grams... Read More


Corpus-Based Techniques for Sentiment Lexicon Generation: A Review

Mohammad Darwich, Shahrul Azman Mohd Noah, Nazlia Omar, Nurul Aida Osman

https://doi.org/10.6025/jdim/2019/17/5/296-305

Abstract State-of-the-art sentiment analysis systems rely on a sentiment lexicon, which is the most essential feature that drives their performance. This resource is indispensable for, and greatly contributes to, sentiment analysis tasks. This is evident in the emergence of a large volume of research devoted to the development of automated sentiment lexicon generation algorithms. The task of tagging subjective words with... Read More