
Print ISSN: 0976-416X
Online ISSN: 0976-4178


International Journal of Computational Linguistics Research
 

 

New Rules and Improvement in the Stemming Process
Petra Antic
Faculty of Electronic Engineering, University of Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia
Abstract: To assess the effectiveness of stemming, one can use an error metric that measures the volume of incorrect stems produced during the stemming process. We carried out a stemming exercise for the Serbian language and added new stemming rules to reduce the errors. An evaluation after these improvements showed that the new rules give good results and are effective in correcting the stemmer.
Keywords: Stemmers, Serbian language, Error metric
DOI:https://doi.org/10.6025/jcl/2021/12/3/77-83
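
The idea in the abstract, counting incorrect stems before and after adding rules, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the suffix lists, the four-word gold sample, and the names `stem`, `error_rate`, `BASE_RULES` and `NEW_RULES` are hypothetical and are not taken from the paper, and the metric shown (share of words whose stem differs from a hand-assigned gold stem) is only one simple way to measure stemming errors.

```python
def stem(word, suffixes, min_stem_len=3):
    """Strip the longest matching suffix, keeping at least min_stem_len characters."""
    # Longer suffixes are tried first so that "ama" wins over "a".
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    return word


def error_rate(gold, suffixes):
    """Fraction of words whose produced stem differs from the gold stem."""
    wrong = sum(1 for word, expected in gold.items()
                if stem(word, suffixes) != expected)
    return wrong / len(gold)


# Hypothetical gold sample: word -> expected stem (illustrative only).
GOLD = {
    "knjiga": "knjig",
    "knjige": "knjig",
    "knjigama": "knjig",
    "gradovi": "grad",
}

# Hypothetical rule sets: a small base list and the same list with two added rules.
BASE_RULES = ["a", "e", "i"]
NEW_RULES = BASE_RULES + ["ama", "ovi"]

base_error = error_rate(GOLD, BASE_RULES)
improved_error = error_rate(GOLD, NEW_RULES)

print(f"error with base rules: {base_error:.2f}")
print(f"error with new rules:  {improved_error:.2f}")
```

On this toy sample the base rules mis-stem "knjigama" and "gradovi", and the two added suffix rules correct both. A real evaluation would need a much larger annotated word list and the actual rule set of the stemmer under test.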

