Stop word lists in free open-source software packages

J Nothman, H Qin, R Yurchak - … of workshop for NLP open source …, 2018 - aclanthology.org
Open-source software packages for language processing often include stop word lists.
Users may apply them without awareness of their surprising omissions (eg “hasn't” but not …

[PDF][PDF] Stopwords removal and its algorithms based on different methods

J Kaur, PK Buttar - International Journal of Advanced Research in …, 2018 - researchgate.net
In this paper analysis of different methods to remove stopwords in Punjabi language have
been done. For organizing unstructured text in order to implement stopwords removal …

On continent and script-wise divisions-based statistical measures for stop-words lists of international languages

JR Saini, RM Rakholia - Procedia Computer Science, 2016 - Elsevier
The data for the current research work was collected for 42 different International languages
encompassing 3 continents viz. Asia, Europe and South America. The data comprised of …

[HTML][HTML] Quality and safety management framework for intelligent construction: cases study in China

Y Dou, X Yan, T Li, M Wang, R Zheng… - KSCE Journal of Civil …, 2024 - Elsevier
Emerging digital technologies (EDTs) re-energize quality and safety management during the
intelligent transformation and upgrade of the construction sector. Building an innovative …

Integrating word status for joint detection of sentiment and aspect in reviews

A Bagheri - Journal of Information Science, 2019 - journals.sagepub.com
A crucial task in sentiment analysis is aspect detection: the step of selecting the aspects on
which opinions are expressed. This step anticipates the step of determining whether the …

Construction of morpheme-based Amharic stopword list for information retrieval system

T Yeshambel, J Mothe, Y Assabie - … of Science and Technology: 8th EAI …, 2021 - Springer
One of the major forms of pre-processing in information retrieval and many other text
processing applications is filtering out stopwords. They are ignored by many retrieval …

Construction of Amharic information retrieval resources and corpora

T Yeshambel, J Mothe, Y Assabie - Language Resources and Evaluation, 2024 - Springer
The development of information retrieval systems and natural language processing tools
has been made possible for many natural languages because of the availability of natural …

A Study on Corpus-based Stopword Lists in Indian Language IR

SS Sahu, S Pal - ACM Transactions on Asian and Low-Resource …, 2023 - dl.acm.org
We explore and evaluate the effect of different stopword lists (non-corpus-based and corpus-
based) in the information retrieval (IR) tasks with different Indian languages such as Bengali …

Constructing stoplists for historical languages

PJ Burns - Digital Classics Online, 2018 - journals.ub.uni-heidelberg.de
Stoplists are lists of words that have been filtered from documents prior to text analysis tasks,
usually words that are either high frequency or that have low semantic value. This paper …

PSWG: An automatic stop-word list generator for Persian information retrieval systems based on similarity function & POS information

MA Yaghoub-Zadeh-Fard… - … and Innovation (KBEI …, 2015 - ieeexplore.ieee.org
By the advent of new information resources, search engines have encountered a new
challenge since they have been obliged to store a large amount of text materials. This is …