Comparison of text preprocessing methods

CP Chai - Natural Language Engineering, 2023 - cambridge.org
Text preprocessing is not only an essential step to prepare the corpus for modeling but also
a key area that directly affects the natural language processing (NLP) application results. For …

Data reduction in big data: a survey of methods, challenges and future directions

TT Khoei, A Singh - International Journal of Data Science and Analytics, 2024 - Springer
Data reduction plays a pivotal role in managing and analyzing big data, which is
characterized by its volume, velocity, variety, veracity, value, variability, and visibility …

[HTML][HTML] Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers

M Siino, I Tinnirello, M La Cascia - Information Systems, 2024 - Elsevier
With the advent of the modern pre-trained Transformers, the text preprocessing has started
to be neglected and not specifically addressed in recent NLP literature. However, both from …

AOH-Senti: aspect-oriented hybrid approach to sentiment analysis of students' feedback

A Kathuria, A Gupta, RK Singla - SN Computer Science, 2023 - Springer
Abstract Evaluation of students' feedback is essential in education as it helps the instructors
to check the effectiveness of their teaching. The feedback collected at the end of the …

Deep-learning-based natural-language-processing models to identify cardiovascular disease hospitalisations of patients with diabetes from routine visits' text

A Guazzo, E Longato, GP Fadini, ML Morieri… - Scientific Reports, 2023 - nature.com
Writing notes is the most widespread method to report clinical events. Therefore, most of the
information about the disease history of a patient remains locked behind free-form text …

Policy making in the financial industry: a framework for regulatory impact analysis using textual analysis

B Clapham, M Bender, J Lausen, P Gomber - Journal of Business …, 2023 - Springer
Regulators conduct regulatory impact analyses (RIA) to evaluate whether regulatory actions
fulfill the desired goals. Although there are different frameworks for conducting RIA, they are …

[PDF][PDF] Software Language Engineering-Text Processing Language Design, Implementation, Evaluation Methods

JW Lutalo - 2024 - preprints.org
Programming languages drive most if not all of modern problem-solving using
computational methods and power. Research into new programming languages and …

A comprehensive cross-language framework for harmful content detection with the aid of sentiment analysis

M Dehghani - arXiv preprint arXiv:2403.01270, 2024 - arxiv.org
In today's digital world, social media plays a significant role in facilitating communication and
content sharing. However, the exponential rise in user-generated content has led to …

Clustering Medical Transcriptions Using K-Means

S Salloum, D Tahat, K Tahat, R Alfaisal… - … and Services (ICCNS …, 2024 - ieeexplore.ieee.org
The clustering of medical transcriptions is an essential task for the categorization and
summarization of large volumes of medical records. This paper explores the efficacy of k …

[PDF][PDF] The Good, the Bad, and the Neutral: Twitter Users' Opinion on the ASUU Strike.

AS Muhammad, M Nat - Informing Sci. Int. J. an Emerg. Transdiscipl., 2022 - academia.edu
ABSTRACT Aim/Purpose Nigeria's university education goes through incessant strikes by
the Academic Staff Union of Universities (ASUU). This strike has led to shared emotion on …