Tired of topic models? clusters of pretrained word embeddings make for fast and good topics too!

S Sia, A Dalmia, SJ Mielke - arXiv preprint arXiv:2004.14914, 2020 - arxiv.org
Topic models are a useful analysis tool to uncover the underlying themes within document
collections. The dominant approach is to use probabilistic topic models that posit a …

The State of the Art of Natural Language Processing—A Systematic Automated Review of NLP Literature Using NLP Techniques

J Sawicki, M Ganzha, M Paprzycki - Data Intelligence, 2023 - direct.mit.edu
Nowadays, natural language processing (NLP) is one of the most popular areas of, broadly
understood, artificial intelligence. Therefore, every day, new research contributions are …

Clustering without over-representation

S Ahmadian, A Epasto, R Kumar… - Proceedings of the 25th …, 2019 - dl.acm.org
In this paper we consider clustering problems in which each point is endowed with a color.
The goal is to cluster the points to minimize the classical clustering cost but with the …

Trends, limitations and open challenges in automatic readability assessment research

S Vajjala - arXiv preprint arXiv:2105.00973, 2021 - arxiv.org
Readability assessment is the task of evaluating the reading difficulty of a given piece of text.
Although research on computational approaches to readability assessment is now two …

A neural pairwise ranking model for readability assessment

J Lee, S Vajjala - arXiv preprint arXiv:2203.07450, 2022 - arxiv.org
Automatic Readability Assessment (ARA), the task of assigning a reading level to a text, is
traditionally treated as a classification problem in NLP research. In this paper, we propose …

[PDF][PDF] 文本可读性的自动分析研究综述

吴思远, 蔡建永, 于东, 江新 - 中文信息学报, 2018 - researchgate.net
(1. 北京语言大学信息科学学院, 北京100083; 2. 北京语言大学对外汉语研究中心, 北京100083;
3. 北京语言大学汉语速成学院, 北京100083) 摘要: 文本可读性问题最初由教育学家提出 …

Fabra: French aggregator-based readability assessment toolkit

R Wilkens, D Alfter, X Wang, A Pintard… - Proceedings of the …, 2022 - aclanthology.org
In this paper, we present the FABRA: readability toolkit based on the aggregation of a large
number of readability predictor variables. The toolkit is implemented as a service-oriented …

Adjectives and adverbs in life sciences across 50 years: Implications for emotions and readability in academic texts

J Wen, L Lei - Scientometrics, 2022 - Springer
Writing in a clear and simple language is critical for scientific communications. Previous
studies argued that the use of adjectives and adverbs cluttered writing and made scientific …

[HTML][HTML] Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined …

S Li, W Shi - International Journal of Computational Intelligence …, 2023 - Springer
Textual-based factors have been widely regarded as a promising feature that can be applied
to financial issues. This study focuses on extracting both basic and semantic textual features …

Investigating readability of french as a foreign language with deep learning and cognitive and pedagogical features

K Yancey, A Pintard, T Francois - Lingue e linguaggio, 2021 - rivisteweb.it
In this paper, we report three experiments evaluating a variety of feature sets and models
intended to develop a new readability model for French as a Foreign Language (FFL) …