Method for Improving Automatic Word Categorization

A Ahmad, MR Amin - 2016 19th international conference on …, 2016 - ieeexplore.ieee.org

In this paper, we present Bengali word embeddings and it's application in the classification
of news documents. Word embeddings are multi-dimensional vectors that can be created by …

被引用次数：45 相关文章所有 2 个版本

[PDF] psu.edu

Bangla word clustering based on n-gram language model

S Ismail, MS Rahman - 2014 international conference on …, 2014 - ieeexplore.ieee.org

In this paper, we describe a method for producing Bangla word clusters based on semantic
and contextual similarity. Word clustering is important for parts of speech (POS) tagging …

被引用次数：34 相关文章所有 4 个版本

A framework for word clustering of Bangla sentences using higher order n-gram language model

A Husna, M Mostofa, A Khatun, J Islam… - … on Innovation in …, 2018 - ieeexplore.ieee.org

Clustering of words is the method that is used to partition the sets of words into subsets of
semantically similar words. Word clustering has crucial in many uses of natural language …

被引用次数：5 相关文章

[PDF] psu.edu

A new word clustering method for building n-gram language models in continuous speech recognition systems

M Bahrani, H Sameti, N Hafezi, S Momtazi - New Frontiers in Applied …, 2008 - Springer

In this paper a new method for automatic word clustering is presented. We used this method
for building n-gram language models for Persian continuous speech recognition (CSR) …

被引用次数：13 相关文章所有 7 个版本

[PDF] psu.edu

A POS-based fuzzy word clustering algorithm for continuous speech recognition systems

S Momtazi, H Sameti, M Bahrani… - 2007 9th International …, 2007 - ieeexplore.ieee.org

Using word base n-gram language models in continuous speech recognition systems is so
prevalent. For using this type of language models, we should extract them from large …

被引用次数：3 相关文章所有 3 个版本

[PDF] 139.91.210.27

A Possibilistic Approach for Building Statistical Language Models

S Momtazi, H Sameti - 2009 Ninth International Conference on …, 2009 - ieeexplore.ieee.org

Class-based n-gram language models are those most frequently-used in continuous speech
recognition systems, especially for languages for which no richly annotated corpora are …

被引用次数：2 相关文章所有 6 个版本

[PDF] academia.edu

[PDF][PDF] Hybrid syntactic category induction

B Jurish - Workshop on Computational Modeling of Language …, 2005 - academia.edu

Much research has been devoted to the task of learning lexical classes from unannotated
input text. Among the chief difficulties facing any approach to the unsupervised induction of …

被引用次数：1 相关文章所有 5 个版本

[引用][C] CHOOSING A DISTANCE METRIC FOR AUTOMATIC WORD CATEGORIZATION

EEKG Ucoluk