Bengali word embeddings and it's application in solving document classification problem

A Ahmad, MR Amin - 2016 19th international conference on …, 2016 - ieeexplore.ieee.org
In this paper, we present Bengali word embeddings and it's application in the classification
of news documents. Word embeddings are multi-dimensional vectors that can be created by …

Bangla word clustering based on n-gram language model

S Ismail, MS Rahman - 2014 international conference on …, 2014 - ieeexplore.ieee.org
In this paper, we describe a method for producing Bangla word clusters based on semantic
and contextual similarity. Word clustering is important for parts of speech (POS) tagging …

A framework for word clustering of Bangla sentences using higher order n-gram language model

A Husna, M Mostofa, A Khatun, J Islam… - … on Innovation in …, 2018 - ieeexplore.ieee.org
Clustering of words is the method that is used to partition the sets of words into subsets of
semantically similar words. Word clustering has crucial in many uses of natural language …

A new word clustering method for building n-gram language models in continuous speech recognition systems

M Bahrani, H Sameti, N Hafezi, S Momtazi - New Frontiers in Applied …, 2008 - Springer
In this paper a new method for automatic word clustering is presented. We used this method
for building n-gram language models for Persian continuous speech recognition (CSR) …

A POS-based fuzzy word clustering algorithm for continuous speech recognition systems

S Momtazi, H Sameti, M Bahrani… - 2007 9th International …, 2007 - ieeexplore.ieee.org
Using word base n-gram language models in continuous speech recognition systems is so
prevalent. For using this type of language models, we should extract them from large …

A Possibilistic Approach for Building Statistical Language Models

S Momtazi, H Sameti - 2009 Ninth International Conference on …, 2009 - ieeexplore.ieee.org
Class-based n-gram language models are those most frequently-used in continuous speech
recognition systems, especially for languages for which no richly annotated corpora are …

[PDF][PDF] Hybrid syntactic category induction

B Jurish - Workshop on Computational Modeling of Language …, 2005 - academia.edu
Much research has been devoted to the task of learning lexical classes from unannotated
input text. Among the chief difficulties facing any approach to the unsupervised induction of …

[引用][C] CHOOSING A DISTANCE METRIC FOR AUTOMATIC WORD CATEGORIZATION

EEKG Ucoluk