Bengali word embeddings and it's application in solving document classification problem
In this paper, we present Bengali word embeddings and its application in the classification
of news documents. Word embeddings are multi-dimensional vectors that can be created by …
Bangla word clustering based on n-gram language model
In this paper, we describe a method for producing Bangla word clusters based on semantic
and contextual similarity. Word clustering is important for parts of speech (POS) tagging …
A framework for word clustering of Bangla sentences using higher order n-gram language model
Clustering of words is the method that is used to partition the sets of words into subsets of
semantically similar words. Word clustering is crucial in many uses of natural language …
A new word clustering method for building n-gram language models in continuous speech recognition systems
In this paper a new method for automatic word clustering is presented. We used this method
for building n-gram language models for Persian continuous speech recognition (CSR) …
A POS-based fuzzy word clustering algorithm for continuous speech recognition systems
Using word base n-gram language models in continuous speech recognition systems is so
prevalent. For using this type of language models, we should extract them from large …
A Possibilistic Approach for Building Statistical Language Models
Class-based n-gram language models are those most frequently-used in continuous speech
recognition systems, especially for languages for which no richly annotated corpora are …
Hybrid syntactic category induction
B Jurish - Workshop on Computational Modeling of Language …, 2005 - academia.edu
Much research has been devoted to the task of learning lexical classes from unannotated
input text. Among the chief difficulties facing any approach to the unsupervised induction of …