Morphological analyzer and generator for Russian and Ukrainian languages
M Korobov - Analysis of Images, Social Networks and Texts: 4th …, 2015 - Springer
Abstract pymorphy2 is a morphological analyzer and generator for Russian and Ukrainian
languages. It uses large efficiently encoded lexicons built from OpenCorpora and …
languages. It uses large efficiently encoded lexicons built from OpenCorpora and …
Information extraction meets the semantic web: a survey
JL Martinez-Rodriguez, A Hogan… - Semantic …, 2020 - content.iospress.com
We provide a comprehensive survey of the research literature that applies Information
Extraction techniques in a Semantic Web setting. Works in the intersection of these two …
Extraction techniques in a Semantic Web setting. Works in the intersection of these two …
[PDF][PDF] A fast morphological algorithm with unknown word guessing induced by a dictionary for a web search engine.
I Segalovich - MLMTA, 2003 - Citeseer
This paper describes a {simple yet practical} algorithm of morphological analysis and
synthesis that uses a limited dictionary to obtain {rather precise} morphology of a wide …
synthesis that uses a limited dictionary to obtain {rather precise} morphology of a wide …
Fast string correction with Levenshtein automata
KU Schulz, S Mihov - International Journal on Document Analysis and …, 2002 - Springer
The Levenshtein distance between two words is the minimal number of insertions, deletions
or substitutions that are needed to transform one word into the other. Levenshtein automata …
or substitutions that are needed to transform one word into the other. Levenshtein automata …
A hybrid approach to word segmentation of Vietnamese texts
L Hông Phuong, N Thi Minh Huyên… - Language and Automata …, 2008 - Springer
We present in this article a hybrid approach to automatically tokenize Vietnamese text. The
approach combines both finite-state automata technique, regular expression parsing and …
approach combines both finite-state automata technique, regular expression parsing and …
Indexing methods for approximate dictionary searching: Comparative analysis
L Boytsov - Journal of Experimental Algorithmics (JEA), 2011 - dl.acm.org
The primary goal of this article is to survey state-of-the-art indexing methods for approximate
dictionary searching. To improve understanding of the field, we introduce a taxonomy that …
dictionary searching. To improve understanding of the field, we introduce a taxonomy that …
Morfeusz—a practical tool for the morphological analysis of Polish
M Woliński - Intelligent Information Processing and Web Mining …, 2006 - Springer
Morfeusz — a Practical Tool for the Morphological Analysis of Polish Page 1 Marcin Woliński
Institute of Computer Science, Polish Academy of Sciences, ul. Ordona 21, 01-237 Warsaw …
Institute of Computer Science, Polish Academy of Sciences, ul. Ordona 21, 01-237 Warsaw …
Hfst tools for morphology–an efficient open-source package for construction of morphological analyzers
Morphological analysis of a wide range of languages can be implemented efficiently using
finite-state transducer technologies. Over the last 30 years, a number of attempts have been …
finite-state transducer technologies. Over the last 30 years, a number of attempts have been …
[图书][B] The Correctness-by-Construction Approach to Programming
The focus of this book is on bridging the gap between two extreme methods for developing
software. On the one hand, there are texts and approaches that are so formal that they scare …
software. On the one hand, there are texts and approaches that are so formal that they scare …
Correcting noisy OCR: Context beats confusion
J Evershed, K Fitch - Proceedings of the First International Conference …, 2014 - dl.acm.org
We describe a system for automatic post OCR text correction of digital collections of
historical texts. Documents, such as old newspapers, are often degraded, so even the best …
historical texts. Documents, such as old newspapers, are often degraded, so even the best …