[PDF][PDF] Development and Use of Computational Morphology of Finnish in the Open Source and Open Science Era: Notes on Experiences with Omorfi Development.
TA Pirinen - SKY Journal of Linguistics, 2015 - linguistics.fi
This article describes a contemporary system for the computational modelling of the
morphology of Finnish word-forms called Omorfi. The purpose of this article is to present …
morphology of Finnish word-forms called Omorfi. The purpose of this article is to present …
[PDF][PDF] A set of open source tools for Turkish natural language processing.
Ç Çöltekin - LREC, 2014 - coltekin.net
This paper introduces a set of freely available, open-source tools for Turkish that are built
around TRmorph, a morphological analyzer introduced earlier in Cöltekin (2010a). The …
around TRmorph, a morphological analyzer introduced earlier in Cöltekin (2010a). The …
[PDF][PDF] Open-source infrastructures for collaborative work on under-resourced languages
In order to support crowd sourcing for a language, certain social and technical prerequisites
must be met. Both the size of the community and the level of technical support available are …
must be met. Both the size of the community and the level of technical support available are …
Multilingwis–a multilingual search tool for multi-word units in multiparallel corpora
We describe a web-based application for searching translations of multi-word units in large,
openly available multiparallel corpora. This web application offers a unique resource for …
openly available multiparallel corpora. This web application offers a unique resource for …
Can Morphological Analyzers Improve the Quality of Optical Character Recognition?
M Silfverberg, J Rueter - Septentrio Conference Series, 2015 - septentrio.uit.no
Abstract Optical Character Recognition (OCR) can substantially improve the usability of
digitized documents. Language modeling using word lists is known to improve OCR quality …
digitized documents. Language modeling using word lists is known to improve OCR quality …
[PDF][PDF] Effect of language and error models on efficiency of finite-state spell-checking and correction
TA Pirinen, S Hardwick - … of the 10th International Workshop on …, 2012 - aclanthology.org
We inspect the viability of finite-state spellchecking and contextless correction of nonword
errors in three languages with a large degree of morphological variety. Overviewing …
errors in three languages with a large degree of morphological variety. Overviewing …
Baltic and nordic parts of the european linguistic infrastructure
This paper describes scientific, technical and legal work done on the creation of the
linguistic infrastructure for the Nordic and Baltic countries. The paper describes the research …
linguistic infrastructure for the Nordic and Baltic countries. The paper describes the research …
[PDF][PDF] Open-ource nfrastructuresfor ollaborative ork on nder-esourced anguages
In order to support crowd sourcing for a language, certain social and technical prerequisites
must be met. Both the size of the community and the level of technical support available are …
must be met. Both the size of the community and the level of technical support available are …
[PDF][PDF] Building a Finnish SOM-based ontology concept tagger and harvester
ASA Nyrkkö - International Workshop on Computational …, 2018 - researchportal.helsinki.fi
I demonstrate here an experiment of word sense disambiguation method based on the Self-
Organizing Map (SOM) and a pre-existing set of tools for analyzing text in Finnish. It is given …
Organizing Map (SOM) and a pre-existing set of tools for analyzing text in Finnish. It is given …
Multi-script morphological transducers and transcribers for seven Turkic languages
JN Washington, FM Tyers… - Proceedings of the …, 2020 - journals.linguisticsociety.org
This paper describes ongoing work to augment morphological transducers for seven Turkic
languages with support for multiple scripts each, as well as preliminary work adding IPA …
languages with support for multiple scripts each, as well as preliminary work adding IPA …