Development and Use of Computational Morphology of Finnish in the Open Source and Open Science Era: Notes on Experiences with Omorfi Development.

TA Pirinen - SKY Journal of Linguistics, 2015 - linguistics.fi
This article describes a contemporary system for the computational modelling of the
morphology of Finnish word-forms called Omorfi. The purpose of this article is to present …

A set of open source tools for Turkish natural language processing.

Ç Çöltekin - LREC, 2014 - coltekin.net
This paper introduces a set of freely available, open-source tools for Turkish that are built
around TRmorph, a morphological analyzer introduced earlier in Çöltekin (2010a). The …

Open-source infrastructures for collaborative work on under-resourced languages

S Moshagen, J Rueter, T Pirinen… - … and Computing for …, 2014 - syros.aegean.gr
In order to support crowd sourcing for a language, certain social and technical prerequisites
must be met. Both the size of the community and the level of technical support available are …

Multilingwis – a multilingual search tool for multi-word units in multiparallel corpora

S Clematide, J Graën, M Volk, G Corpas Pastor - 2016 - zora.uzh.ch
We describe a web-based application for searching translations of multi-word units in large,
openly available multiparallel corpora. This web application offers a unique resource for …

Can Morphological Analyzers Improve the Quality of Optical Character Recognition?

M Silfverberg, J Rueter - Septentrio Conference Series, 2015 - septentrio.uit.no
Optical Character Recognition (OCR) can substantially improve the usability of
digitized documents. Language modeling using word lists is known to improve OCR quality …

Effect of language and error models on efficiency of finite-state spell-checking and correction

TA Pirinen, S Hardwick - … of the 10th International Workshop on …, 2012 - aclanthology.org
We inspect the viability of finite-state spellchecking and contextless correction of nonword
errors in three languages with a large degree of morphological variety. Overviewing …

Baltic and Nordic parts of the European linguistic infrastructure

I Skadina, A Vasiljevs, L Borin, K Lindén… - …, 2013 - researchportal.helsinki.fi
This paper describes scientific, technical and legal work done on the creation of the
linguistic infrastructure for the Nordic and Baltic countries. The paper describes the research …

Open-source infrastructures for collaborative work on under-resourced languages

S Moshagen, J Rueter, T Pirinen, T Trosterud, FM Tyers - giellatekno.uit.no
In order to support crowd sourcing for a language, certain social and technical prerequisites
must be met. Both the size of the community and the level of technical support available are …

Building a Finnish SOM-based ontology concept tagger and harvester

ASA Nyrkkö - International Workshop on Computational …, 2018 - researchportal.helsinki.fi
I demonstrate here an experiment of word sense disambiguation method based on the Self-
Organizing Map (SOM) and a pre-existing set of tools for analyzing text in Finnish. It is given …

Multi-script morphological transducers and transcribers for seven Turkic languages

JN Washington, FM Tyers… - Proceedings of the …, 2020 - journals.linguisticsociety.org
This paper describes ongoing work to augment morphological transducers for seven Turkic
languages with support for multiple scripts each, as well as preliminary work adding IPA …