The tagged Icelandic corpus (MÍM)

S Helgadóttir, Á Svavarsdóttir… - Proceedings of the …, 2012 - academia.edu
In this paper, we describe the development of a morphosyntactically tagged corpus of
Icelandic, the MÍM corpus. The corpus consists of about 25 million tokens of contemporary …

The database of modern Icelandic inflection (Beygingarlýsing íslensks nútímamáls)

K Bjarnadóttir - Language Technology for Normalisation of Less …, 2012 - academia.edu
The topic of this paper is the Database of Modern Icelandic Inflection (DMII), containing
about 270,000 paradigms from Modern Icelandic, with over 5.8 million inflectional forms. The …

Developing a PoS-tagged corpus using existing tools

H Loftsson, JH Yngvason, S Helgadóttir… - … SaLTMiL Workshop on …, 2010 - academia.edu
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting
of about 1 million tokens. The goal is to use the corpus, among other things, as a new gold …

Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian

AK Ingason, H Loftsson, E Rögnvaldsson… - LREC, 2014 - Citeseer
This paper presents ongoing work that aims to improve machine parsing of Faroese using a
combination of Faroese and Icelandic training data. We show that even if we only have a …

Extraction of semantic relations as a basis for a future semantic database for Icelandic

A Nikulásdóttir, M Whelpton - … Workshop on Creation and Use of …, 2010 - academia.edu
This paper describes work in progress on semi-automatically constructing a semantic
database for Icelandic. The focus is on methods for the extraction of semantic relations used …

Developing an Icelandic to English shallow transfer machine translation system

MD Brandt - Language, 2011 - en.ru.is
This thesis describes the author's contribution to creating a prototype of a machine
translation system that translates from Icelandic to English. The prototype is an open source …

Speech recognition for telephone conversations in Icelandic using Kaldi

TD Gunnarsson - 2019 - diva-portal.org
In this thesis we train and evaluate an Automatic Speech Recognition system for phone
communication in Icelandic. We use Kaldi, an open source toolkit, to build both GMM-HMM …

Open source code and low resource languages

R Littauer, D Klakow, A Palmer - Saarland University, 2018 - raw.githubusercontent.com
Of the roughly seven thousand languages currently spoken, fewer than fifty have a
significant digital presence. In order for a language to be used digitally and to survive in the …

Lexicon Acquisition through Noun Clustering

AB Nikulásdóttir, M Whelpton - LexicoNordica, 2010 - tidsskrift.dk
This paper describes an experiment with clustering of Icelandic nouns based on semantic
relatedness. This work is part of a larger project aiming at semi-automatically constructing a …

From human-oriented dictionaries to computer-oriented lexical resources - trying to pin down words

M Whelpton - Orð og tunga, 2012 - ordogtunga.arnastofnun.is
Dictionaries are designed for the human user; electronic lexical resources are often
designed with computers in mind: to represent information about the form, use and meaning …