A comprehensive review and synthesis of open source research
The open source movement has grown steadily and matured in recent years, and this
growth has been mirrored by a rise in open source related research. The objective of this …
[CITATION][C] Phonetic Analysis of Speech Corpora
J Harrington - 2010 - books.google.com
An accessible introduction to the phonetic analysis of speech corpora, this workbook-style
text provides an extensive set of exercises to help readers develop the necessary skills to …
The NXT-format Switchboard Corpus: a rich resource for investigating the syntax, semantics, pragmatics and prosody of dialogue
S Calhoun, J Carletta, JM Brenier, N Mayo… - Language resources …, 2010 - Springer
This paper describes a recently completed common resource for the study of spoken
discourse, the NXT-format Switchboard Corpus. Switchboard is a long-standing corpus of …
[BOOK][B] Reinforcement learning for adaptive dialogue systems: a data-driven methodology for dialogue management and natural language generation
The past decade has seen a revolution in the field of spoken dialogue systems. As in other
areas of Computer Science and Artificial Intelligence, data-driven methods are now being …
GATE Teamware: a web-based, collaborative text annotation framework
K Bontcheva, H Cunningham, I Roberts… - Language Resources …, 2013 - Springer
This paper presents GATE Teamware—an open-source, web-based, collaborative text
annotation framework. It enables users to carry out complex corpus annotation projects …
Recognition and understanding of meetings: the AMI and AMIDA projects
The AMI and AMIDA projects are concerned with the recognition and interpretation of
multiparty meetings. Within these projects we have: developed an infrastructure for …
ANNIS: A search tool for multi-layer annotated corpora
A Zeldes, A Lüdeling, J Ritz, C Chiarcos - 2009 - edoc.hu-berlin.de
ANNIS (see Dipper & Götze 2005; Chiarcos et al. 2008) is a flexible web-based corpus
architecture for search and visualization of multi-layer linguistic corpora. By multi-layer we …
Towards open data for linguistics: Linguistic linked data
Abstract 'Open Data' has become very important in a wide range of fields. However for
linguistics, much data is still published in proprietary, closed formats and is not made …
The TEI and current standards for structuring linguistic data. An overview
M Stührenberg - Journal of the text encoding initiative, 2012 - journals.openedition.org
The TEI has served for many years as a mature annotation format for corpora of different
types, including linguistically annotated data. Although it is based on the consensus of a …
[PDF][PDF] A flexible framework for integrating annotations from different tools and tag sets
We present a general framework for integrating annotations from different tools and tag sets.
When annotating corpora at multiple linguistic levels, annotators may use different expert …