BABEL: An Eastern European multi-language database

Findings of the 2023 ml-superb challenge: Pre-training and evaluation over more languages and beyond

J Shi, W Chen, D Berrebbi, HH Wang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

The 2023 Multilingual Speech Universal Performance Benchmark (ML-SUPERB) Challenge
expands upon the acclaimed SUPERB framework, emphasizing self-supervised models in …

Cited by 8 · Related articles · All 5 versions

Development of a large spontaneous speech database of agglutinative Hungarian language

T Neuberger, D Gyarmathy, TE Gráczi… - Text, Speech and …, 2014 - Springer

In this paper, a large Hungarian spoken language database is introduced. This phonetically-
based multi-purpose database contains various types of spontaneous and read speech from …

Cited by 77 · Related articles · All 9 versions

Leveraging a Character, Word and Prosody Triplet for an ASR Error Robust and Agglutination Friendly Punctuation Approach

G Szaszák, MA Tündik - Interspeech, 2019 - isca-archive.org

Punctuating ASR transcript has received increasing attention recently, and well-performing
approaches were presented based on sequence-to-sequence modelling, exploiting textual …

Cited by 29 · Related articles · All 4 versions

FMRI repetition suppression for voices is modulated by stimulus expectations

A Andics, V Gál, K Vicsi, G Rudas, Z Vidnyánszky - Neuroimage, 2013 - Elsevier

According to predictive coding models of sensory processing, stimulus expectations have a
profound effect on sensory cortical responses. This was supported by experimental results …

Cited by 44 · Related articles · All 12 versions

AP16-OL7: A multilingual database for oriental languages and a language recognition baseline

D Wang, L Li, D Tang, Q Chen - 2016 Asia-Pacific Signal and …, 2016 - ieeexplore.ieee.org

We present the AP16-OL7 database which was released as the training and test data for the
oriental language recognition (OLR) challenge on APSIPA 2016. Based on the database, a …

Cited by 30 · Related articles · All 10 versions

The statistical signature of morphosyntax: A study of Hungarian and Italian infant-directed speech

J Gervain, RG Erra - Cognition, 2012 - Elsevier

Does statistical learning (Saffran, Aslin, & Newport, 1996) offer a universal segmentation
strategy for young language learners? Previous studies on large corpora of English and …

Cited by 38 · Related articles · All 9 versions

Ring that bell: A corpus and method for multimodal metaphor detection in videos

K Alnajjar, M Hämäläinen, S Zhang - arXiv preprint arXiv:2301.01134, 2022 - arxiv.org

We present the first openly available multimodal metaphor annotated corpus. The corpus
consists of videos including audio and subtitles that have been annotated by experts …

Cited by 8 · Related articles · All 10 versions

Bulgarian Speech Corpora: A Review

D Dimitrova - Preprint. https://www.researchgate.net/publication …, 2023 - researchgate.net

This paper aims to provide a summary of the existing corpora of spoken Bulgarian. The
corpora are examined for their scope, genre, speech spontaneity, presence of phonetic …

Cited by 4 · Related articles · All 4 versions

Word frequency cues word order in adults: cross-linguistic evidence

J Gervain, N Sebastián-Gallés, B Díaz, I Laka… - Frontiers in …, 2013 - frontiersin.org

One universal feature of human languages is the division between grammatical functors and
content words. From a learnability point of view, functors might provide entry points or …

Cited by 32 · Related articles · All 20 versions

Detecting English speech in the air traffic control voice communication

I Szoke, S Kesiraju, O Novotny, M Kocour… - arXiv preprint arXiv …, 2021 - arxiv.org

We launched a community platform for collecting the ATC speech world-wide in the ATCO2
project. Filtering out unseen non-English speech is one of the main components in the data …

Cited by 10 · Related articles · All 3 versions