Automatic close captioning for live Hungarian television broadcast speech: A fast and resource-ef...

P Smit, S Virpioja, M Kurimo - Interspeech, 2017 - research.aalto.fi

Because in agglutinative languages the number of observed word forms is very high,
subword units are often utilized in speech recognition. However, the proper use of subword …

Cited by 59 · Related articles · All 12 versions

[PDF] Leveraging a Character, Word and Prosody Triplet for an ASR Error Robust and Agglutination Friendly Punctuation Approach.

G Szaszák, MA Tündik - Interspeech, 2019 - isca-archive.org

Punctuating ASR transcript has received increasing attention recently, and well-performing
approaches were presented based on sequence-to-sequence modelling, exploiting textual …

Cited by 29 · Related articles · All 4 versions

[PDF] User-centric evaluation of automatic punctuation in ASR closed captioning

MÁ Tündik, G Szaszák, G Gosztolya, A Beke - 2018 - real.mtak.hu

Punctuation of ASR-produced transcripts has received increasing attention in the recent
years; RNN-based sequence modelling solutions which exploit textual and/or acoustic …

Cited by 23 · Related articles · All 9 versions

Joint word- and character-level embedding CNN-RNN models for punctuation restoration

MÁ Tündik, G Szaszák - 2018 9th IEEE International …, 2018 - ieeexplore.ieee.org

The sequence-to-sequence modelling paradigm has been successfully used in automatic
punctuation of text generated by Automatic Speech Recognizers (ASR), using bidirectional …

Cited by 22 · Related articles · All 3 versions

[PDF] Assessing the Semantic Space Bias Caused by ASR Error Propagation and its Effect on Spoken Document Summarization.

MA Tündik, V Kaszás, G Szaszák - INTERSPEECH, 2019 - isca-archive.org

Ambitions in artificial intelligence involve machine understanding of human language. The
state-of-the-art approach for Spoken Language Understanding is using an Automatic …

Cited by 10 · Related articles · All 5 versions

A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability

A Moró, G Szaszák - 2017 8th IEEE International Conference …, 2017 - ieeexplore.ieee.org

Speech communication human-machine interfaces exploit automatic speech recognition to
implement speech-to-text conversion. Unfortunately, in the past, not much effort has been …

Cited by 11 · Related articles

[PDF] Towards abstractive summarization in Hungarian

M Makrai, ÁM Tündik, B Indig, G Szaszák - XVIII. Magyar Számítógépes …, 2022 - hlt.bme.hu

We publish an abstractive summarizer for Hungarian, an encoder-decoder model initialized
with huBERT, and fine-tuned on the ELTE.DH corpus of former Hungarian news portals …

Cited by 6 · Related articles

[PDF] An audio-based sequential punctuation model for ASR and its effect on human readability

G Szaszák - Acta Polytechnica Hungarica, 2019 - epa.niif.hu

Inserting punctuation marks into the word chain hypothesis produced by automatic speech
recognition (ASR) has long been a neglected task. In several application domains of ASR …

Cited by 9 · Related articles · All 3 versions

A bilingual comparison of MaxEnt- and RNN-based punctuation restoration in speech transcripts

MÁ Tündik, B Tarján, G Szaszák - 2017 8th IEEE International …, 2017 - ieeexplore.ieee.org

Closed captioning is a common method to improve accessibility of TV programs for people
who are hearing impaired or hard of hearing, while representing an application relevant for …

Cited by 8 · Related articles · All 3 versions

Low Latency MaxEnt- and RNN-Based Word Sequence Models for Punctuation Restoration of Closed Caption Data

MÁ Tündik, B Tarján, G Szaszák - … , SLSP 2017, Le Mans, France, October …, 2017 - Springer

Automatic Speech Recognition (ASR) rarely addresses the punctuation of the
obtained transcriptions. Recently, Recurrent Neural Network (RNN) based models were …

Cited by 8 · Related articles · All 3 versions