On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic...

Method and system for creating frugal speech corpus using internet resources and conventional speech corpus

S Kopparapu, IA Sheikh - US Patent 8,756,064, 2014 - Google Patents

A speech corpus creation method and system are disclosed. The method comprising
identifying a publicly accessible first source of the first speech data and its corresponding …

被引用次数：52 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] Unsupervised language model adaptation for automatic speech recognition of broadcast news using web 2.0.

T Schlippe, L Gren, NT Vu, T Schultz - Interspeech, 2013 - isca-archive.org

We improve the automatic speech recognition of broadcast news using paradigms from Web
2.0 to obtain time-and topicrelevant text data for language modeling. We elaborate an …

被引用次数：22 相关文章所有 9 个版本

[PDF] hal.science

Towards the automatic processing of language registers: Semi-supervisedly built corpus and classifier for french

G Lecorvé, H Ayats, B Fournier, J Mekki… - … and Intelligent Text …, 2019 - Springer

Abstract Language registers are a strongly perceptible characteristic of texts and speeches.
However, they are still poorly studied in natural language processing. In this paper, we …

被引用次数：5 相关文章所有 8 个版本

[PDF] hal.science

Construction conjointe d'un corpus et d'un classifieur pour les registres de langue en français

G Lecorvé, HA Ayats, B Fournier, J Mekki… - … du langage naturel …, 2018 - inria.hal.science

Les registres de langue sont un trait stylistique marquant dans l'appréciation d'un texte ou
d'un discours. Cependant, il sont encore peu étudiés en traitement automatique des …

被引用次数：3 相关文章所有 9 个版本

[PDF] hal.science

Adaptation thématique non supervisée d'un système de reconnaissance automatique de la parole

G Lecorvé - 2010 - theses.hal.science

Les systèmes actuels de reconnaissance automatique de la parole (RAP) reposent sur un
modèle de langue (ML) qui les aide à déterminer les hypothèses de transcription les plus …

被引用次数：4 相关文章所有 6 个版本

[PDF] hal.science

Toward robust information extraction models for multimedia documents

AR Ebadat - 2012 - theses.hal.science

During the last decade, huge amounts of multimedia documents have been generated. It is
therefore important to find a way to manage this data. Every approach to facilitate this …

被引用次数：3 相关文章所有 7 个版本

[PDF] irisa.fr

[PDF][PDF] Towards the Automatic Processing of Language Registers: Semi-supervisedly Built Corpus and Classifier for French

GLH Ayats, B Fournier, J Mekki, J Chevelu, D Battistelli… - people.irisa.fr

Language registers are a strongly perceptible characteristic of texts and speeches.
However, they are still poorly studied in natural language processing. In this paper, we …

[PDF] uni-bremen.de

[PDF][PDF] Unsupervised Language Model Adaptation for Automatic Speech Recognition of Broadcast News Using Web 2.0

IT Schultz, L Gren, DIT Schlippe, DINT Vu - csl.uni-bremen.de

We improve the automatic speech recognition of broadcast news using paradigms from Web
2.0 to obtain time-and topic-relevant text data for language modeling. We elaborate an …

Recherche d'information textuelle et phonétique pour le contrôle de l'étiquetage automatique d'émissions dans un flux télévisuel

C Guinaudeau - 4es rencontres des jeunes chercheurs en recherche d' …, 2009 - hal.science

En 2007, Naturel (Naturel, 2007) a proposé un système qui associe automatiquement une
étiquette, c'est-à-dire un titre, à des émissions issues du découpage d'un flux TV …