Live subtitling with speech recognition: causes and consequences of text reduction

Luyckx, Bieke; Delbeke, Tijs; Van Waes, Luuk; Leijten, Mariëlle; Remael, Aline

Title

Author

Luyckx, Bieke

Delbeke, Tijs

Van Waes, Luuk

Leijten, Mariëlle

Remael, Aline

Abstract

Speech technology has made it possible to use speech recognition for simultaneous subtitling of live television broadcasts via the technique of respeaking. Despite the considerable prior research into the quality of live subtitling using speech recognition, little research has focused on the quantitative aspects of subtitles. Although live subtitles are nearly always a reduced form of the spoken comments, the exact causes of text reduction are still largely unidentified. This study aims at a better understanding of the causes and consequences of text reduction in a live subtitling context. Three excerpts of an infotainment talk show were subtitled by twelve respeakers of the Flemish public television. They were instructed to do this in three different reduction conditions. Various subtitle features, such as reduction percentages and delay, as well as measures of the respeakers working memory were collected. Both a quantitative and qualitative analysis were carried out. In the quantitative analysis we opted for a multilevel analysis to take into account the hierarchical nature of the data. In the qualitative analysis, we discussed the effects of commonly used reduction strategies. The results show that reduction is not a random process. In contrast, it is largely determined by a number of external factors, viz. delay, amount of source text and the proportion of full reductions. There is a large amount of evidence suggesting that respeakers prefer to omit certain comments rather than reducing them to a certain extent. It also appears that the decision to fully omit a comment seems not to be primarily based on the amount of input, while the decision to partially reduce is. Differences in the capacity of the working memory do not seem to affect text reduction as such. Finally, the qualitative analysis demonstrated that respeakers use a wide variety of strategies to reduce the spoken comments in order to limit the loss of information as much as possible.

Language

English

Source (series)

UA, Faculty of Applied Economics ; 2010:10

Publication

Antwerp : UA , 2010

Volume/pages

34 p.

Full text (open access)

https://repository.uantwerpen.be/docman/irua/7418cf/963a308c.pdf

Faculty/Department				Faculty of Business and Economics

Research group				Management

Publication type				Minutes and reports

Subject				Linguistics

Affiliation				Publications with a UAntwerp address

Identifier

Creation

20.05.2010

Last edited

07.10.2022

To cite this reference

https://hdl.handle.net/10067/822920151162165141