• search hit 6 of 6
Back to Result List

Better data for more researchers – using the audio features of BNCweb

  • In spite of the wide agreement among linguists as to the significance of spoken language data, actual speech data have not formed the basis of empirical work on English as much as one would think. The present paper is intended to contribute to changing this situation, on a theoretical and on a practical level. On a theoretical level, we discuss different research traditions within (English) linguistics. Whereas speech data have become increasingly important in various linguistic disciplines, major corpora of English developed within the corpus-linguistic community, carefully sampled to be representative of language usage, are usually restricted to orthographic transcriptions of spoken language. As a result, phonological phenomena have remained conspicuously understudied within traditional corpus linguistics. At the same time, work with current speech corpora often requires a considerable level of specialist knowledge and tailor-made solutions. On a practical level, we present a new feature of BNCweb (Hoffmann et al. 2008), a user-friendly interface to the British National Corpus, which gives users access to audio and phonemic transcriptions of more than five million words of spontaneous speech. With the help of a pilot study on the variability of intrusive r we illustrate the scope of the new possibilities.
Metadaten
Author:Sebastian Hoffmann, Sabine Arndt-Lappe
URN:urn:nbn:de:hbz:385-1-19462
DOI:https://doi.org/10.2478/icame-2021-0004
Parent Title (English):ICAME Journal
Series (Volume no.):Trier Center for Language and Communication (45(1))
Document Type:Article
Language:English
Date of completion:2021/06/12
Date of publication:2021/06/12
Publishing institution:Universität Trier
Release Date:2023/01/30
Institutes:Fachbereich 2
Licence (German):License LogoCC BY-NC-ND: Creative-Commons-Lizenz 4.0 International

$Rev: 13581 $