Littera Deusto

Modern Languages, Basque Studies and Humanities

British National Corpus

mayo 12th, 2009 · No hay Comentarios

Una vez aclarado el concepto del corpus, es hora de centrarse en uno de los que concretamente mi grupo ha trabajado: British National Corpus (BNC). Ya  que el corpus aqui descrito es el britanico, lo mejor será definirlo y explicarlo en su idioma originario.

The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English from the later part of the 20th century, both spoken and written.

The written part of the BNC (90%) includes, for example, extracts from regional and national newspapers, specialist periodicals and journals for all ages and interests, academic books and popular fiction, published and unpublished letters and memoranda, school and university essays, among many other kinds of text.

The spoken part (10%) consists of orthographic transcriptions of unscripted informal conversations (recorded by volunteers selected from different age, region and social classes in a demographically balanced way) and spoken language collected in different contexts, ranging from formal business or government meetings to radio shows and phone-ins.

Source:

Retrieved from : Pagina oficial del BNC:   http://www.natcorp.ox.ac.uk/ May 12, 2009 10:02

Etiquetas:

  • Etiquetas