Access to the BAWE Corpus

Campus Map

Working with Coventry University

Working at Coventry University

Coventry University is a diverse and exciting place to work and we share the enthusiasm of our staff and students to be the best at whatever they choose to do. As one of the City’s biggest employers, we offer some impressive benefits for our staff and are committed to delivering the very best opportunities. We have a comprehensive training, personal and professional development programme that provides our employees with the skills to enhance their performance in the workplace and grow in their careers. There are pension schemes, a generous holiday allowance and flexible working opportunities as well as lifestyle benefits including childcare vouchers, discounted membership to the £4 million sports and recreation centre and schemes such as Cycle to Work and the CU Car Share initiative.

View current job vacancies.


Staff portal

Access the central point of information for all staff across the University.


Student Portal

Check your assessments, access Solar and get course information.


Access to the BAWE Corpus

With your own Corpus Query Tools

Registered users can download the corpus from the Oxford Text Archive. BAWE is listed as resource number 2539. To date 1136 copies have been downloaded from this site. The corpus is suitable for use with concordancing programs such as WordSmith and AntConC.

With Sketch Engine Open Access*

The Sketch Engine open access interface will allow you to view concordance lines and surrounding contexts. You can select the files you want to examine by filtering for features contained in the file header (for example you can choose a specific genre family, or the discipline / level / gender / L1 of contributors).

This manual will help you get started with Sketch Engine.

See also:

  • Diana McCarthy's slides from the BAAL Corpus Linguistics SIG event, December 2010.
  • Mat Terrett’s blog ‘Applying Corpus Tools to EAP Instruction’.(August 15, 2016)

With Lextutor

Concordances from an untagged version of the BAWE corpus can be created using Lextutor. This is a freely accessible, easy-to-use tool for teachers and learners.

By Subscription

Subscribing to Sketch Engine provides access to a number pre-loaded corpora, including BASE and BAWE, and offers a wider range of search features. You can register for a 30-day free trial account.

*Notes about the BAWE corpus in Sketch Engine

This version of the corpus has been prepared by Paul Thompson and Alois Heuboeck at Reading University. The files have been tagged by Paul Rayson at Lancaster University for POS (CLAWS tagset) and for semantic category using WMatrix. The Sketch Engine website describes query options for this version, as some of the BAWE markup has been modified.

BAWE contains 6,506,995 running words, but in SketchEngine the total number of tokens is reported as 8,336,262. This is because the SketchEngine token counts include punctuation.