
Sketch Engine

Search the BAWE Corpus

BAWE can be searched in five different ways, depending on the degree of detail you require.

1.  Sketch Engine Open
This open-access interface allows you to view concordance lines and their surrounding contexts. You can select the files you want to examine by filtering on features recorded in the file header (for example, you can choose a specific genre family, or the discipline, level, gender or L1 of contributors).

This manual will help you get started with Sketch Engine: Using Sketch Engine with BAWE

2.  Visualisations
These give a quick view of collocations and word contexts in subsections of the corpus.

3.  The Search Interface
This interface allows filtered searching of the corpus files. It is a prototype, so it occasionally fails.
Specifications of the search interface

4.  Sketch Engine by Subscription
This provides access to a number of pre-loaded corpora, including BASE and BAWE, and offers a wider range of search features. You can register for a 30-day free trial account.

5.  Your own Corpus Query Tools
Registered users can download the corpus from the Oxford Text Archive. BAWE is listed as resource number 2539. The corpus is suitable for use with concordancing programs such as AntConc and WordSmith Tools.
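
For readers who download the corpus and want a feel for what a concordancing program does, the following is a minimal keyword-in-context (KWIC) sketch in Python. It is an illustration only: the function name `kwic`, the `width` parameter and the sample sentence are hypothetical, and it assumes the text has already been read into a plain string. It does not reproduce how AntConc, WordSmith Tools or Sketch Engine work internally.

```python
import re

def kwic(text, keyword, width=30):
    """Return keyword-in-context lines for each whole-word match of keyword.

    Hypothetical helper for illustration; real concordancers offer
    sorting, wildcards, tag-based queries and much more.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    lines = []
    for match in pattern.finditer(text):
        # Take up to `width` characters on each side of the match.
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the keywords line up in a column.
        lines.append(f"{left:>{width}} [{match.group(0)}] {right:<{width}}")
    return lines

# Invented sample text, not taken from BAWE.
sample = "The corpus is large. A corpus can be searched in several ways."
for line in kwic(sample, "corpus", width=15):
    print(line)
```

Aligning the keyword in a fixed column is what makes recurring patterns to its left and right easy to spot by eye.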

Notes about the BAWE corpus in Sketch Engine
This version of the corpus has been prepared by Paul Thompson and Alois Heuboeck at Reading University. The files have been tagged by Paul Rayson at Lancaster University for POS (CLAWS tagset) and for semantic category using WMatrix. The Sketch Engine website describes query options for this version, as some of the BAWE markup has been modified.

BAWE contains 6,506,995 running words, but Sketch Engine reports a total of 8,336,262 tokens. This is because Sketch Engine token counts include punctuation.
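
The gap between the two figures can be worked out directly, and the effect of counting punctuation can be seen on a small example. The sketch below is a rough illustration in Python: the regular expression and the function `count_words_and_tokens` are assumptions made for this example and are not the tokenisation rules actually used by Sketch Engine or by the BAWE compilers.

```python
import re

# Figures quoted on this page.
running_words = 6_506_995
reported_tokens = 8_336_262

# Tokens that are not counted as running words.
extra_tokens = reported_tokens - running_words
share = extra_tokens / reported_tokens
print(f"{extra_tokens:,} extra tokens ({share:.1%} of all tokens)")

# Assumed tokeniser for illustration only: a word (optionally with internal
# apostrophes or hyphens) is one token, and each punctuation mark is one token.
TOKEN_RE = re.compile(r"\w+(?:['-]\w+)*|[^\w\s]")

def count_words_and_tokens(text):
    """Return (word_count, token_count) under the assumed tokeniser."""
    tokens = TOKEN_RE.findall(text)
    # A token counts as a word if it contains at least one letter or digit.
    words = [t for t in tokens if any(ch.isalnum() for ch in t)]
    return len(words), len(tokens)

# Invented sample sentence, not taken from BAWE.
sample = "The results, however, were unclear (see Table 2)."
words, tokens = count_words_and_tokens(sample)
print(f"{words} words, {tokens} tokens")
```

On these figures the difference is 1,829,267 tokens. Under the assumed tokeniser, the sample sentence yields 8 words but 13 tokens, which shows how quickly punctuation inflates a token count.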

 
