Please enable JavaScript to view this site.

Handling the British National Corpus 

Navigation: WordSmith's Handling > Word list

Word list of verbs only

Scroll Prev Top Next More

 

To achieve this, we will need to process the BNC's mark-up, using any that tells you a given word is a verb of some sort.

 

Here is a fragment of an original XML text:

BNC XM Orignal fragment

It contains some unnecessary mark-up. All we need is the words in yellow and an indication of their part of speech. We could just use the "VERB" attribute but I decided I wanted to know what sort of verb each one is.

 

Let's get rid of all the red bits.

 

BNC XM Orignal fragment marked deletion 2

leaving us with just this:

 

BNC XM Orignal fragment reduced 4

 

tog_plus        How to do that?

 

wordlist of verbs

See also: lemmatising the verbs.