Please enable JavaScript to view this site.

Handling the British National Corpus 

Navigation: WordSmith's Handling

First, get a nice clean version!

Scroll Prev Top Next More

This section deals with how WordSmith handles BNC-type tags.

 

But first, it is best to get a nice clean version of your BNC. This is needed because

 

the BNC has always come in Unix format, not in normal Windows format

the files and folders they're stored in have their original BNC names and to most people these are meaningless

the XML version of the text is in UTF8, which is a widely-used but problematic standard which uses clumsy methods for representing many characters.

 

To do this, use Text Converter.