WordSmith Tools Manual

KeyWords: text dispersion

The point of it

The idea, as explained by Egbert and Biber (2019), is to support the study of sub-corpora rather than single texts. They argue that key words identified by text dispersion, that is, by the number of texts a word occurs in rather than by its total frequency, are more representative of the sub-corpus as a whole.

How to do it

In the Controller settings, check the text dispersion box.

 

keywords_settings_in_controller_text_dispersion

Choose your sub-corpus and reference corpus files, remembering that what will count is the number of texts each word is found in.  
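To make "the number of texts each word is found in" concrete, here is a minimal Python sketch of that count. It is an illustration only, not WordSmith's own code: the function name `text_range`, the simple regular-expression tokenizer and the three sample sentences are all assumptions made for this example.

```python
from collections import Counter
import re


def text_range(texts):
    """Map each word to the number of texts it occurs in (hypothetical helper)."""
    counts = Counter()
    for text in texts:
        # A set makes each word count once per text, however often it recurs.
        words = set(re.findall(r"[a-z']+", text.lower()))
        counts.update(words)
    return counts


# Invented sample data: three very short "texts".
sample = [
    "The patient received the drug.",
    "The drug trial ended.",
    "A patient recovered.",
]
ranges = text_range(sample)
print(ranges["the"], ranges["trial"])
```

Note that "the" occurs three times in the sample but in only two texts, so its text count is 2. That difference between token frequency and text count is what the text dispersion setting relies on.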

 

Example

 

This was computed using the 24 BNC written academic medicine texts. The reference corpus was the whole BNC XML edition.

 

text_dispersion_keyness_medicine

 

Percentage columns

In a text dispersion key words display, the Freq. column (as before) shows the token frequency of each key word. The % column now gives the percentage of texts in the sub-corpus (here 24 in total) that the key word was found in. The Texts column is as before. The RC % column gives the percentage of texts in the reference corpus (here over 4,000) that the key word was found in.
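The arithmetic behind these columns can be sketched in Python as follows. This is a rough illustration under stated assumptions, not WordSmith's implementation: `dispersion_columns`, the tokenizer and the tiny sample corpora are hypothetical, and the keyness statistic itself is not computed here.

```python
import re


def tokenize(text):
    """Lower-case a text and split it into word tokens (simplified tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def dispersion_columns(word, study_texts, reference_texts):
    """Compute the four column values for one word (hypothetical helper)."""
    word = word.lower()
    study_tokens = [tokenize(t) for t in study_texts]
    ref_tokens = [tokenize(t) for t in reference_texts]
    # Freq.: total token occurrences across the sub-corpus.
    freq = sum(tokens.count(word) for tokens in study_tokens)
    # Texts: number of sub-corpus texts containing the word at least once.
    texts = sum(1 for tokens in study_tokens if word in tokens)
    # Number of reference corpus texts containing the word at least once.
    rc_texts = sum(1 for tokens in ref_tokens if word in tokens)
    return {
        "Freq.": freq,
        "%": 100.0 * texts / len(study_texts),
        "Texts": texts,
        "RC %": 100.0 * rc_texts / len(reference_texts),
    }


# Invented sample data: 2 sub-corpus texts and 4 reference texts.
study = ["The drug helped. The drug works.", "No effect seen."]
reference = ["A drug was given.", "Prices rose.", "Rain fell.", "Trade grew."]
row = dispersion_columns("drug", study, reference)
print(row)
```

In this toy case "drug" has a token frequency of 2 but occurs in only 1 of the 2 sub-corpus texts, so the % value is 50. It occurs in 1 of the 4 reference texts, so RC % is 25.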

 

Results below were derived from the 44 commerce texts:

 

text_dispersion_keyness_commerce