Settings

These are:

folder: where the texts are to be stored. The folder you specify will act as a root. That is, if you specify c:\temp and search for "besteirol", results will be stored in c:\temp\besteirol. If you do another search on, say, "Oxford WordSmith Tools", results for that will go into c:\temp\WordSmithTools.
timeout: the number of seconds after which a WebGetter robot stops trying a given web page if there is no response. Suggested value: 20 seconds.
max simultaneous: WebGetter works by sending robots out simultaneously, each one requesting a different web page. Suggested value: 20. That is, up to 20 pages are being downloaded at once.
language: you specify the language you require.
minimum file length (suggested: 20 Kbytes): the minimum size for each text file downloaded from the web. Smaller ones may just contain links to a couple of pictures and not much else.
minimum words (suggested: 300): after each download, WebGetter goes through the downloaded text file counting the number of words and won't save unless there are enough.
required words: you may optionally type in some words which you require to be present in each download; you can insist that all of them be present, or that any one of them be present.
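
How WebGetter applies these settings internally is not documented here. Purely as an illustration, the settings above could be sketched in Python as follows. The `Settings` class, the function names, the word-counting rule (runs of letters and digits) and the choice to measure file length in UTF-8 bytes are all assumptions made for this sketch, not WebGetter's own code.

```python
import re
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field
from pathlib import PureWindowsPath


@dataclass
class Settings:
    """Hypothetical container mirroring the settings listed above."""
    folder: str = r"c:\temp"
    timeout: int = 20                  # seconds before a robot gives up
    max_simultaneous: int = 20         # pages requested at once
    min_file_length: int = 20 * 1024   # bytes (assuming 1 Kbyte = 1024 bytes)
    min_words: int = 300
    required_words: list = field(default_factory=list)
    require_all: bool = True           # True: all required words; False: any one


def results_folder(settings, search_term):
    """The chosen folder acts as a root; each search gets a sub-folder."""
    return PureWindowsPath(settings.folder) / search_term


def worth_saving(text, settings):
    """Apply the minimum length, minimum words and required words checks."""
    if len(text.encode("utf-8")) < settings.min_file_length:
        return False
    words = re.findall(r"\w+", text.lower())
    if len(words) < settings.min_words:
        return False
    if settings.required_words:
        present = [w.lower() in words for w in settings.required_words]
        return all(present) if settings.require_all else any(present)
    return True


def download_all(urls, fetch, settings):
    """Send out up to max_simultaneous 'robots' and keep the texts that pass.

    fetch(url, timeout) is supplied by the caller; it should return the
    page text or raise an exception if there is no response in time.
    """
    def robot(url):
        try:
            return fetch(url, settings.timeout)
        except Exception:
            return None  # this robot gave up on its page

    with ThreadPoolExecutor(max_workers=settings.max_simultaneous) as pool:
        texts = list(pool.map(robot, urls))
    return [t for t in texts if t is not None and worth_saving(t, settings)]
```

With the default values, `results_folder(Settings(), "besteirol")` gives `c:\temp\besteirol`, matching the example in the folder setting.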

 

Search Engines

 

Download a choice of search engines by pressing Engines. This gets the latest information about each search engine from www.lexically.net/downloads/searchengines.htm.

 

 

Advanced Options

 

If you work in an environment with a "Proxy Server", WebGetter will recognise this automatically and use the proxy unless you uncheck the relevant box. If in doubt, ask your network administrator.
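
For readers who want to see what automatic proxy detection can look like in practice, here is a minimal Python sketch using the standard library. It is not WebGetter's own code; the function name `make_opener` and its `use_proxy` flag (standing in for the checkbox) are assumptions made for illustration.

```python
import urllib.request


def make_opener(use_proxy=True):
    """Return a URL opener that uses the system proxy, or bypasses it."""
    if use_proxy:
        # getproxies() reads the proxy settings from the environment or OS.
        handler = urllib.request.ProxyHandler(urllib.request.getproxies())
    else:
        # An empty mapping tells urllib not to use any proxy at all.
        handler = urllib.request.ProxyHandler({})
    return urllib.request.build_opener(handler)
```

Calling `make_opener(False)` corresponds to unchecking the box: requests made with the returned opener go out directly.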

 

The grid of settings

 

This contains:

 

name: the name to appear above, in the list of Search Engines.
ignore: websites not to visit when downloading (as opposed to requesting a list). That is, when WebGetter gets a page from Google, it only wants Google's list, not more Google web pages.
URL: the URL where the Search Engine is found.
Searchstring: the search word syntax.
Max: how many hits to try for on each contact.
Next:
Language: the required language.
Other:

 

The search word is specified more or less as you would specify it when using the same Search Engine yourself. Only a few of each Search Engine's advanced settings are used; you can try your own preferences by typing in the grid, in the Searchstring column. To learn a Search Engine's current syntax, simply try it in your browser and then adapt the Searchstring accordingly. Some Search Engines want to set cookies on your PC, and this might cause a failure to download.

 

You can see the address line in the Advanced tab; WebGetter attempts to tell the Search Engine the search-word, the maximum number of hits to show per contact, what language to use, and how to get more.
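
As a rough illustration of how such an address line might be assembled from the grid columns, here is a short Python sketch. The template placeholders `{word}`, `{max}` and `{lang}`, the function names, and the example engine address `search.example.com` are all hypothetical; the real Searchstring syntax is whatever appears in the grid for each Search Engine.

```python
from urllib.parse import quote_plus, urlsplit


def build_query_url(url, searchstring, search_word, max_hits, language):
    """Fill a hypothetical Searchstring template and append it to the URL."""
    query = searchstring.format(
        word=quote_plus(search_word),  # spaces become '+', etc.
        max=max_hits,
        lang=quote_plus(language),
    )
    return url + query


def should_visit(link, ignore):
    """True unless the link's host matches an entry in the 'ignore' column."""
    host = urlsplit(link).hostname or ""
    return not any(host == site or host.endswith("." + site) for site in ignore)


address = build_query_url(
    "https://search.example.com/find",   # URL column (made-up engine)
    "?q={word}&n={max}&hl={lang}",       # Searchstring column (made-up syntax)
    "Oxford WordSmith Tools",
    50,                                  # Max column
    "en",                                # Language column
)
```

Here `should_visit` plays the role of the ignore column: links that lead back to the Search Engine's own site are skipped when downloading.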

 

See also: Display, Limitations