Utility Programs > WebGetter > settings
These are
• | where the texts are to be stored. The folder you specify will act as a root. That is, if you specify c:\temp and search for "besteirol", results will be stored in c:\temp\besteirol. If you do another search on say "Oxford WordSmith Tools", results for that will go into c:\temp\WordSmithTools. |
• | timeout: the number of seconds after which WebGetter robot stops trying a given webpage if there's no response. Suggested value: 20 seconds. |
• | max simultaneous: WebGetter works by sending robots out simulaneously, each one requesting a different web page. Suggested value: 20. That is, up to 20 are being downloaded at once. |
• | language: you specify the language you require. |
• | minimum file length (suggested 20Kbytes): the minimum size for each text file downloaded from the web. Small ones may just contain links to a couple of pictures and nothing much else. |
• | minimum words (suggested: 300): after each download, WebGetter goes through the downloaded text file counting the number of words and won't save unless there are enough. |
• | required words: you may optionally type in some words which you require to be present in each download; you can insist they all be present or any 1 of these. |
Search Engines
Download a choice of search engines by pressing Engines. This gets the latest information about each search engine from www.lexically.net/downloads/searchengines.htm.
Advanced Options
If you work in an environment with a "Proxy Server", WebGetter will recognise this automatically and use the proxy unless you uncheck the relevant box. If in doubt ask your network administrator.
The grid of settings
This contains:
name
|
The Name to appear above, in the list of Search Engines
|
ignore
|
Websites not to visit when downloading (as opposed to requesting a list). That is, when WebGetter gets a page from Google, it only wants Google's list, not more Google web-pages.
|
URL
|
The URL where the Search Engine is found.
|
Searchstring
|
The search word syntax
|
Max
|
How many hits to try for on each contact
|
Next
|
|
Language
|
Required language
|
Other
|
|
The search word is specified more or less just as you do when you use the same Search Engine yourself. Few advanced settings for each Search Engine are used; you can try your own preferences by typing in the grid, in the Searchstring column. Learn each Search Engine's current settings by simply trying it and then adapt the Searchstring accordingly. Some Search Engines want to set cookies on your PC and this might cause a failure to download.
You can see the address line in the Advanced tab; WebGetter attempts to tell the Search Engine the search-word, the maximum number of hits to show per contact, what language to use, and how to get more.
See also: Display, Limitations
|