Help


Other tab


The Other tab enables you to configure miscellaneous settings such as the details of export and email settings.



Settings


Local file encoding (read and write)

This is the character encoding used when reading and writing files on the local disk. For example this encoding is used to load files when trawling a local site and when loading / saving files in the Editor.


Override local file encoding

If selected the local file encoding setting above will be overridden by encoding declarations in the files themselves. For example if an html file states it's encoding in it's <head> this will be used for both loading and saving.


HTML export limit (pages per file)

HTML exports will be paginated with no more than this amount of results in each file.


CSV export limit

When Pages per file is selected, csv exports will be paginated with no more than this amount of results in each file.


When Max file size is selected, csv exports will be paginated with each file being no larger than this size.


Delete browser viewer files upon exit

By default, when it closes down, DeepTrawl deletes all local files generated by previewing (e.g. using by clicking "View in Browser" next to an error). If this is switched off, the files remain so that the preview can be viewed when DeepTrawl is no longer running.


Generate & show Info links in browser viewer

DeepTrawl can show "Info" links in the errors list when View in Browser is used (this feature is deprecated, see the Deprecation tab to enable it). This is switched off by default because because it can cause large numbers of errors to slow down the generation of Browser Previews.


Form Action Capture Port

Port used when DeepTrawl temporarily acts as a web server in order to capture a form action. If you have problems capturing form actions due to your firewall blocking the port you may wish to change it (port 80 is commonly allowed). The default is an arbitrary port chosen to prevent clashes with other services, and set to a high number because services on low numbered ports are often blocked on Mac OSX computers.


Show DeepTrawl header on Form Action Capture page

By default, a DeepTrawl header appears at the top of all pages when capturing a form action. You may wish to deactivate this if there are problems with page layout.


Ignore case, "www" & trailing "/" when comparing Form Action trigger URLs

By default DeepTrawl decides whether to perform a Form Action on the basis of a URL's core name, regardless of case. This means that it ignores the "www.", the sub domain, any trailing slash, and the capitalization of the entire address.


Randomize Hyperlink / Image download order for each page

By default, DeepTrawl downloads the links and images in a random order for each page. This increases the speed of many trawls because it prevents multiple threads working on pages with similar structures from trying to download the same content at the same time.


Switch this off to have links and images appear in an order closer to that of the page being trawled.


Look for space instead of %20 for local files

If a local file's URL contains %20 (an encoded space), DeepTrawl looks for a filename with a corresponding space.