Basic Navigation

Parent Previous Next


On Basic Navigation tab you can define webpages to be used for collecting links to other website pages within given campaign. For more information on why we need to collect links to other webpages, please check basic program algorithm section.


Thus, Datacol will collect links (to other webpages) from certain webpage just in case, when webpage complies with following conditions:


1. Page URL matches at least one regex, specified in URL format setting (regex are specified one per string). If this setting is not defined, restriction is ignored (in this case only Page filter settings are significant for webpage link collecting suitability definition).


2. Page URL complies conditions of URL page filters:

a) Contains strings (or at least one of strings if Consider All option is OFF) specified in Must be present in URL setting.

b) Not contains strings (or at least one of strings if Consider All option is OFF) specified in Must be absent in URL setting.


3. Loaded webpage code complies with conditions of pagecode page filters:

а) Contains strings (or at least one of strings if Consider All option is OFF) specified in Must be present in pagecode setting.

b) Not contains strings (or at least one of strings if Consider All option is OFF) specified in Must be absent in pagecode setting.


For 2 and 3 paragraphs, if Regex option is ON, condition strings are processed as regexes. Thus URL (or pagecode) is checked for matching (or not matching) these regexes.


Parsing depth. This setting is used to limit website parsing depth. If Parsing depth is 0, Datacol will process just URLs from Starting URL list (collecting links to other webpages will not be accomplished). Note, when you harvest website where needed data are located deeper, Parsing depth default value must be increased.

Created with the Personal Edition of HelpNDoc: Create HTML Help, DOC, PDF and print manuals from 1 single source