Error processing and logs



Valid regex. The loaded page code must match every one of these regular expressions to be considered valid. If the page code fails to match at least one of them, Datacol treats the page as loaded with an error and does not collect data or links from it. If the setting is empty, no check is performed. Page validity should be checked if the source website can ban the parser or if pages are loaded through unchecked proxies.


Invalid regex. The loaded page code must not match any of these regular expressions to be considered valid. If the page code matches at least one of them, Datacol treats the page as loaded with an error and does not collect data or links from it. If the setting is empty, no check is performed. Page validity should be checked if the source website can ban the parser or if pages are loaded through unchecked proxies.
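
The two checks above can be sketched in a few lines of Python. This is only an illustration of the described rules, not Datacol's actual implementation: the function name page_is_valid and its parameters are hypothetical, and Python's re module is assumed here, so its regex syntax may differ from the engine Datacol uses.

```python
import re

def page_is_valid(page_code, valid_patterns, invalid_patterns):
    """Return True if the page passes both the valid and the invalid regex checks.

    Hypothetical helper: names and signature are assumptions for illustration.
    """
    # Every "valid" pattern must be found; an empty list disables this check.
    for pattern in valid_patterns:
        if re.search(pattern, page_code) is None:
            return False
    # No "invalid" pattern may be found; an empty list disables this check.
    for pattern in invalid_patterns:
        if re.search(pattern, page_code) is not None:
            return False
    return True

# Example: a complete page must contain the closing tag and must not be a ban page.
html = "<html><body><div class='product'>Item</div></body></html>"
print(page_is_valid(html, [r"</html>", r"class='product'"], [r"Access denied"]))
```

A typical choice for a valid regex is a fragment that appears only on fully loaded pages (such as the closing html tag), while an invalid regex usually targets text shown on ban or captcha pages.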


Max errors till stop. The maximum number of consecutive load errors allowed while a campaign is running. When Datacol has loaded this many pages with errors in a row (with no successful loads in between), it stops working. If the setting is zero, the restriction is ignored.


Return errors to queue. If this checkbox is ON, Datacol returns a URL that was loaded with an error to the Queue. The Max queue returns setting determines the maximum number of times a specific URL may be returned to the Queue. If Max queue returns is zero, the restriction is ignored (any URL may be returned to the Queue any number of times).
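
How the consecutive-error limit and the queue returns interact can be sketched as follows. This is a simplified model under stated assumptions, not Datacol's code: run_campaign, load_page, and all parameter names are hypothetical, and failed URLs are assumed to be appended to the end of the queue.

```python
from collections import deque

def run_campaign(urls, load_page, max_errors=0,
                 return_errors=True, max_queue_returns=0):
    """Process URLs until the queue is empty or the error limit is hit.

    load_page(url) is a hypothetical callback returning True on success.
    A value of 0 for max_errors or max_queue_returns means "no limit".
    """
    queue = deque(urls)
    returns = {}             # URL -> number of times it was returned to the queue
    consecutive_errors = 0   # reset on every successful load
    loaded = []

    while queue:
        url = queue.popleft()
        if load_page(url):
            consecutive_errors = 0
            loaded.append(url)
            continue

        consecutive_errors += 1
        if return_errors:
            count = returns.get(url, 0)
            # Return the URL unless it already used up its allowed returns.
            if max_queue_returns == 0 or count < max_queue_returns:
                returns[url] = count + 1
                queue.append(url)

        # Stop the campaign after too many errors in a row.
        if max_errors != 0 and consecutive_errors >= max_errors:
            break

    return loaded, returns
```

In this model, setting both limits to zero while returning errors to the queue would retry a permanently failing URL forever, which suggests keeping at least one of the two limits non-zero.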


Save log to files. If this checkbox is ON, the logs of the given campaign are saved to a TXT file in the Documents directory. The log filename is formed as follows: Logs_Dataco5_CAMPAIGN_TREE_RELATIVE_CAMPAIGN_PATH.csv.


Max log records. Limits the maximum number of log records. If the setting is zero, the restriction is ignored.
