Proxy scraper

[tab name=”Solution”]

Proxy scraper

Proxy scraper is Datacol-based module, which implements proxy list extraction from free proxy website. In our example proxy list is saved to xlsx file. You can also adjust Datacol export settings to publish it to database, other file formats etc.

Proxy extractor: data extraction results
Click image to enlarge

Main advantages of Datacol-based proxy scraper are listed below:

[/tab]
[tab name=”Test NOW!”]

Step by Step test of proxy scraper

To test proxy extractor:

1. Install Datacol trial version;
2. Choose seo-parsers/proxy-scraper.par in the campaign tree and click Start button to launch proxy extractor campaign.

Proxy crawler: starting data extraction
Click image to enlarge

Before launching seo-parsers/proxy-scraper.par you can adjust the Input data. Select the campaign in the campaign tree for this purpose. In this way you can setup links to proxy list sources you need to harvest.

Please contact us if the proxy extractor will not collect data after you have made changes to the Starting URL list.

Proxy scraper: setting Starting URL list
Click image to enlarge

3. Wait for data extraction results to appear. When you see the first results, you can force running campaign to stop (click Stop button).

Proxy harvester: working process
Click image to enlarge

4. After campaign is finished/stopped you can find proxy-scraper.xlsx file in Documents folder.

Proxy parser: data extraction results
Click image to enlarge

Datacol Trial VS Activated

Feature Trial License (Full version)
Preset default configuration for data extraction
50+
50+
Maximum data extraction results
Maximum 25
Unlimited
Free software updates
Free email tech support
Paid skype+teamviewer consultations
Paid setup

[spoiler show=”What if the proxy extractor is blocked (banned) by the source website?” hide=”What if the proxy extractor is blocked (banned) by the source website?”]
If the source website blocks your IP-address (after blocking you will get no more extraction results), use proxy.
[/spoiler]

[/tab]

[tab name=”Data processing and Export”]
Data processing options for information, collected by proxy extractor:

Data export options for information, collected by proxy extractor:

  • Basic: CSV/TXT/Database/Excel;
  • Online stores: Magento/PrestaShop/osCommerce/OpenCart/ZENCart/VirtueMart;
  • Content CMS: WordPress/Joomla/DLE;
  • All options.

[/tab]
[tab name=”Ask your question!”]
If you have any questions, related to proxy extractor, please ask via the contact form.
[/tab]

[end_tabset]

Scroll to Top