The Manta scraper is a Datacol-based module that extracts business information from the Manta directory. In this simple example, the data is saved to an xlsx file. You can also customize the Datacol export settings to publish the information to a database, other file formats, a CMS, etc.
1. Install the Datacol trial version.
2. Choose ad-parsers/manta.com.par in the campaign tree and click the Start button to launch the Manta extractor campaign.
Before launching ad-parsers/manta.com.par, you can adjust the Input data. To do so, select the campaign in the campaign tree. This lets you set up links to the Manta categories you want to extract companies from.
Please contact us if the Manta scraper stops collecting data after you have changed the Starting URL list.
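Before pasting links into the Starting URL list, it can help to check that each one actually points to manta.com. The following is a minimal sketch and not part of Datacol: the function names and the sample URLs are hypothetical, and the exact link format the campaign expects is not specified here, so the check only covers the scheme and the host.

```python
from urllib.parse import urlsplit


def is_manta_url(url):
    """Return True if url is an http(s) link on manta.com or one of its subdomains."""
    try:
        parts = urlsplit(url.strip())
    except ValueError:
        # urlsplit raises ValueError for some malformed inputs (e.g. a broken IPv6 host).
        return False
    host = (parts.hostname or "").lower()
    on_manta = host == "manta.com" or host.endswith(".manta.com")
    return parts.scheme in ("http", "https") and on_manta


def split_starting_urls(lines):
    """Split candidate lines into (valid, rejected), skipping blank lines."""
    valid, rejected = [], []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        (valid if is_manta_url(line) else rejected).append(line)
    return valid, rejected


if __name__ == "__main__":
    # Hypothetical sample input; the category path is made up for illustration.
    candidates = [
        "https://www.manta.com/example-category",
        "https://example.org/not-manta",
        "",
    ]
    valid, rejected = split_starting_urls(candidates)
    print("valid:", valid)
    print("rejected:", rejected)
```

Only the links reported as valid would then go into the Starting URL list. A link that passes this check may still fail to produce results if it is not a category page.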
3. Wait for the data extraction results to appear. Once you see the first results, you can force the running campaign to stop by clicking the Stop button.
4. After the campaign has finished or been stopped, you can find the manta.com.xlsx file in the Documents folder.
Datacol Trial VS Activated
Function
TRIAL
Activated
One webpage data extraction results export (file/CMS/database/email/SMS) in the testing mode
Mass data extraction results export (file/CMS/database/email/SMS) in the running mode
[spoiler show=”What if the Manta extractor is blocked (banned) by the source website?” hide=”What if the Manta extractor is blocked (banned) by the source website?”]
If the source website blocks your IP address (once blocked, you will get no more extraction results), use a proxy.
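Before entering a proxy in Datacol, you may want to confirm that the proxy itself can reach the source website. The sketch below is independent of Datacol and uses only the Python standard library. The proxy address in the usage example is a placeholder, and the helper names are hypothetical.

```python
import http.client
import urllib.request


def build_proxy_opener(proxy_url):
    """Return an opener that routes HTTP and HTTPS requests through proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


def proxy_is_reachable(proxy_url, test_url="https://www.manta.com/", timeout=10):
    """Return True if test_url answers with HTTP 200 when fetched through the proxy."""
    opener = build_proxy_opener(proxy_url)
    try:
        with opener.open(test_url, timeout=timeout) as response:
            return response.status == 200
    except (OSError, http.client.HTTPException):
        # Covers connection failures, timeouts, and HTTP errors such as 403.
        return False


if __name__ == "__main__":
    # Placeholder address: replace with a proxy you are allowed to use.
    print(proxy_is_reachable("http://127.0.0.1:8080"))
```

A result of False does not distinguish between a dead proxy and a proxy that the website also blocks. In either case, that proxy is unlikely to help the campaign.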
[/spoiler]
[/tab]
[tab name=”Data processing and Export”] Data processing options for information collected by the Manta extractor: