skype
viber
whatsapp
Buy now
29$
350$
download trial

Metadata extractor

Metadata extractor is Datacol-based module, collecting metadata information (title, description, keywords, headers) from provided website list. Most often data are saved to xlsx file. You may also change Datacol export settings to publish metadata information to other file formats, database and popular content management systems (Joomla, DLE, WordPress) etc.

Metadata extractor: data extraction results

Click image to enlarge

Main advantages of Datacol-based metadata extractor are listed below:

Step by Step test of metadata extractor

To test metadata extractor:

1. Install Datacol trial version;
2. Choose seo-parsers/metadata-extractor.par in the campaign tree and click Start button to launch metadata extractor campaign.

Metadata crawler: starting data extraction

Click image to enlarge

Before launching seo-parsers/metadata-extractor.par you can adjust the Input data. Select the campaign in the campaign tree for this purpose. In this way you can setup website list to extract metadata from.

Please contact us if the metadata extractor will not collect data after you have made changes to the Starting URL list.

Metadata scraper: setting Starting URL list

Click image to enlarge

3. Wait for data extraction results to appear. When you see the first results, you can force running campaign to stop (click Stop button).

Metadata harvester: working process

Click image to enlarge

4. After campaign is finished/stopped you can find metadata-extractor.xlsx file in Documents folder.

Metadata parser: data extraction results

Click image to enlarge

Datacol Trial VS Activated

Feature Trial License (Full version)
Preset default configuration for data extraction
50+
50+
Maximum data extraction results
Maximum 25
Unlimited
Free software updates
Free email tech support
Paid skype+teamviewer consultations
Paid setup

What if the metadata extractor is blocked (banned) by the source website? »

If the source website blocks your IP-address (after blocking you will get no more extraction results), use proxy.

Data processing options for content, collected by metadata extractor:

Data export options for content, collected by metadata extractor:

  • Basic: CSV/TXT/Database/Excel;
  • Online stores: Magento/PrestaShop/osCommerce/OpenCart/ZENCart/VirtueMart;
  • Content CMS: WordPress/Joomla/DLE;
  • All options.

If you have any questions, related to metadata extractor, please ask via the contact form.

Datacol Google+ Datacol Facebook Datacol Tvitter Datacol Linkedin Datacol Youtube
site map
X
Do you have a question?
The project manager will contact you within 1 working day.