Yahoo extractor is a Datacol-based module that extracts yahoo.com SERP (search engine results page) items for specified keywords. The title, snippet and URL are extracted for each Yahoo SERP item. After the data is collected, the item information is exported to an xlsx file.
1. Install the Datacol trial version.
2. Choose seo-parsers/yahoo-extractor.par in the campaign tree and click the Start button to launch the Yahoo extractor campaign.
Before launching seo-parsers/yahoo-extractor.par you can adjust the Input data. To do so, select the campaign in the campaign tree. There you can set up the keywords to extract Yahoo SERP items for.
Please contact us if the Yahoo extractor does not collect data after you have changed the Starting URL list.
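Before pasting keywords into the Input data, it can help to tidy the list. The sketch below is not part of Datacol. It assumes the keywords are kept one per line (an assumption about the input format), and the helper name `clean_keywords` is hypothetical.

```python
def clean_keywords(raw_text):
    """Return unique, non-empty keywords in their original order.

    Assumption: one keyword per line. Duplicates are detected
    case-insensitively, and runs of whitespace are collapsed.
    """
    seen = set()
    keywords = []
    for line in raw_text.splitlines():
        keyword = " ".join(line.split())  # trim and collapse whitespace
        key = keyword.casefold()          # case-insensitive comparison key
        if keyword and key not in seen:
            seen.add(key)
            keywords.append(keyword)
    return keywords
```

Calling `"\n".join(clean_keywords(text))` gives a block of text with one cleaned keyword per line.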
3. Wait for the data extraction results to appear. Once you see the first results, you can stop the running campaign by clicking the Stop button.
4. After the campaign has finished or been stopped, you can find the yahoo-extractor.xlsx file in the Documents folder.
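If you want to process the exported file outside Datacol, a short script can read it. The following is a minimal sketch, not the module's own export logic. It assumes the first row of the sheet holds the column headers (the exact header names are not documented here) and that the third-party `openpyxl` package is installed. The function names `rows_to_records` and `load_serp_items` are hypothetical.

```python
from pathlib import Path


def rows_to_records(rows):
    """Convert rows (first row = header, an assumption) into a list of dicts."""
    rows = iter(rows)
    header = next(rows, None)
    if header is None:
        return []  # empty sheet
    names = ["" if h is None else str(h).strip() for h in header]
    records = []
    for row in rows:
        if all(cell is None for cell in row):
            continue  # skip completely empty rows
        records.append(dict(zip(names, row)))
    return records


def load_serp_items(path):
    """Read the first worksheet of the exported xlsx file."""
    from openpyxl import load_workbook  # third-party: pip install openpyxl

    workbook = load_workbook(Path(path), read_only=True)
    try:
        return rows_to_records(workbook.active.iter_rows(values_only=True))
    finally:
        workbook.close()
```

For example, `load_serp_items(Path.home() / "Documents" / "yahoo-extractor.xlsx")` would return one dictionary per SERP item, provided the Documents folder is at that location on your system.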
[spoiler show=”What if the Yahoo extractor is blocked (banned) by the source website?” hide=”What if the Yahoo extractor is blocked (banned) by the source website?”]
If the source website blocks your IP address (once blocked, you will get no more extraction results), use a proxy.
[/spoiler]
[/tab]
[tab name=”Data processing and Export”] Data processing options for information harvested by the Yahoo extractor: