Discussion Thread
Data Extractor
Message Thread
Extract any data, including email addresses and URLs from your files and webpages.
Posted in the Data Extractor Forum.
Search Engine Harvesting Not Working
The software doesn't work for search engine results that have emails on the page. The emails on the page look like these for example: sales@abc.com or jack@dfecorp.com
I suspect the emails are not being extracted because of the bolding. So the software needs to recognize that in all instances where there is an email address on the page that all characters after @ sign will be bolded.
I believe you answered this question for me before, but I can't find it in this forum's history. I had to re-install the software, so this rule got deleted on my end. The answer was adding a small piece of code to the Extraction Rules section. Could you re-post this answer if you can find it? Thanks!
Search Engine Harvesting Not Working
for this rul: "Extract Emails from Webpages"
change this:
objResults = document.body.outerHTML.match(objRE)
to this:
objResults = document.body.innerText.match(objRE)