Extract any data, including email addresses and URLs from your files and webpages.
Posted in the Data Extractor Forum.
How do you get the spider to skip pages or file types that crash it during the search?
Spidering
At http://www.semtech.com/products/, the Product Spotlight on the right is not supported by v3.3 when it loads in the preview, and it halts the extraction. Is there any way to skip that kind of page so the extraction doesn't get stopped?
If you click one of the links in the preview window it should let the extraction continue to the next page.
What if I just wanted to make it skip all PDF files? Does anyone know what kind of code I should use for that in the script?
It should skip PDF files automatically. Please make sure you have the latest version installed.
I have v3.3, which seems to be the most recent version. Unfortunately, it's not skipping over our PDF files, because their URLs end with something like the following:
downloadDocument.do?id=2225
Is there a way to have the extractor skip over URLs that contain the string "downloadDocument.do"? As soon as it hits one, the program crashes.
It should skip the file if the filename ends in .pdf; unfortunately, that isn't the case for you. Sorry, but we don't have a solution for that at this time.
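For anyone who can pre-filter a URL list outside the program, the two rules discussed in this thread (skip anything ending in .pdf, skip anything containing "downloadDocument.do") can be sketched in a few lines of Python. This is not Data Extractor's scripting language and does not use any of its API; the names `SKIP_SUBSTRINGS`, `SKIP_EXTENSIONS`, `should_skip`, and `filter_urls` are made up for illustration, and the example.com URLs are placeholders.

```python
from urllib.parse import urlparse

# Hypothetical skip rules taken from this thread; adjust them to your site.
SKIP_SUBSTRINGS = ("downloadDocument.do",)
SKIP_EXTENSIONS = (".pdf",)


def should_skip(url: str) -> bool:
    """Return True if the URL matches one of the skip rules."""
    # Check the extension on the path only, so a query string
    # such as ?id=2225 cannot hide a trailing .pdf.
    path = urlparse(url).path.lower()
    if path.endswith(SKIP_EXTENSIONS):
        return True
    # Case-insensitive substring check against the whole URL.
    lowered = url.lower()
    return any(s.lower() in lowered for s in SKIP_SUBSTRINGS)


def filter_urls(urls):
    """Keep only the URLs that should still be visited."""
    return [u for u in urls if not should_skip(u)]


if __name__ == "__main__":
    candidates = [
        "http://www.semtech.com/products/",
        "http://example.com/downloadDocument.do?id=2225",
        "http://example.com/docs/manual.pdf",
    ]
    for url in filter_urls(candidates):
        print(url)
```

The filtered list could then be fed to the extractor as its starting URLs, assuming your workflow allows supplying an explicit list rather than spidering from a single page.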