public interface LinkFilter
LinkExtractor to select only
those links that should really be followed in follow-up tasks.| Modifier and Type | Method and Description |
|---|---|
boolean |
allowLink(java.lang.String link,
WebCrawlingContext context)
Check if it is allowed to follow a given link.
|
java.util.Collection<Record> |
filterLinks(java.util.Collection<Record> extractedLinks,
WebCrawlingContext context)
filter extracted links.
|
java.util.Collection<Record> filterLinks(java.util.Collection<Record> extractedLinks, WebCrawlingContext context) throws WebCrawlerException
extractedLinks - result from LinkExtractor service.context - the WebCrawlingContext.WebCrawlerException - error in processing the links.boolean allowLink(java.lang.String link,
WebCrawlingContext context)
throws WebCrawlerException
link - a String containing the link to be checkedcontext - the WebCrawlingContext.WebCrawlerException - error in processing the links.