public class DefaultLinkExtractor extends java.lang.Object implements LinkExtractor
LinkExtractor implementation using an HTML extractor.

| Constructor and Description |
|---|
| DefaultLinkExtractor() |

| Modifier and Type | Method and Description |
|---|---|
| java.util.Collection<Record> | extractLinks(Record inputRecord, WebCrawlingContext context) |
| void | setLinkExtractorHtml(LinkExtractorHtml linkExtractorHtml) Sets the HTML extractor implementation to use. |

public java.util.Collection<Record> extractLinks(Record inputRecord, WebCrawlingContext context) throws WebCrawlerException

Specified by: extractLinks in interface LinkExtractor

Parameters:
- inputRecord - input record with content
- context - the web crawling context

Throws:
- WebCrawlerException

public void setLinkExtractorHtml(LinkExtractorHtml linkExtractorHtml)
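
The summary above suggests a setter-injection pattern: an HTML extractor is supplied through setLinkExtractorHtml, and extractLinks presumably delegates to it. The following is a minimal sketch of that pattern only. It does not use the real library types: `HtmlLinkFinder`, `SketchLinkExtractor`, `RegexLinkFinder`, and the method `findLinks` are hypothetical stand-ins, plain strings replace Record and WebCrawlingContext, and the regex-based finder is an illustrative assumption rather than how LinkExtractorHtml works.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LinkExtractorSketch {

    // Hypothetical stand-in for the HTML extractor; the real LinkExtractorHtml
    // interface is not documented on this page.
    interface HtmlLinkFinder {
        Collection<String> findLinks(String html);
    }

    // Hypothetical stand-in that mirrors the setter-injection shape of
    // DefaultLinkExtractor, using plain strings instead of Record objects.
    static class SketchLinkExtractor {
        private HtmlLinkFinder finder;

        void setHtmlLinkFinder(HtmlLinkFinder finder) {
            this.finder = finder;
        }

        Collection<String> extractLinks(String htmlContent) {
            if (finder == null) {
                throw new IllegalStateException("no HTML extractor has been set");
            }
            // Delegate the actual parsing to the injected implementation.
            return finder.findLinks(htmlContent);
        }
    }

    // Naive finder that collects double-quoted href attribute values.
    // For illustration only; a real extractor would use an HTML parser.
    static class RegexLinkFinder implements HtmlLinkFinder {
        private static final Pattern HREF =
            Pattern.compile("href\\s*=\\s*\"([^\"]*)\"", Pattern.CASE_INSENSITIVE);

        @Override
        public Collection<String> findLinks(String html) {
            List<String> links = new ArrayList<>();
            Matcher m = HREF.matcher(html);
            while (m.find()) {
                links.add(m.group(1));
            }
            return links;
        }
    }

    // Wires the pieces together the way a caller might.
    static Collection<String> demo() {
        SketchLinkExtractor extractor = new SketchLinkExtractor();
        extractor.setHtmlLinkFinder(new RegexLinkFinder());
        return extractor.extractLinks(
            "<a href=\"http://example.org/a\">A</a> <a href=\"/b\">B</a>");
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

Keeping the HTML parsing behind a settable interface lets the parsing strategy be swapped without touching the code that calls extractLinks.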