SMILA 1.0 API documentation

org.eclipse.smila.importing.crawler.web.extractor
Interface LinkExtractorHtml

All Known Implementing Classes:
LinkExtractorHtmlNeko, LinkExtractorHtmlSoup

public interface LinkExtractorHtml

Extracts links from HTML Content.


Method Summary
 java.util.Collection<java.lang.String> extractLinks(java.io.InputStream input, AnyMap parameters)
           
 

Method Detail

extractLinks

java.util.Collection<java.lang.String> extractLinks(java.io.InputStream input,
                                                    AnyMap parameters)
                                                    throws java.lang.Exception
Returns:
links extracted from (HTML) input. F
Throws:
java.lang.Exception

SMILA 1.0 API documentation