SMILA 1.0 API documentation

org.eclipse.smila.importing.crawler.web.extractor
Class LinkExtractorHtmlSoup

java.lang.Object
  extended by org.eclipse.smila.importing.crawler.web.extractor.LinkExtractorHtmlSoup
All Implemented Interfaces:
LinkExtractorHtml

public class LinkExtractorHtmlSoup
extends java.lang.Object
implements LinkExtractorHtml

A LinkExtractorHtml implementation that uses TagSoup.


Constructor Summary
LinkExtractorHtmlSoup()
           
 
Method Summary
 java.util.Collection<java.lang.String> extractLinks(java.io.InputStream input, AnyMap parameters)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

LinkExtractorHtmlSoup

public LinkExtractorHtmlSoup()

Method Detail

extractLinks

public java.util.Collection<java.lang.String> extractLinks(java.io.InputStream input,
                                                           AnyMap parameters)
                                                    throws java.lang.Exception
Specified by:
extractLinks in interface LinkExtractorHtml
Returns:
the links extracted from the (HTML) input.
Throws:
java.lang.Exception
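
Example (illustrative sketch only):
The following is a minimal, self-contained sketch of what extracting links from an HTML input stream can look like. It is not the SMILA implementation: it uses the JDK's built-in Swing HTML parser as a stand-in for TagSoup, omits the AnyMap parameters argument (AnyMap is a SMILA type), and assumes that only the href attributes of <a> elements are of interest and that the input is UTF-8 encoded. The class name LinkExtractionSketch is hypothetical.

```java
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

/** Hypothetical helper, not part of SMILA. */
final class LinkExtractionSketch {

  private LinkExtractionSketch() {
  }

  /**
   * Collects the href values of all anchor elements in document order.
   * Assumption: the stream is UTF-8 encoded HTML.
   */
  static Collection<String> extractLinks(final InputStream input) throws Exception {
    final List<String> links = new ArrayList<>();

    // The callback is invoked by the parser for every start tag it encounters.
    final HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
      @Override
      public void handleStartTag(final HTML.Tag tag, final MutableAttributeSet attrs, final int pos) {
        if (tag == HTML.Tag.A) {
          final Object href = attrs.getAttribute(HTML.Attribute.HREF);
          if (href != null) {
            links.add(href.toString());
          }
        }
      }
    };

    try (Reader reader = new InputStreamReader(input, StandardCharsets.UTF_8)) {
      // 'true' tells the parser to ignore charset declarations inside the document.
      new ParserDelegator().parse(reader, callback, true);
    }
    return links;
  }
}
```

A caller would pass any InputStream over HTML content, for example a ByteArrayInputStream wrapping a fetched page, and iterate over the returned collection. The sketch returns links exactly as they appear in the markup; resolving relative links against a base URL is left to the caller.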
