SMILA (incubation) API documentation

org.eclipse.smila.connectivity.framework.crawler.web.parse
Interface Parser

All Superinterfaces:
Configurable
All Known Implementing Classes:
BinaryParser, HtmlParser, JavascriptParserImpl

public interface Parser
extends Configurable

Interface for parsing raw content. Each web crawler parser must be registered as a declarative service that provides this interface.


Method Summary
 java.lang.String[] getContentTypes()
          Returns the array of content types supported by this parser.
 Parse getParse(Content content)
          Creates the parse for the given content.
 
Methods inherited from interface org.eclipse.smila.connectivity.framework.crawler.web.configuration.Configurable
getConf, setConf
 

Method Detail

getParse

Parse getParse(Content content)
Creates the parse for the given content.

Parameters:
content - the content to parse
Returns:
the parse created for the content

getContentTypes

java.lang.String[] getContentTypes()
Returns the array of content types supported by this parser.

Returns:
the array of supported content types.
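
Example (illustrative sketch only):

The following shows how the two methods documented above might fit together: a parser announces its content types through getContentTypes(), and a caller uses that array to pick a parser before calling getParse(Content). This is a sketch under stated assumptions, not SMILA code. The Content and Parse classes here are simplified, hypothetical stand-ins (the real classes are not described on this page), PlainTextParser and ParserDemo are invented for illustration, and the Configurable superinterface is left out because the signatures of getConf and setConf are not shown here. Registration as a declarative service is also not covered.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in for the SMILA Content class: raw bytes plus a content type.
final class Content {
    final byte[] data;
    final String contentType;

    Content(byte[] data, String contentType) {
        this.data = data;
        this.contentType = contentType;
    }
}

// Hypothetical stand-in for the SMILA Parse class: holds only the extracted text.
final class Parse {
    final String text;

    Parse(String text) {
        this.text = text;
    }
}

// Mirrors the two methods documented above. The real interface also extends
// Configurable, which this sketch omits.
interface Parser {
    String[] getContentTypes();

    Parse getParse(Content content);
}

// Hypothetical parser that decodes the raw bytes as UTF-8 and trims whitespace.
final class PlainTextParser implements Parser {
    @Override
    public String[] getContentTypes() {
        return new String[] {"text/plain"};
    }

    @Override
    public Parse getParse(Content content) {
        String text = new String(content.data, StandardCharsets.UTF_8).trim();
        return new Parse(text);
    }
}

final class ParserDemo {
    // Returns the first parser that lists the given content type, or null if none does.
    static Parser select(Parser[] parsers, String contentType) {
        for (Parser parser : parsers) {
            for (String supported : parser.getContentTypes()) {
                if (supported.equalsIgnoreCase(contentType)) {
                    return parser;
                }
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Parser[] parsers = {new PlainTextParser()};
        Content content = new Content(
                "  raw page text  ".getBytes(StandardCharsets.UTF_8), "text/plain");

        Parser parser = select(parsers, content.contentType);
        if (parser != null) {
            System.out.println(parser.getParse(content).text);
        }
    }
}
```

Returning null from select for an unsupported content type is a choice made for this sketch; how the crawler itself handles content that no registered parser supports is not stated on this page.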
