|
SMILA (incubation) API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.eclipse.smila.connectivity.framework.crawler.web.parse.html.HtmlParser
public class HtmlParser
The Class HtmlParser.
| Constructor Summary | |
|---|---|
HtmlParser()
|
|
| Method Summary | |
|---|---|
Configuration |
getConf()
Return the configuration used by this object. |
java.lang.String[] |
getContentTypes()
Returns array of content-types that are supported by this parser. |
Parse |
getParse(Content content)
Returns the Parse result for the given Content. |
void |
setConf(Configuration configuration)
Set the configuration to be used by this object. |
void |
setJavascriptParser(Parser parser)
Sets javascript parser reference that is needed for extracting js links. |
void |
unsetJavascriptParser(Parser parser)
Removes javascript parser reference. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public HtmlParser()
| Method Detail |
|---|
public Parse getParse(Content content)
Parse result for the given Content.
getParse in interface Parsercontent - Content to be parsed.
public void setConf(Configuration configuration)
setConf in interface Configurableconfiguration - Configurationpublic Configuration getConf()
getConf in interface Configurablepublic java.lang.String[] getContentTypes()
getContentTypes in interface Parserpublic void setJavascriptParser(Parser parser)
parser - Javascript parser reference.public void unsetJavascriptParser(Parser parser)
parser - javascript parser reference
|
SMILA (incubation) API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||