SMILA (incubation) API documentation
java.lang.Object
  org.eclipse.smila.connectivity.framework.crawler.web.configuration.Configured
    org.eclipse.smila.connectivity.framework.crawler.web.parse.ParseData
public final class ParseData
Data extracted from a page's content.
| Field Summary |
|---|
| Fields inherited from class org.eclipse.smila.connectivity.framework.crawler.web.configuration.Configured |
|---|
| _configuration |

| Constructor Summary | |
|---|---|
| ParseData() | Empty constructor. |
| ParseData(ParseStatus status, java.lang.String title, Outlink[] outlinks, Metadata contentMeta) | Creates a new object with empty HTML meta tags. |
| ParseData(ParseStatus status, java.lang.String title, Outlink[] outlinks, Metadata contentMeta, HTMLMetaTags htmlMetaTags) | Creates a new object with empty parse meta data. |
| ParseData(ParseStatus status, java.lang.String title, Outlink[] outlinks, Metadata contentMeta, Metadata parseMeta, HTMLMetaTags htmlMetaTags) | Creates a new ParseData object with the given configuration. |
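
The constructor summary above suggests a telescoping pattern, where the shorter constructors fill in empty values for the arguments they omit. The following is a minimal sketch of that pattern, not the SMILA implementation: `ParseDataCtorSketch` is a hypothetical stand-in, plain `String` and `Map` types replace `ParseStatus`, `Outlink[]`, `Metadata` and `HTMLMetaTags`, and the delegation between constructors is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for ParseData. The real class takes ParseStatus, Outlink[],
// Metadata and HTMLMetaTags; simple JDK types are used here so the sketch compiles alone.
final class ParseDataCtorSketch {
    final String status;
    final String title;
    final String[] outlinks;
    final Map<String, String> contentMeta;
    final Map<String, String> parseMeta;
    final Map<String, String> htmlMetaTags;

    // Four-argument form: HTML meta tags start out empty (assumed delegation).
    ParseDataCtorSketch(String status, String title, String[] outlinks,
                        Map<String, String> contentMeta) {
        this(status, title, outlinks, contentMeta, new LinkedHashMap<String, String>());
    }

    // Five-argument form: parse meta data starts out empty (assumed delegation).
    ParseDataCtorSketch(String status, String title, String[] outlinks,
                        Map<String, String> contentMeta,
                        Map<String, String> htmlMetaTags) {
        this(status, title, outlinks, contentMeta,
             new LinkedHashMap<String, String>(), htmlMetaTags);
    }

    // Six-argument form: every value is supplied by the caller.
    ParseDataCtorSketch(String status, String title, String[] outlinks,
                        Map<String, String> contentMeta,
                        Map<String, String> parseMeta,
                        Map<String, String> htmlMetaTags) {
        this.status = status;
        this.title = title;
        this.outlinks = outlinks;
        this.contentMeta = contentMeta;
        this.parseMeta = parseMeta;
        this.htmlMetaTags = htmlMetaTags;
    }

    public static void main(String[] args) {
        Map<String, String> contentMeta = new LinkedHashMap<String, String>();
        contentMeta.put("Content-Type", "text/html");
        ParseDataCtorSketch data = new ParseDataCtorSketch(
            "success", "Home", new String[] {"http://example.org/a"}, contentMeta);
        System.out.println("html meta tags empty: " + data.htmlMetaTags.isEmpty());
        System.out.println("parse meta empty: " + data.parseMeta.isEmpty());
    }
}
```

With this layout only the six-argument constructor assigns fields, so a default changes in one place.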

| Method Summary | |
|---|---|
| boolean | equals(java.lang.Object o) |
| Metadata | getContentMeta() - The original meta data retrieved from content. |
| HTMLMetaTags | getHtmlMetaTags() - Returns HTML meta tags information. |
| java.lang.String | getMeta(java.lang.String name) - Gets a single meta data value. |
| Outlink[] | getOutlinks() - The outlinks of the page. |
| Metadata | getParseMeta() - Other content properties. |
| ParseStatus | getStatus() - The status of parsing the page. |
| java.lang.String | getTitle() - The title of the page. |
| int | hashCode() |
| void | setHtmlMetaTags(HTMLMetaTags htmlMetaTags) - Assigns HTML meta tags information. |
| void | setParseMeta(Metadata parseMeta) - Assigns parse meta data. |
| java.lang.String | toString() |

| Methods inherited from class org.eclipse.smila.connectivity.framework.crawler.web.configuration.Configured |
|---|
| getConf, setConf |

| Methods inherited from class java.lang.Object |
|---|
| clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
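
ParseData overrides both equals and hashCode, and one of its properties is an array (Outlink[]). The sketch below shows one common way to keep the two methods consistent when an array field is involved. It is an illustration only: `EqualsSketch` is a hypothetical class, `String[]` stands in for `Outlink[]`, and nothing here is taken from the SMILA source.

```java
import java.util.Arrays;
import java.util.Objects;

// Hypothetical value class with one scalar field and one array field.
final class EqualsSketch {
    private final String title;
    private final String[] outlinks; // stand-in for Outlink[]

    EqualsSketch(String title, String[] outlinks) {
        this.title = title;
        this.outlinks = outlinks;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) {
            return true;
        }
        if (!(o instanceof EqualsSketch)) {
            return false;
        }
        EqualsSketch other = (EqualsSketch) o;
        // Arrays.equals compares element by element; array.equals(...) would
        // only compare references.
        return Objects.equals(title, other.title)
            && Arrays.equals(outlinks, other.outlinks);
    }

    @Override
    public int hashCode() {
        // Built from the same fields as equals, so equal objects share a hash code.
        return 31 * Objects.hashCode(title) + Arrays.hashCode(outlinks);
    }

    public static void main(String[] args) {
        EqualsSketch a = new EqualsSketch("Home", new String[] {"http://example.org/a"});
        EqualsSketch b = new EqualsSketch("Home", new String[] {"http://example.org/a"});
        System.out.println(a.equals(b));
        System.out.println(a.hashCode() == b.hashCode());
    }
}
```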

| Constructor Detail |
|---|

public ParseData()

Empty constructor.

public ParseData(ParseStatus status,
                 java.lang.String title,
                 Outlink[] outlinks,
                 Metadata contentMeta)

Creates a new object with empty HTML meta tags.

Parameters:
status - ParseStatus
title - String title of the page
outlinks - Outlink array
contentMeta - Meta data extracted from content

public ParseData(ParseStatus status,
                 java.lang.String title,
                 Outlink[] outlinks,
                 Metadata contentMeta,
                 HTMLMetaTags htmlMetaTags)

Creates a new object with empty parse meta data.

Parameters:
status - ParseStatus
title - String title of the page
outlinks - Outlink array
contentMeta - Meta data extracted from content
htmlMetaTags - Meta data extracted from HTML tags

public ParseData(ParseStatus status,
                 java.lang.String title,
                 Outlink[] outlinks,
                 Metadata contentMeta,
                 Metadata parseMeta,
                 HTMLMetaTags htmlMetaTags)

Creates a new ParseData object with the given configuration.

Parameters:
status - ParseStatus
title - String title of the page
outlinks - Outlink array
contentMeta - Meta data extracted from content
parseMeta - Parser-specific meta data
htmlMetaTags - Meta data extracted from HTML tags

| Method Detail |
|---|
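
public ParseStatus getStatus()

The status of parsing the page.

public java.lang.String getTitle()

The title of the page.

public Outlink[] getOutlinks()

The outlinks of the page.

public Metadata getContentMeta()

The original meta data retrieved from content.

public Metadata getParseMeta()

Other content properties.

public void setParseMeta(Metadata parseMeta)

Assigns parse meta data.

Parameters:
parseMeta - parser-specific content properties

public java.lang.String getMeta(java.lang.String name)

Gets a single meta data value.

Parameters:
name - Name of meta data element
See Also:
getContentMeta(), getParseMeta()

public HTMLMetaTags getHtmlMetaTags()

Returns HTML meta tags information.

public void setHtmlMetaTags(HTMLMetaTags htmlMetaTags)

Assigns HTML meta tags information.

Parameters:
htmlMetaTags - meta tags extracted from HTML tags

public boolean equals(java.lang.Object o)

Overrides:
equals in class java.lang.Object

public int hashCode()

Overrides:
hashCode in class java.lang.Object

public java.lang.String toString()

Overrides:
toString in class java.lang.Object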
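
The getMeta(java.lang.String name) entry above refers to both getContentMeta() and getParseMeta(), which suggests that a single-value lookup consults both meta data sets. The sketch below shows one plausible lookup order, parse meta data first and content meta data as a fallback. That order is an assumption, not something this page states. `GetMetaSketch` is a hypothetical helper, and `Map<String, String>` stands in for the `Metadata` class.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper illustrating a two-level meta data lookup.
final class GetMetaSketch {

    // Assumed order: parse meta data first, then content meta data.
    // Returns null when neither map contains the name.
    static String getMeta(Map<String, String> parseMeta,
                          Map<String, String> contentMeta,
                          String name) {
        String value = parseMeta.get(name);
        if (value == null) {
            value = contentMeta.get(name);
        }
        return value;
    }

    public static void main(String[] args) {
        Map<String, String> contentMeta = new HashMap<String, String>();
        contentMeta.put("Content-Type", "text/html");
        contentMeta.put("charset", "iso-8859-1");

        Map<String, String> parseMeta = new HashMap<String, String>();
        parseMeta.put("charset", "utf-8");

        // Found only in the content meta data.
        System.out.println(getMeta(parseMeta, contentMeta, "Content-Type"));
        // Present in both maps; the parse meta data value wins in this sketch.
        System.out.println(getMeta(parseMeta, contentMeta, "charset"));
    }
}
```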