SMILA (incubation) API documentation

org.eclipse.smila.connectivity.framework.crawler.web.parse.html
Class HTMLMetaTags

java.lang.Object
  extended by org.eclipse.smila.connectivity.framework.crawler.web.parse.html.HTMLMetaTags

public class HTMLMetaTags
extends java.lang.Object

This class holds the information about HTML "meta" tags extracted from a page. Some special tags have convenience methods for easy checking.


Constructor Summary
HTMLMetaTags()
           
 
Method Summary
 java.net.URL getBaseHref()
          A convenience method.
 java.util.Properties getGeneralTags()
          Returns all collected values of the general meta tags.
 java.util.Properties getHttpEquivTags()
          Returns all collected values of the "http-equiv" meta tags.
 boolean getNoCache()
          A convenience method.
 boolean getNoFollow()
          A convenience method.
 boolean getNoIndex()
          A convenience method.
 boolean getRefresh()
          A convenience method.
 java.net.URL getRefreshHref()
          A convenience method.
 int getRefreshTime()
          A convenience method.
 void reset()
          Sets all boolean values to false.
 void setBaseHref(java.net.URL baseHref)
          Sets the _baseHref.
 void setNoCache()
          Sets _noCache to true.
 void setNoFollow()
          Sets _noFollow to true.
 void setNoIndex()
          Sets _noIndex to true.
 void setRefresh(boolean refresh)
          Sets _refresh to the supplied value.
 void setRefreshHref(java.net.URL refreshHref)
          Sets the _refreshHref.
 void setRefreshTime(int refreshTime)
          Sets the _refreshTime.
 java.util.List<java.lang.String> toArrayList()
          Returns ArrayList representation of the HTML meta tags for further indexing.
 java.lang.String toString()
          Returns information about html meta tags.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

HTMLMetaTags

public HTMLMetaTags()
Method Detail

reset

public void reset()
Sets all boolean values to false. Clears all other tags.


setNoFollow

public void setNoFollow()
Sets _noFollow to true.


setNoIndex

public void setNoIndex()
Sets _noIndex to true.


setNoCache

public void setNoCache()
Sets _noCache to true.


setRefresh

public void setRefresh(boolean refresh)
Sets _refresh to the supplied value.

Parameters:
refresh - boolean

setBaseHref

public void setBaseHref(java.net.URL baseHref)
Sets the _baseHref.

Parameters:
baseHref - Base URL

setRefreshHref

public void setRefreshHref(java.net.URL refreshHref)
Sets the _refreshHref.

Parameters:
refreshHref - URL

setRefreshTime

public void setRefreshTime(int refreshTime)
Sets the _refreshTime.

Parameters:
refreshTime - int time

getNoIndex

public boolean getNoIndex()
A convenience method. Returns the current value of _noIndex.

Returns:
boolean

getNoFollow

public boolean getNoFollow()
A convenience method. Returns the current value of _noFollow.

Returns:
boolean

getNoCache

public boolean getNoCache()
A convenience method. Returns the current value of _noCache.

Returns:
boolean

getRefresh

public boolean getRefresh()
A convenience method. Returns the current value of _refresh.

Returns:
boolean

getBaseHref

public java.net.URL getBaseHref()
A convenience method. Returns the _baseHref, if set, or null otherwise.

Returns:
URL

getRefreshHref

public java.net.URL getRefreshHref()
A convenience method. Returns the _refreshHref, if set, or null otherwise. The value may be invalid if getRefresh()returns false.

Returns:
URL

getRefreshTime

public int getRefreshTime()
A convenience method. Returns the current value of _refreshTime. The value may be invalid if getRefresh()returns false.

Returns:
int

getGeneralTags

public java.util.Properties getGeneralTags()
Returns all collected values of the general meta tags. Property names are tag names, property values are "content" values.

Returns:
Properties

getHttpEquivTags

public java.util.Properties getHttpEquivTags()
Returns all collected values of the "http-equiv" meta tags. Property names are tag names, property values are "content" values.

Returns:
Properties

toString

public java.lang.String toString()
Returns information about html meta tags.

Overrides:
toString in class java.lang.Object
Returns:
String

toArrayList

public java.util.List<java.lang.String> toArrayList()
Returns ArrayList representation of the HTML meta tags for further indexing.

Returns:
ArrayList

SMILA (incubation) API documentation