org.eclipse.smila.connectivity.framework.crawler.web.parse.html
Class HTMLMetaProcessor
java.lang.Object
org.eclipse.smila.connectivity.framework.crawler.web.parse.html.HTMLMetaProcessor
public final class HTMLMetaProcessor
- extends java.lang.Object
Class for parsing META Directives from DOM trees. This class handles specifically Robots META directives (all, none,
nofollow, noindex), finding BASE HREF tags, and HTTP-EQUIV no-cache instructions. All meta directives are stored in a
HTMLMetaTags instance.
|
Method Summary |
static void |
getMetaTags(HTMLMetaTags metaTags,
org.w3c.dom.Node node,
java.net.URL currURL)
Sets the indicators in robotsMeta to appropriate values, based on any META tags found under the
given node. |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
getMetaTags
public static void getMetaTags(HTMLMetaTags metaTags,
org.w3c.dom.Node node,
java.net.URL currURL)
- Sets the indicators in
robotsMeta to appropriate values, based on any META tags found under the
given node.
- Parameters:
metaTags - HTMLMetaTagsnode - NodecurrURL - URL