SMILA (incubation) API documentation

org.eclipse.smila.connectivity.framework.crawler.web.net
Interface UrlNormalizer

All Superinterfaces:
Configurable
All Known Implementing Classes:
BasicUrlNormalizer

public interface UrlNormalizer
extends Configurable

Interface used to convert URLs to normal form and optionally do RegEx substitutions .


Method Summary
 Configuration getConf()
          Return the configuration used by this object.
 java.lang.String normalize(java.lang.String urlString)
          Returns normalized URL.
 void setConf(Configuration conf)
          Set the configuration to be used by this object.
 

Method Detail

normalize

java.lang.String normalize(java.lang.String urlString)
                           throws java.net.MalformedURLException
Returns normalized URL.

Parameters:
urlString - String
Returns:
String
Throws:
java.net.MalformedURLException - if given URL was broken.

setConf

void setConf(Configuration conf)
Set the configuration to be used by this object.

Specified by:
setConf in interface Configurable
Parameters:
conf - Configuration

getConf

Configuration getConf()
Return the configuration used by this object.

Specified by:
getConf in interface Configurable
Returns:
Configuration

SMILA (incubation) API documentation