|
SMILA 1.0 API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.eclipse.smila.importing.compounds.ExtractorWorkerBase
org.eclipse.smila.importing.crawler.file.FileExtractorWorker
public class FileExtractorWorker
Compound extractor worker to use in file crawling workflows.
| Field Summary | |
|---|---|
static java.lang.String |
NAME
name of worker. |
| Constructor Summary | |
|---|---|
FileExtractorWorker()
|
|
| Method Summary | |
|---|---|
protected Record |
convertRecord(Record compoundRecord,
Record extractedRecord,
TaskContext taskContext)
create a record from the extracted record that conforms to the records produced by the matching crawler. |
protected boolean |
filterRecord(Record record,
TaskContext taskContext)
Filter extracted records. |
protected ContentFetcher |
getContentFetcher()
get a content fetcher for the data source type. |
java.lang.String |
getName()
|
protected java.util.Iterator<Record> |
invokeExtractor(CompoundExtractor extractor,
Record compoundRecord,
java.io.InputStream compoundContent,
TaskContext taskContext)
invoke extractor with data from the crawled record. |
void |
setFileCrawlerService(FileCrawlerService fileCrawler)
DS service reference bind method. |
void |
unsetFileCrawlerService(FileCrawlerService fileCrawler)
DS service reference unbind method. |
| Methods inherited from class org.eclipse.smila.importing.compounds.ExtractorWorkerBase |
|---|
concatAttributeValues, copyAttachment, copyAttribute, copyCompoundAttributes, copySetToStringAttribute, mapRecord, perform, setCompoundExtractor, unsetCompoundExtractor |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final java.lang.String NAME
| Constructor Detail |
|---|
public FileExtractorWorker()
| Method Detail |
|---|
public java.lang.String getName()
protected java.util.Iterator<Record> invokeExtractor(CompoundExtractor extractor,
Record compoundRecord,
java.io.InputStream compoundContent,
TaskContext taskContext)
throws CompoundExtractorException
ExtractorWorkerBase
invokeExtractor in class ExtractorWorkerBaseCompoundExtractorException
protected Record convertRecord(Record compoundRecord,
Record extractedRecord,
TaskContext taskContext)
ExtractorWorkerBase
convertRecord in class ExtractorWorkerBase
protected boolean filterRecord(Record record,
TaskContext taskContext)
filterRecord in class ExtractorWorkerBaserecord - the record to checktaskContext - the task context containing the task parameters
true if the record passes the filter(s), false if not.protected ContentFetcher getContentFetcher()
ExtractorWorkerBase
getContentFetcher in class ExtractorWorkerBasepublic void setFileCrawlerService(FileCrawlerService fileCrawler)
public void unsetFileCrawlerService(FileCrawlerService fileCrawler)
|
SMILA 1.0 API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||