public class FileExtractorWorker extends ExtractorWorkerBase
| Modifier and Type | Field and Description |
|---|---|
| `static java.lang.String` | `NAME`: name of worker. |
| Constructor and Description |
|---|
| `FileExtractorWorker()` |
| Modifier and Type | Method and Description |
|---|---|
| `protected Record` | `convertRecord(Record compoundRecord, Record extractedRecord, TaskContext taskContext)`: Create a record from the extracted record that conforms to the records produced by the matching crawler. |
| `protected boolean` | `filterRecord(Record record, TaskContext taskContext)`: Filter extracted records. |
| `protected ContentFetcher` | `getContentFetcher()`: Get a content fetcher for the data source type. |
| `java.lang.String` | `getName()` |
| `protected java.util.Iterator<Record>` | `invokeExtractor(CompoundExtractor extractor, Record compoundRecord, java.io.InputStream compoundContent, TaskContext taskContext)`: Invoke extractor with data from the crawled record. |
| `void` | `setFileCrawlerService(FileCrawlerService fileCrawler)`: DS service reference bind method. |
| `void` | `unsetFileCrawlerService(FileCrawlerService fileCrawler)`: DS service reference unbind method. |
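
The bind and unbind methods listed above follow the usual Declarative Services (DS) pattern, in which the component runtime hands a service instance to the component and later takes it away again. The following is a minimal sketch of that pattern only, not the actual implementation of `FileExtractorWorker`: `CrawlerService`, `NamedCrawler`, and `Worker` are hypothetical stand-ins, and storing the reference in an `AtomicReference` is an assumption made for the example.

```java
import java.util.concurrent.atomic.AtomicReference;

/** Sketch of a DS-style bind/unbind pair; all types here are hypothetical stand-ins. */
class BindUnbindSketch {

  /** Hypothetical stand-in for FileCrawlerService. */
  interface CrawlerService {
    String name();
  }

  /** Trivial service implementation used only for the demonstration. */
  static class NamedCrawler implements CrawlerService {
    private final String name;

    NamedCrawler(String name) {
      this.name = name;
    }

    @Override
    public String name() {
      return name;
    }
  }

  /** Hypothetical worker that holds an optional service reference. */
  static class Worker {
    // Assumption: bind and unbind may run on a different thread than the worker itself.
    private final AtomicReference<CrawlerService> crawler = new AtomicReference<>();

    /** Bind method: remember the service handed over by the runtime. */
    void setCrawlerService(CrawlerService service) {
      crawler.set(service);
    }

    /** Unbind method: clear the reference only if it still points at the removed service. */
    void unsetCrawlerService(CrawlerService service) {
      crawler.compareAndSet(service, null);
    }

    boolean isBound() {
      return crawler.get() != null;
    }
  }

  public static void main(String[] args) {
    Worker worker = new Worker();
    CrawlerService first = new NamedCrawler("first");
    CrawlerService second = new NamedCrawler("second");

    worker.setCrawlerService(first);
    worker.setCrawlerService(second);   // a replacement service is bound
    worker.unsetCrawlerService(first);  // a late unbind of the old service is ignored
    System.out.println(worker.isBound()); // prints true
    worker.unsetCrawlerService(second);
    System.out.println(worker.isBound()); // prints false
  }
}
```

Comparing against the service passed to the unbind method keeps a late unbind of an old instance from discarding a replacement that was bound in the meantime.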
Methods inherited from class ExtractorWorkerBase: `concatAttributeValues`, `copyAttachment`, `copyAttribute`, `copyCompoundAttributes`, `copySetToStringAttribute`, `mapRecord`, `perform`, `setCompoundExtractor`, `unsetCompoundExtractor`

`public static final java.lang.String NAME`

`public java.lang.String getName()`

`protected java.util.Iterator<Record> invokeExtractor(CompoundExtractor extractor, Record compoundRecord, java.io.InputStream compoundContent, TaskContext taskContext) throws CompoundExtractorException`
- Superclass method: `invokeExtractor` in class `ExtractorWorkerBase`
- Throws: `CompoundExtractorException`

`protected Record convertRecord(Record compoundRecord, Record extractedRecord, TaskContext taskContext)`
- Superclass method: `convertRecord` in class `ExtractorWorkerBase`

`protected boolean filterRecord(Record record, TaskContext taskContext)`
- Superclass method: `filterRecord` in class `ExtractorWorkerBase`
- Parameters: `record` - the record to check; `taskContext` - the task context containing the task parameters
- Returns: `true` if the record passes the filter(s), `false` if not.

`protected ContentFetcher getContentFetcher()`
- Superclass method: `getContentFetcher` in class `ExtractorWorkerBase`

`public void setFileCrawlerService(FileCrawlerService fileCrawler)`

`public void unsetFileCrawlerService(FileCrawlerService fileCrawler)`
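
The three protected hooks (`invokeExtractor`, `filterRecord`, `convertRecord`) suggest a template-method design in which the base class drives the processing and the subclass supplies the source-specific steps. The sketch below illustrates that idea under stated assumptions: records are modelled as plain `Map<String, String>` objects instead of `Record`, the `TaskContext` and extractor parameters are omitted, and the order "extract, then filter, then convert" is a guess for illustration, since the actual order used by `ExtractorWorkerBase.perform` is not documented here. All class names in the block are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

/** Template-method sketch of the three hooks; all names are hypothetical. */
class ExtractorHooksSketch {

  /** Hypothetical base class; a record is modelled as a plain attribute map. */
  abstract static class BaseWorker {
    protected abstract Iterator<Map<String, String>> invokeExtractor(Map<String, String> compound);

    protected abstract boolean filterRecord(Map<String, String> record);

    protected abstract Map<String, String> convertRecord(
        Map<String, String> compound, Map<String, String> extracted);

    /** Assumed order: extract every element, drop filtered ones, convert the rest. */
    List<Map<String, String>> process(Map<String, String> compound) {
      List<Map<String, String>> result = new ArrayList<>();
      Iterator<Map<String, String>> extracted = invokeExtractor(compound);
      while (extracted.hasNext()) {
        Map<String, String> record = extracted.next();
        if (filterRecord(record)) {
          result.add(convertRecord(compound, record));
        }
      }
      return result;
    }
  }

  /** Hypothetical subclass that mimics a file-oriented worker. */
  static class FileLikeWorker extends BaseWorker {
    @Override
    protected Iterator<Map<String, String>> invokeExtractor(Map<String, String> compound) {
      // Pretend the compound (for example an archive) contains two entries.
      return Arrays.asList(entry("a.txt"), entry("b.tmp")).iterator();
    }

    @Override
    protected boolean filterRecord(Map<String, String> record) {
      // Example filter: skip temporary files.
      return !record.get("name").endsWith(".tmp");
    }

    @Override
    protected Map<String, String> convertRecord(
        Map<String, String> compound, Map<String, String> extracted) {
      // Shape the result like a record a file crawler might produce: a single path attribute.
      Map<String, String> converted = new HashMap<>();
      converted.put("path", compound.get("path") + "/" + extracted.get("name"));
      return converted;
    }

    private static Map<String, String> entry(String name) {
      Map<String, String> record = new HashMap<>();
      record.put("name", name);
      return record;
    }
  }

  /** Runs the sketch on one compound record and returns the converted records. */
  static List<Map<String, String>> demo() {
    Map<String, String> compound = new HashMap<>();
    compound.put("path", "archive.zip");
    return new FileLikeWorker().process(compound);
  }

  public static void main(String[] args) {
    System.out.println(demo()); // prints [{path=archive.zip/a.txt}]
  }
}
```

In this sketch `filterRecord` returns `true` for records that should be kept, matching the documented return value ("true if the record passes the filter(s)").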