|
SMILA 1.0 API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.eclipse.smila.importing.state.objectstore.ObjectStoreDeltaService
public class ObjectStoreDeltaService
ObjectStore based implementation of the DeltaService for the jobmanager based importing framework.
| Nested Class Summary |
|---|
| Nested classes/interfaces inherited from interface org.eclipse.smila.importing.DeltaService |
|---|
DeltaService.EntryId |
| Field Summary | |
|---|---|
static java.lang.String |
BUNDLE_ID
bundle ID for configuration area access. |
static java.lang.String |
STORENAME
objectstore store name. |
| Constructor Summary | |
|---|---|
ObjectStoreDeltaService()
|
|
| Method Summary | |
|---|---|
protected void |
activate(ComponentContext context)
service activation. |
State |
checkState(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String jobRunId,
java.lang.String hashCode)
Determine delta state of record identified by sourceId and recordId. |
State |
checkState(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String compoundRecordId,
java.lang.String jobRunId,
java.lang.String hashCode)
Determine delta state of record identified by sourceId and recordId. |
void |
clearAll()
delete all state information in the service about all data sources. |
void |
clearSource(java.lang.String sourceId)
delete all state information in the service about the given data source. |
long |
countEntries(java.lang.String sourceId,
boolean countExact)
|
protected void |
deactivate(ComponentContext context)
service deactivation. |
void |
deleteEntry(java.lang.String sourceId,
DeltaService.EntryId entryId)
remove an entry, e.g. after it has been deleted. |
java.util.Collection<java.lang.String> |
getShardPrefixes(java.lang.String sourceId)
get possible input values for #getRecordIdsToDelete(String). |
java.util.Collection<java.lang.String> |
getSourceIds()
get Ids of all sources that currently have entries in the DeltaService. |
java.util.Collection<DeltaService.EntryId> |
getUnvisitedEntries(java.lang.String sourceAndShardPrefix,
java.lang.String jobRunId)
get the record IDs in the given data source and shard that have not been visited in the given job run and therefore must be sent as deleted records to the target job. |
void |
markAsUpdated(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String jobRunId,
java.lang.String hashCode)
Mark the record as visited in the current crawl job run. |
void |
markAsUpdated(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String compoundRecordId,
java.lang.String jobRunId,
java.lang.String hashCode)
Mark the record that was extracted from a compound as visited in the current crawl job run. |
void |
markCompoundElementsVisited(java.lang.String sourceId,
java.lang.String compoundRecordId,
java.lang.String jobRunId)
Set jobRunId of all elements of the given compound record, because the compound itself has not changed. |
void |
setObjectStore(ObjectStoreService objectStore)
used by DS to set service reference. |
void |
unsetObjectStore(ObjectStoreService objectStore)
used by DS to remove service reference. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final java.lang.String BUNDLE_ID
public static final java.lang.String STORENAME
| Constructor Detail |
|---|
public ObjectStoreDeltaService()
| Method Detail |
|---|
protected void activate(ComponentContext context)
protected void deactivate(ComponentContext context)
public State checkState(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String jobRunId,
java.lang.String hashCode)
throws DeltaException
DeltaServiceState.UPTODATE the
service also marks the record as visited in the current crawl job run already, so there is no need to call
DeltaService.markAsUpdated(String, String, String, String) afterwards. In the other cases the crawler should call
DeltaService.markAsUpdated(String, String, String, String) only if the record is actually submitted to a processing
job.
checkState in interface DeltaServicesourceId - the name of the data source that contains the record.recordId - the record idjobRunId - the current job run id in which the crawler is running.hashCode - a string that reflects changes in the record content. This can be as simple as a version identifier if
such is available in record metadata, or even a hash calculated on the actual content of the record.
State value.
DeltaException
public State checkState(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String compoundRecordId,
java.lang.String jobRunId,
java.lang.String hashCode)
throws DeltaException
DeltaServiceState.UPTODATE the
service also marks the record as visited in the current crawl job run already, so there is no need to call
DeltaService.markAsUpdated(String, String, String, String) afterwards. In the other cases the crawler should call
DeltaService.markAsUpdated(String, String, String, String) only if the record is actually submitted to a processing
job.
checkState in interface DeltaServicesourceId - the name of the data source that contains the record.recordId - the record idcompoundRecordId - the record id of the compound this record was extracted from. May be null.jobRunId - the current job run id in which the crawler is running.hashCode - a string that reflects changes in the record content. This can be as simple as a version identifier if
such is available in record metadata, or even a hash calculated on the actual content of the record.
State value.
DeltaException
public void markCompoundElementsVisited(java.lang.String sourceId,
java.lang.String compoundRecordId,
java.lang.String jobRunId)
throws DeltaException
DeltaService
markCompoundElementsVisited in interface DeltaServiceDeltaException
public void markAsUpdated(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String jobRunId,
java.lang.String hashCode)
throws DeltaException
DeltaService
markAsUpdated in interface DeltaServicesourceId - the name of the data source that contains the record.recordId - the record idjobRunId - the current job run id in which the crawler is running.hashCode - a string that reflects changes in the record content. This can be as simple as a version identifier if
such is available in record metadata, or even a hash calculated on the actual content of the record.
DeltaException
public void markAsUpdated(java.lang.String sourceId,
java.lang.String recordId,
java.lang.String compoundRecordId,
java.lang.String jobRunId,
java.lang.String hashCode)
throws DeltaException
DeltaService
markAsUpdated in interface DeltaServicesourceId - the name of the data source that contains the record.recordId - the record idcompoundRecordId - the record id of the compound this record was extracted from. May be null.jobRunId - the current job run id in which the crawler is running.hashCode - a string that reflects changes in the record content. This can be as simple as a version identifier if
such is available in record metadata, or even a hash calculated on the actual content of the record.
DeltaException
public void clearSource(java.lang.String sourceId)
throws DeltaException
DeltaService
clearSource in interface DeltaServicesourceId - data source name.
DeltaException
public void clearAll()
throws DeltaException
DeltaService
clearAll in interface DeltaServiceDeltaException
public java.util.Collection<java.lang.String> getSourceIds()
throws DeltaException
DeltaService
getSourceIds in interface DeltaServiceDeltaException
public long countEntries(java.lang.String sourceId,
boolean countExact)
throws DeltaException
countEntries in interface DeltaServicesourceId - the name of the data source to examinecountExact - set to true to get an exact reault, but this may take some time. Else the service may return only an
estimated value.
DeltaException
public java.util.Collection<java.lang.String> getShardPrefixes(java.lang.String sourceId)
throws DeltaException
DeltaService#getRecordIdsToDelete(String). This makes it possible to parallelize and
distribute the check for records to delete.
getShardPrefixes in interface DeltaServicesourceId - the name of the data source to examine.
DeltaException
public java.util.Collection<DeltaService.EntryId> getUnvisitedEntries(java.lang.String sourceAndShardPrefix,
java.lang.String jobRunId)
throws DeltaException
DeltaServiceDeltaService.getShardPrefixes(String) and call this method with each of the shard-prefix
values.
getUnvisitedEntries in interface DeltaServicesourceAndShardPrefix - one of the values returned by DeltaService.getShardPrefixes(String)
DeltaException
public void deleteEntry(java.lang.String sourceId,
DeltaService.EntryId entryId)
throws DeltaException
DeltaService
deleteEntry in interface DeltaServicesourceId - data source IdentryId - ID of the entry, e.g. as returned by DeltaService.getUnvisitedEntries(String, String)
DeltaExceptionpublic void setObjectStore(ObjectStoreService objectStore)
public void unsetObjectStore(ObjectStoreService objectStore)
|
SMILA 1.0 API documentation | ||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||