public class PackWriter extends Object implements AutoCloseable
PackWriter class is responsible for generating pack files from specified set of objects from repository. This implementation produce pack files in format version 2.
Source of objects may be specified in two ways:
RevObject
specifying exact list and order of objects in pack
Typical usage consists of creating an instance, configuring options,
preparing the list of objects by calling preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
, and streaming with
writePack(ProgressMonitor, ProgressMonitor, OutputStream)
. If the
pack is being stored as a file the matching index can be written out after
writing the pack by writeIndex(OutputStream)
. An optional bitmap
index can be made by calling prepareBitmapIndex(ProgressMonitor)
followed by writeBitmapIndex(OutputStream)
.
Class provide set of configurable options and
ProgressMonitor
support, as operations may take
a long time for big repositories. Deltas searching algorithm is NOT
IMPLEMENTED yet - this implementation relies only on deltas and objects
reuse.
This class is not thread safe. It is intended to be used in one thread as a single pass to produce one pack. Invoking methods multiple times or out of order is not supported as internal data structures are destroyed during certain phases to save memory when packing large repositories.
Modifier and Type | Class and Description |
---|---|
static class |
PackWriter.PackingPhase
Possible states that a PackWriter can be in.
|
class |
PackWriter.State
Summary of the current state of a PackWriter.
|
Modifier and Type | Field and Description |
---|---|
static Set<ObjectId> |
NONE
Empty set of objects for
preparePack() . |
Constructor and Description |
---|
PackWriter(ObjectReader reader)
Create a writer to load objects from the specified reader.
|
PackWriter(PackConfig config,
ObjectReader reader)
Create writer with a specified configuration.
|
PackWriter(PackConfig config,
ObjectReader reader,
PackStatistics.Accumulator statsAccumulator)
Create writer with a specified configuration.
|
PackWriter(Repository repo)
Create writer for specified repository.
|
PackWriter(Repository repo,
ObjectReader reader)
Create writer for specified repository.
|
Modifier and Type | Method and Description |
---|---|
void |
addObject(RevObject object)
Include one object to the output file.
|
void |
close() |
ObjectId |
computeName()
Computes SHA-1 of lexicographically sorted objects ids written in this
pack, as used to name a pack file in repository.
|
void |
excludeObjects(ObjectIdSet idx)
Add a pack index whose contents should be excluded from the result.
|
ObjectToPack |
get(AnyObjectId id)
Lookup the ObjectToPack object for a given ObjectId.
|
int |
getIndexVersion()
Returns the index format version that will be written.
|
static Iterable<PackWriter> |
getInstances()
Get all allocated, non-released PackWriters instances.
|
long |
getObjectCount()
Returns objects number in a pack file that was created by this writer.
|
ObjectIdOwnerMap<ObjectIdOwnerMap.Entry> |
getObjectSet()
Returns the object ids in the pack file that was created by this writer.
|
PackWriter.State |
getState()
Get snapshot of the current state of this PackWriter.
|
PackStatistics |
getStatistics()
Get statistics of what this PackWriter did in order to create the final
pack stream.
|
boolean |
isDeltaBaseAsOffset()
Check whether writer can store delta base as an offset (new style
reducing pack size) or should store it as an object id (legacy style,
compatible with old readers).
|
boolean |
isIgnoreMissingUninteresting()
Whether to ignore missing uninteresting objects
|
boolean |
isIndexDisabled()
Whether the index file cannot be created by this PackWriter.
|
boolean |
isReuseDeltaCommits()
Check if the writer will reuse commits that are already stored as deltas.
|
boolean |
isReuseValidatingObjects()
Check if the writer validates objects before copying them.
|
boolean |
isThin()
Whether this writer is producing a thin pack.
|
boolean |
isUseBitmaps()
Whether to use bitmaps
|
boolean |
isUseCachedPacks()
Whether to reuse cached packs.
|
boolean |
prepareBitmapIndex(ProgressMonitor pm)
Prepares the bitmaps to be written to the bitmap index file.
|
void |
preparePack(Iterator<RevObject> objectsSource)
Prepare the list of objects to be written to the pack stream.
|
void |
preparePack(ProgressMonitor countingMonitor,
ObjectWalk walk,
Set<? extends ObjectId> interestingObjects,
Set<? extends ObjectId> uninterestingObjects,
Set<? extends ObjectId> noBitmaps)
Prepare the list of objects to be written to the pack stream.
|
void |
preparePack(ProgressMonitor countingMonitor,
Set<? extends ObjectId> want,
Set<? extends ObjectId> have)
Prepare the list of objects to be written to the pack stream.
|
void |
preparePack(ProgressMonitor countingMonitor,
Set<? extends ObjectId> want,
Set<? extends ObjectId> have,
Set<? extends ObjectId> shallow)
Prepare the list of objects to be written to the pack stream.
|
void |
preparePack(ProgressMonitor countingMonitor,
Set<? extends ObjectId> want,
Set<? extends ObjectId> have,
Set<? extends ObjectId> shallow,
Set<? extends ObjectId> noBitmaps)
Prepare the list of objects to be written to the pack stream.
|
void |
select(ObjectToPack otp,
StoredObjectRepresentation next)
Select an object representation for this writer.
|
void |
setClientShallowCommits(Set<ObjectId> clientShallowCommits)
Records the set of shallow commits in the client.
|
void |
setDeltaBaseAsOffset(boolean deltaBaseAsOffset)
Set writer delta base format.
|
void |
setFilterBlobLimit(long bytes) |
void |
setIgnoreMissingUninteresting(boolean ignore)
Whether writer should ignore non existing uninteresting objects
|
void |
setIndexDisabled(boolean noIndex)
Whether to disable creation of the index file.
|
PackWriter |
setObjectCountCallback(ObjectCountCallback callback)
Set the
ObjectCountCallback . |
void |
setReuseDeltaCommits(boolean reuse)
Set the writer to reuse existing delta versions of commits.
|
void |
setReuseValidatingObjects(boolean validate)
Enable (or disable) object validation during packing.
|
void |
setShallowPack(int depth,
Collection<? extends ObjectId> unshallow)
Configure this pack for a shallow clone.
|
void |
setTagTargets(Set<ObjectId> objects)
Set the tag targets that should be hoisted earlier during packing.
|
void |
setThin(boolean packthin)
Whether writer may pack objects with delta base object not within set of
objects to pack
|
void |
setUseBitmaps(boolean useBitmaps)
Whether to use bitmaps
|
void |
setUseCachedPacks(boolean useCached)
Whether to use cached packs
|
boolean |
willInclude(AnyObjectId id)
Determine if the pack file will contain the requested object.
|
void |
writeBitmapIndex(OutputStream bitmapIndexStream)
Create a bitmap index file to match the pack file just written.
|
void |
writeIndex(OutputStream indexStream)
Create an index file to match the pack file just written.
|
void |
writePack(ProgressMonitor compressMonitor,
ProgressMonitor writeMonitor,
OutputStream packStream)
Write the prepared pack to the supplied stream.
|
public PackWriter(Repository repo)
Objects for packing are specified in preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
.
repo
- repository where objects are stored.public PackWriter(ObjectReader reader)
Objects for packing are specified in preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
.
reader
- reader to read from the repository with.public PackWriter(Repository repo, ObjectReader reader)
Objects for packing are specified in preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
.
repo
- repository where objects are stored.reader
- reader to read from the repository with.public PackWriter(PackConfig config, ObjectReader reader)
Objects for packing are specified in preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
.
config
- configuration for the pack writer.reader
- reader to read from the repository with.public PackWriter(PackConfig config, ObjectReader reader, @Nullable PackStatistics.Accumulator statsAccumulator)
Objects for packing are specified in preparePack(Iterator)
or
preparePack(ProgressMonitor, Set, Set)
.
config
- configuration for the pack writer.reader
- reader to read from the repository with.statsAccumulator
- accumulator for staticspublic static Iterable<PackWriter> getInstances()
public PackWriter setObjectCountCallback(ObjectCountCallback callback)
ObjectCountCallback
.
It should be set before calling
writePack(ProgressMonitor, ProgressMonitor, OutputStream)
.
callback
- the callback to setpublic void setClientShallowCommits(Set<ObjectId> clientShallowCommits)
clientShallowCommits
- the shallow commits in the clientpublic boolean isDeltaBaseAsOffset()
public void setDeltaBaseAsOffset(boolean deltaBaseAsOffset)
deltaBaseAsOffset
- boolean indicating whether delta base can be stored as an
offset.public boolean isReuseDeltaCommits()
public void setReuseDeltaCommits(boolean reuse)
reuse
- if true, the writer will reuse any commits stored as deltas.
By default the writer does not reuse delta commits.public boolean isReuseValidatingObjects()
public void setReuseValidatingObjects(boolean validate)
validate
- if true the pack writer will validate an object before it is
put into the output. This additional validation work may be
necessary to avoid propagating corruption from one local pack
file to another local pack file.public boolean isThin()
public void setThin(boolean packthin)
packthin
- a boolean indicating whether writer may pack objects with
delta base object not within set of objects to pack, but
belonging to party repository (uninteresting/boundary) as
determined by set; this kind of pack is used only for
transport; true - to produce thin pack, false - otherwise.public boolean isUseCachedPacks()
true
to reuse cached packs. If true index creation isn't
available.public void setUseCachedPacks(boolean useCached)
useCached
- if set to true
and a cached pack is present, it will
be appended onto the end of a thin-pack, reducing the amount
of working set space and CPU used by PackWriter. Enabling this
feature prevents PackWriter from creating an index for the
newly created pack, so its only suitable for writing to a
network client, where the client will make the index.public boolean isUseBitmaps()
true
to use bitmaps for ObjectWalks, if available.public void setUseBitmaps(boolean useBitmaps)
useBitmaps
- if set to true, bitmaps will be used when preparing a pack.public boolean isIndexDisabled()
true
if the index file cannot be created by this
PackWriter.public void setIndexDisabled(boolean noIndex)
noIndex
- true
to disable creation of the index file.public boolean isIgnoreMissingUninteresting()
true
to ignore objects that are uninteresting and also
not found on local disk; false to throw a
MissingObjectException
out of
preparePack(ProgressMonitor, Set, Set)
if an
uninteresting object is not in the source repository. By default,
true, permitting gracefully ignoring of uninteresting objects.public void setIgnoreMissingUninteresting(boolean ignore)
ignore
- true
if writer should ignore non existing
uninteresting objects during construction set of objects to
pack; false otherwise - non existing uninteresting objects may
cause MissingObjectException
public void setTagTargets(Set<ObjectId> objects)
Callers may put objects into this set before invoking any of the preparePack methods to influence where an annotated tag's target is stored within the resulting pack. Typically these will be clustered together, and hoisted earlier in the file even if they are ancient revisions, allowing readers to find tag targets with better locality.
objects
- objects that annotated tags point at.public void setShallowPack(int depth, Collection<? extends ObjectId> unshallow)
depth
- maximum depth of history to return. 1 means return only the
"wants".unshallow
- objects which used to be shallow on the client, but are being
extended as part of this fetchpublic void setFilterBlobLimit(long bytes)
bytes
- exclude blobs of size greater than thispublic long getObjectCount() throws IOException
IOException
- a cached pack cannot supply its object count.public ObjectIdOwnerMap<ObjectIdOwnerMap.Entry> getObjectSet() throws IOException
This method can only be invoked after
writePack(ProgressMonitor, ProgressMonitor, OutputStream)
has
been invoked and completed successfully.
IOException
- a cached pack cannot supply its object ids.public void excludeObjects(ObjectIdSet idx)
idx
- objects in this index will not be in the output pack.public void preparePack(@NonNull Iterator<RevObject> objectsSource) throws IOException
Iterator exactly determines which objects are included in a pack and order they appear in pack (except that objects order by type is not needed at input). This order should conform general rules of ordering objects in git - by recency and path (type and delta-base first is internally secured) and responsibility for guaranteeing this order is on a caller side. Iterator must return each id of object to write exactly once.
objectsSource
- iterator of object to store in a pack; order of objects within
each type is important, ordering by type is not needed;
allowed types for objects are
Constants.OBJ_COMMIT
,
Constants.OBJ_TREE
,
Constants.OBJ_BLOB
and
Constants.OBJ_TAG
; objects
returned by iterator may be later reused by caller as object
id and type are internally copied in each iteration.IOException
- when some I/O problem occur during reading objects.public void preparePack(ProgressMonitor countingMonitor, @NonNull Set<? extends ObjectId> want, @NonNull Set<? extends ObjectId> have) throws IOException
Basing on these 2 sets, another set of objects to put in a pack file is
created: this set consists of all objects reachable (ancestors) from
interesting objects, except uninteresting objects and their ancestors.
This method uses class ObjectWalk
extensively to find out that appropriate set of output objects and their
optimal order in output pack. Order is consistent with general git
in-pack rules: sort by object type, recency, path and delta-base first.
countingMonitor
- progress during object enumeration.want
- collection of objects to be marked as interesting (start
points of graph traversal). Must not be null
.have
- collection of objects to be marked as uninteresting (end
points of graph traversal). Pass NONE
if all objects
reachable from want
are desired, such as when serving
a clone.IOException
- when some I/O problem occur during reading objects.public void preparePack(ProgressMonitor countingMonitor, @NonNull Set<? extends ObjectId> want, @NonNull Set<? extends ObjectId> have, @NonNull Set<? extends ObjectId> shallow) throws IOException
Like preparePack(ProgressMonitor, Set, Set)
but also allows
specifying commits that should not be walked past ("shallow" commits).
The caller is responsible for filtering out commits that should not be
shallow any more ("unshallow" commits as in setShallowPack(int, java.util.Collection<? extends org.eclipse.jgit.lib.ObjectId>)
) from
the shallow set.
countingMonitor
- progress during object enumeration.want
- objects of interest, ancestors of which will be included in
the pack. Must not be null
.have
- objects whose ancestors (up to and including shallow
commits) do not need to be included in the pack because they
are already available from elsewhere. Must not be
null
.shallow
- commits indicating the boundary of the history marked with
have
. Shallow commits have parents but those parents
are considered not to be already available. Parents of
shallow
commits and earlier generations will be
included in the pack if requested by want
. Must not be
null
.IOException
- an I/O problem occurred while reading objects.public void preparePack(ProgressMonitor countingMonitor, @NonNull Set<? extends ObjectId> want, @NonNull Set<? extends ObjectId> have, @NonNull Set<? extends ObjectId> shallow, @NonNull Set<? extends ObjectId> noBitmaps) throws IOException
Like preparePack(ProgressMonitor, Set, Set)
but also allows
specifying commits that should not be walked past ("shallow" commits).
The caller is responsible for filtering out commits that should not be
shallow any more ("unshallow" commits as in setShallowPack(int, java.util.Collection<? extends org.eclipse.jgit.lib.ObjectId>)
) from
the shallow set.
countingMonitor
- progress during object enumeration.want
- objects of interest, ancestors of which will be included in
the pack. Must not be null
.have
- objects whose ancestors (up to and including shallow
commits) do not need to be included in the pack because they
are already available from elsewhere. Must not be
null
.shallow
- commits indicating the boundary of the history marked with
have
. Shallow commits have parents but those parents
are considered not to be already available. Parents of
shallow
commits and earlier generations will be
included in the pack if requested by want
. Must not be
null
.noBitmaps
- collection of objects to be excluded from bitmap commit
selection.IOException
- an I/O problem occurred while reading objects.public void preparePack(ProgressMonitor countingMonitor, @NonNull ObjectWalk walk, @NonNull Set<? extends ObjectId> interestingObjects, @NonNull Set<? extends ObjectId> uninterestingObjects, @NonNull Set<? extends ObjectId> noBitmaps) throws IOException
Basing on these 2 sets, another set of objects to put in a pack file is
created: this set consists of all objects reachable (ancestors) from
interesting objects, except uninteresting objects and their ancestors.
This method uses class ObjectWalk
extensively to find out that appropriate set of output objects and their
optimal order in output pack. Order is consistent with general git
in-pack rules: sort by object type, recency, path and delta-base first.
countingMonitor
- progress during object enumeration.walk
- ObjectWalk to perform enumeration.interestingObjects
- collection of objects to be marked as interesting (start
points of graph traversal). Must not be null
.uninterestingObjects
- collection of objects to be marked as uninteresting (end
points of graph traversal). Pass NONE
if all objects
reachable from want
are desired, such as when serving
a clone.noBitmaps
- collection of objects to be excluded from bitmap commit
selection.IOException
- when some I/O problem occur during reading objects.public boolean willInclude(AnyObjectId id) throws IOException
id
- the object to test the existence of.IOException
- a cached pack cannot be examined.public ObjectToPack get(AnyObjectId id)
id
- the object to find in the pack.public ObjectId computeName()
public int getIndexVersion()
This method can only be invoked after
writePack(ProgressMonitor, ProgressMonitor, OutputStream)
has
been invoked and completed successfully.
public void writeIndex(OutputStream indexStream) throws IOException
Called after
writePack(ProgressMonitor, ProgressMonitor, OutputStream)
.
Writing an index is only required for local pack storage. Packs sent on the network do not need to create an index.
indexStream
- output for the index data. Caller is responsible for closing
this stream.IOException
- the index data could not be written to the supplied stream.public void writeBitmapIndex(OutputStream bitmapIndexStream) throws IOException
Called after prepareBitmapIndex(ProgressMonitor)
.
bitmapIndexStream
- output for the bitmap index data. Caller is responsible for
closing this stream.IOException
- the index data could not be written to the supplied stream.public void writePack(ProgressMonitor compressMonitor, ProgressMonitor writeMonitor, OutputStream packStream) throws IOException
Called after
preparePack(ProgressMonitor, ObjectWalk, Set, Set, Set)
or
preparePack(ProgressMonitor, Set, Set)
.
Performs delta search if enabled and writes the pack stream.
All reused objects data checksum (Adler32/CRC32) is computed and validated against existing checksum.
compressMonitor
- progress monitor to report object compression work.writeMonitor
- progress monitor to report the number of objects written.packStream
- output stream of pack data. The stream should be buffered by
the caller. The caller is responsible for closing the stream.IOException
- an error occurred reading a local object's data to include in
the pack, or writing compressed object data to the output
stream.WriteAbortedException
- the write operation is aborted by
ObjectCountCallback
.public PackStatistics getStatistics()
public PackWriter.State getState()
public void close()
Release all resources used by this writer.
close
in interface AutoCloseable
public void addObject(RevObject object) throws IncorrectObjectTypeException
Objects are written in the order they are added. If the same object is added twice, it may be written twice, creating a larger than necessary file.
object
- the object to add.IncorrectObjectTypeException
- the object is an unsupported type.public void select(ObjectToPack otp, StoredObjectRepresentation next)
An ObjectReader
implementation should invoke
this method once for each representation available for an object, to
allow the writer to find the most suitable one for the output.
otp
- the object being packed.next
- the next available representation from the repository.public boolean prepareBitmapIndex(ProgressMonitor pm) throws IOException
Bitmaps can be used to speed up fetches and clones by storing the entire object graph at selected commits. Writing a bitmap index is an optional feature that not all pack users may require.
Called after writeIndex(OutputStream)
.
To reduce memory internal state is cleared during this method, rendering
the PackWriter instance useless for anything further than a call to write
out the new bitmaps with writeBitmapIndex(OutputStream)
.
pm
- progress monitor to report bitmap building work.IOException
- when some I/O problem occur during reading objects.Copyright © 2019 Eclipse JGit Project. All rights reserved.