motiejus/jgit - jgit - gitea: Gitea Service

Author	SHA1	Message	Date
David Turner	6b1e3c58b1	Run auto GC in the background When running an automatic GC on a FileRepository, when the caller passes a NullProgressMonitor, run the GC in a background thread. Use a thread pool of size 1 to limit the number of background threads spawned for background gc in the same application. In the next minor release we can make the thread pool configurable. In some cases, the auto GC limit is lower than the true number of unreachable loose objects, so auto GC will run after every (e.g.) fetch operation. This leads to the appearance of poor fetch performance. Since these GCs will never make progress (until either the objects become referenced, or the two week timeout expires), blocking on them simply reduces throughput. In the event that an auto GC would make progress, it's still OK if it runs in the background. The progress will still happen. This matches the behavior of regular git. Git (and now jgit) uses the lock file for gc.log to prevent simultaneous runs of background gc. Further, it writes errors to gc.log, and won't run background gc if that file is present and recent. If gc.log is too old (according to the config gc.logexpiry), it will be ignored. Change-Id: I3870cadb4a0a6763feff252e6eaef99f4aa8d0df Signed-off-by: David Turner <dturner@twosigma.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-06-06 01:18:29 +02:00
Shawn Pearce	0d20573d9c	fetch: Accept any SHA-1 on lhs of refspec Allow fetch to accept a SHA-1 on the left hand side of a RefSpec, enabling callers to pass a specific SHA-1 they want that may not have been advertised by the remote repository. This can be passed along to the network protocol to be sent in a "want" line. Rest of the plumbing only cares about the ObjectId of the Ref in the askFor map, so make up a fake name using ObjectId.name() to pass the desired ObjectId into the network code. Change-Id: I620a189f3de005c403aa68b7d0442d6aa94e6056	2017-06-04 13:58:16 -07:00
Matthias Sohn	df9ce4b981	Prepare 4.9.0-SNAPSHOT builds Change-Id: I52a4153d573799e861ab104939f51fac1aceb9ee Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-30 13:42:07 +02:00
Han-Wen Nienhuys	832808bd50	Fix out-of-bounds exception in RepoCommand#relative Change-Id: I9c91aa2ff037bff27a8131fba54be22f5f27d80d Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-24 23:43:59 +02:00
Bryan Donlan	2204cc9866	Fix null return from FS.readPipe when command fails to launch When a command invoked from readPipe fails to launch (i.e. the exec call fails due to a missing command executable), Process.start() throws, which gets caught by the generic IOException handler, resulting in a null return. This change detects this case and rethrows a CommandFailedException instead. Additionally, this change uses /bin/sh instead of bash for its posix command failure test, to accommodate building in environments where bash is unavailable. Change-Id: Ifae51e457e5718be610c0a0914b18fe35ea7b008 Signed-off-by: Bryan Donlan <bdonlan@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-24 23:35:39 +02:00
Dave Borowitz	a46b28808b	RenameDetector: Clarify rename limits <= 0 Change-Id: I8da386e02272316b8e5e5c2f31ce10ad98bcdb28	2017-05-24 09:26:40 -04:00
Zhen Chen	099dbe6ef5	Remove unnecessary cast for DfsReader Change-Id: I22aaccfc9d589750f9d1d711b655dd0fd543fa57 Signed-off-by: Zhen Chen <czhen@google.com>	2017-05-22 10:27:20 -07:00
David Pursehouse	9a4486003f	Merge "Fix javadoc of TooLargeObjectInPackException"	2017-05-22 01:12:10 -04:00
Shawn Pearce	1513a5632d	Allow DfsReader to be subclassed Necessary if a DFS implementation wants to override close() to record DfsReaderIoStats. Change-Id: I144575f9bf1abf2c1fd72030550c4f0795fcf44d	2017-05-19 13:50:36 -07:00
Shawn Pearce	562de51239	Track read IO for DfsReader Compute how much disk IO a DfsReader is performing, and how long the sum of those operations took on this reader instance. Implementations of DFS and interested applications can get the stats by calling the new DfsReader.getIoStats() method at or after close(). Change-Id: If585741301f29182617933d6406d4a70497f2ca7	2017-05-19 12:23:02 -07:00
Matthias Sohn	ef0237564e	Fix javadoc of TooLargeObjectInPackException The API exception should have the same javadoc as the internal exception org.eclipse.jgit.errors.TooLargeObjectInPackException Change-Id: Ia7508c77609e53c8e808412ac523a93194648e49 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-19 11:22:05 +02:00
Terry Parker	c46c720e99	Exclude refs/tags from bitmap commit selection Commit `db77610` ensured that all refs/tags commits are added to the primary GC pack. It did that by adding all of the refs/tags commits to the primary GC pack PackWriter's "interesting" object set. Unfortunately, all commit objects in the "interesting" set are selected as commits for which bitmap indices will be built. In a repository like chromium with lots of tags, this changed the number of bitmaps created from <700 to >10000. That puts huge memory pressure on the GC task. This change restores the original behavior of ignoring tags when selecting commits for bitmaps. In the "uninteresting" set, commits for refs/heads and refs/tags for unannotated tags can not be differentiated. We instead identify refs/tags commits by passing their ObjectIds as a new "noBitmaps" parameter to the PackWriter.preparePack() methods. PackWriterBitmapPreparer.setupTipCommitBitmaps() can then use that "noBitmaps" parameter to exclude those commits. Change-Id: Icd287c6b04fc1e48de773033fe432a9b0e904ac5 Signed-off-by: Terry Parker <tparker@google.com>	2017-05-18 15:25:21 -07:00
Matthias Sohn	69d5e89e99	[findBugs] Use UTF-8 to write to the error stream in TextProgressMonitor Change-Id: Ic85db2043d6f673f268bf781917daad45d28f8cd Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-15 10:30:24 +02:00
Matthias Sohn	f1dd61f646	[findBugs] Use UTF-8 to read git-rebase-todo file Change-Id: I7c6f71e13ef106678157eae1aa3f9d39712e577b Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-15 10:29:47 +02:00
Matthias Sohn	0aa1a19cab	[findBugs] Use UTF-8 when writing to the error stream in GitHook Change-Id: Ica8a40b909ed45cf8e538714e4f26b64ff9a3d21 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-15 10:28:53 +02:00
Matthias Sohn	9f98d3e2e4	Add shutdown hook to cleanup unfinished clone when JVM is killed Bug: 516303 Change-Id: I5181b0e8096af3537296848ac7dd74dff0b6d279 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-13 17:23:34 +02:00
Thomas Wolf	09d96f8d46	Clean up the disk when cloning fails CloneCommand.call() has three stages: preparation, then the actual clone (init/fetch), and finally maybe checking out the working directory. Restructure such that if we fail or are cancelled during the actual clone (middle phase), we do clean up the disk again. This prevents leaving behind a partial clone in an inconsistent state: either we have a fully successfully built clone, or nothing at all. Bug: 516303 Change-Id: I9b18c60f8f99816d42a3deb7d4a33a9f22eeb709 Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	2017-05-12 05:11:13 -04:00
Christian Halstrick	501af12c19	Checkout should not use too long filenames DirCacheCheckout is generating names for temporary files. It was not checking the length of these filenames. It may happen that a generated filename is longer than 255 chars which causes problems on certain platforms. Make sure that filenames for temporary files do not exceed 255 chars. Bug: 508823 Change-Id: I9475c04351ce3faebdc6ad40ea4faa3c326815f4	2017-05-10 00:33:44 +02:00
Mickael Istria	5b84e25fa3	Support pull on detached HEAD Bug: 485396 Change-Id: I82be09385c9b0bcc0054fea5a9cb9d178a41e278 Signed-off-by: Mickael Istria <mistria@redhat.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-05-08 00:38:25 +02:00
Zhen Chen	8f7d0a4fbe	Reset ObjectWalker when it starts a new walk The ObjectWalker in PackWriterBitmapWalker needs to be reset whenever it starts a new walk. Move this responsibility from the caller to the method when the new walk starts. Change-Id: Ib66003be1b5bdc80f46b9bbbb17d45e616714912 Signed-off-by: Zhen Chen <czhen@google.com>	2017-05-03 15:02:33 -07:00
Shawn Pearce	d377a885a9	Fix stack overflow in MergeBaseGenerator Some repository topologies can cause carryOntoHistory to overflow the thread stack, due to its strategy of recursing into the 2nd+ parents of a merge commit. This can easily happen if a project maintains a local fork, and frequently pulls from the upstream repository, which itself may have a branchy history. Rewrite the carryOntoHistory algorithm to use a fixed amount of thread stack, pushing the save points onto the heap. By using heap space the thread stack depth is no longer a concern. Repositories are instead limited by available memory. The algorithm is now structured as two loops: carryOntoHistory: This outer loop pops saved commits off the top of the stack, allowing the inner loop algorithm to dive down that path and carry bits onto commits along that part of the graph. The loop ends when there are no more stack elements. carryOntoHistoryInner: The inner loop walks along a single path of the graph. For a string of pearls (commits with one parent each) r <- s <- t <- u the algorithm walks backwards from u to r by iteratively updating its local variable 'c'. This avoids heap allocation along a simple path that does not require remembering state. The inner loop breaks in the HAVE_ALL case, when all bits have been found to be previously set on the commit. This occurs when a prior iteration of the outer loop (carryOntoHistory) explored a different path to this same commit, and copied the bits onto it. When the inner loop encounters a merge commit, it pushes all parents onto the heap based stack by allocating individual CarryStack elements for each parent. Parents are pushed in order, allowing side branches to be explored first. A small optimization is taken for the last parent, avoiding pushing it and instead updating 'c', allowing the side branch to be entered without allocating a CarryStack. Change-Id: Ib7b67d90f141c497fbdc61a31b0caa832e4b3c04	2017-05-02 11:38:59 -07:00
David Pursehouse	005e5feb4e	Clone: add --recurse-submodules option Add the --recurse-submodules option on the command, which causes submodules to also be initialized and updated. Add a callback interface on CloneCommand and SubmoduleUpdateCommand so that they can provide progress feedback for clone operations. Change-Id: I41b1668bc0d0bdfa46a9a89882c9657ea3063fc1 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-27 09:19:08 +02:00
Thirumala Reddy Mutchukota	5e250e45be	Delete expired garbage even when there is no GC pack present. Delete the condition to check whether the garbage pack creation time is older than the last GC operation, because it's not possible to find the last GC operation time when there is no GC pack. Add additional tests to make sure the contents of the expired garbage packs are considered during the GC operation and any actively referenced objects from the garbage packs are copied successfully into the GC pack before deleting the garbage pack. Change-Id: I09e8b2656de8ba7f9b996724ad1961d908e937b6 Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	2017-04-21 14:06:58 -07:00
Martin Fick	f9b69677f6	Add parseCommit(AnyObjectId) method to Repository. It is quite common to want to parse a commit without already having a RevWalk. Provide a shortcut to do so to make it more convenient, and to ensure that the RevWalk is released afterwards. Signed-off-by: Martin Fick <mfick@codeaurora.org> Change-Id: I9528e80063122ac318f115900422a24ae49a920e	2017-04-19 09:42:47 +02:00
Dan Willemsen	b6fc8e2f3c	RepoCommand: Add linkfile support. Android wants them to work, and we're only interested in them for bare repos, so add them just for that. Make sure to use symlinks instead of just using the copyfile implementation. Some scripts look up where they're actually located in order to find related files, so they need the link back to their project. Change-Id: I929b69b2505f03036f69e25a55daf93842871f30 Signed-off-by: Dan Willemsen <dwillemsen@google.com> Signed-off-by: Stefan Beller <sbeller@google.com> Signed-off-by: Jeff Gaston <jeffrygaston@google.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-04-18 10:33:37 +02:00
Jonathan Nieder	f9e13efe47	Merge "Process all "have"s even when MissingObjectException is encountered"	2017-04-17 14:53:27 -04:00
Jonathan Nieder	c2e6e7abc9	Process all "have"s even when MissingObjectException is encountered Because objects described by the client using "have" lines do not need to be reachable by any ref on the server, it is possible for them to point to missing objects in the reachability graph. When such an object is encountered, I1097a2defa4a9dcf502ca8baca5d32880378818f (Only throw MissingObjectException when necessary, 2017-03-29) aborts the "have" walk early to salvage the fetch. The downside of that change is that remaining "have"s are ignored unless they pointed directly to an object with a bitmap. In the worst case this can increase the bandwidth cost of a fetch to the cost of a clone because most "have"s are ignored. Avoid this cost by bypassing the failed "have" completely and moving on to the remaining "have"s. Change-Id: Iac236b6d05f735078c9935abfa6e58d1eb47f388	2017-04-17 11:50:28 -07:00
David Pursehouse	a6df70569a	Merge "Prevent alternates loop"	2017-04-17 12:01:55 -04:00
Martin Fick	e4714a2a5f	Prevent alternates loop When looping through alternates, prevent visiting the same object directory twice. This could happen when the objects/info/alternates file includes itself directly or indirectly via another repo and its alternates file. Change-Id: I79bb3da099ebc3c262d2e6c61ed4578eb1aa3474 Signed-off-by: James Melvin <jmelvin@codeaurora.org> Signed-off-by: Martin Fick <mfick@codeaurora.org> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-14 23:35:17 +02:00
Matthias Sohn	3af4afdfbf	Add missing @since tag for new API RepoCommand.setTargetURI() Change-Id: I4531b94e3a04606a69eeb3c3d154510b87507012 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-14 19:49:19 +02:00
David Pursehouse	c80d8c5901	Bazel: Restrict src globs to Java source files Generating the src list with an unrestricted wildcard causes all files in the source tree to be included. This results in junk files such as .orig (generated during merge conflict resolution) being included, which causes a build error: in srcs attribute of java_library rule //org.eclipse.jgit:jgit: file '//org.eclipse.jgit:src/org/eclipse/jgit/gitrepo/RepoCommand.java.orig' is misplaced here (expected .java, .srcjar or .properties). Modify the globs to only include Java source files. Change-Id: Iaef3db33ac71d71047cd28acb0378e15cb09ece9 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-04-13 14:14:55 +09:00
Han-Wen Nienhuys	fe5437e96b	Fix RepoCommand to allow for relative URLs This is necessary for deploying submodules on android.googlesource.com. * Allow an empty base URL. This is useful if the 'fetch' field is "." and all names are relative to some host root. * The URLs in the resulting superproject are relative to the superproject's URL. Add RepoCommand#setDestinationURI to set this. If unset, the existing behavior is maintained. * Add two tests for the Android and Gerrit case, checking the URL format in .gitmodules; the tests use a custom RemoteReader which is representative of the use of this class in Gerrit's Supermanifest plugin. Change-Id: Ia75530226120d75aa0017c5410fd65d0563e91b Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-04-13 10:53:58 +09:00
Jonathan Nieder	e730fcce77	Merge "BundleWriter: Allow constructing from only an ObjectReader"	2017-04-12 21:12:15 -04:00
Terry Parker	56a1cced74	Merge "Only throw MissingObjectException when necessary"	2017-04-12 10:25:11 -04:00
Dave Borowitz	c9c9e672e5	BundleWriter: Allow constructing from only an ObjectReader Change-Id: I01821d6a9fbed7a5fe4619884e42937fbd6909ce	2017-04-12 08:27:57 -04:00
Matthias Sohn	cc0dbbae43	Merge branch 'stable-4.7' * stable-4.7: Cleanup and test trailing slash handling in ManifestParser ManifestParser: Throw exception if remote does not have fetch attribute Change-Id: Ia9dc3110bcbdae05175851ce647ffd11c542f4c0 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-11 00:54:16 +02:00
Han-Wen Nienhuys	f17ec3928c	Cleanup and test trailing slash handling in ManifestParser This is a workaround for https://bugs.openjdk.java.net/browse/JDK-4666701. Change-Id: Idd04657e8d95a841d72230f8881b6b899daadbc2 Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-11 00:37:38 +02:00
Han-Wen Nienhuys	84d855cda7	ManifestParser: Throw exception if remote does not have fetch attribute In the repo manifest documentation [1] the fetch attribute is marked as "#REQUIRED". If the fetch attribute is not specified, this would previously result in NullPointerException. Throw a SAXException instead. [1] https://gerrit.googlesource.com/git-repo/+/master/docs/manifest-format.txt Change-Id: Ib8ed8cee6074fe6bf8f9ac6fc7a1664a547d2d49 Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-04-10 15:08:32 +02:00
Matthias Sohn	b3cc05d886	Remove unused API filters Change-Id: I1e00d71395228265aad4071b023024ee1bf855d5 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-09 23:43:43 +02:00
Matthias Sohn	3db0f507ee	Prepare 4.5.3-SNAPSHOT builds Change-Id: I69681b7a5687ca76bd0dd5d3e7ce2cff841d0e32 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-08 00:31:09 +02:00
Matthias Sohn	c1d3ecbeab	JGit v4.5.2.201704071617-r Change-Id: I66402643d7c84c90bf5cefed4d2ec3aa68c94cfb Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-07 22:17:27 +02:00
Matthias Sohn	7adacbd19a	Silence API error for new method added to abstract MergeStrategy OSGi semantic versioning rules allow breaking implementors of an API in a minor version. Change-Id: I4ada3e6455e8e8e1bb8fb71affa0a1b36bd46fc4 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-06 18:17:22 +02:00
Matthias Sohn	4e8655c74d	Fix @since tags of new API added after 4.7.0 Change-Id: I356f71cdef8e23a9b06cf0a4079060a116b9ed27 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-06 18:16:04 +02:00
Zhen Chen	f5368dc97f	Only throw MissingObjectException when necessary When preparing the bitmap, the flag ignoreMissingStart only applied to the start object. However, sometimes the start object is present but some related objects are missing during the walk. We should only throw the MissingObjectException when ignoreMissingStart is set to false. Change-Id: I1097a2defa4a9dcf502ca8baca5d32880378818f Signed-off-by: Zhen Chen <czhen@google.com>	2017-04-05 19:09:16 -04:00
Matthias Sohn	6a311a071f	Prepare 4.7.1-SNAPSHOT Change-Id: I16a45035258276217446bccc0ad1b0991383aa0c Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-06 00:16:53 +02:00
Dave Borowitz	4c3e274588	Support creating Mergers without a Repository All that's really required to run a merge operation is a single ObjectInserter, from which we can construct a RevWalk, plus a Config that declares a diff algorithm. Provide some factory methods that don't take Repository. Change-Id: Ib884dce2528424b5bcbbbbfc043baec1886b9bbd	2017-04-05 17:50:54 -04:00
Matthias Sohn	9f4c10784b	JGit v4.7.0.201704051617-r Change-Id: Ic2bd6aca0b7a7e0597ffc1f7cf647b49878f9950 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-05 22:17:44 +02:00
Matthias Sohn	aec22e74cf	Prepare 4.8.0-SNAPSHOT builds Change-Id: Ifea6750e79d417a8a2a891b3b5f96d68c7200011 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-04-05 14:49:49 +02:00
Andrey Loskutov	7476baebfc	Fixed NP dereference error reported by ecj in UploadPack.stopBuffering() Introduced via commit `3b2508b514`. Change-Id: I2b6175c095aea2868a8c302103095accde5170e3 Signed-off-by: Andrey Loskutov <loskutov@gmx.de>	2017-04-05 09:51:12 +02:00
Shawn Pearce	db2493e7d8	Merge "Make diff locations more consistent"	2017-04-04 22:26:38 -04:00
Dave Borowitz	e4672d1c16	NameConflictTreeWalk: Mark repo param @Nullable This is passed directly to the super constructor, where it is also @Nullable. Marking it here saves the reader a jump. Change-Id: Icc8db2f2dc6aae6e591aa4f09a3c283336a5424c	2017-04-04 14:53:17 -04:00
Jonathan Nieder	db58abbbe8	Merge "Buffer the response until request parsing has done"	2017-04-04 14:25:41 -04:00
Masaya Suzuki	3b2508b514	Buffer the response until request parsing has done This is a continuation from https://git.eclipse.org/r/#/c/4716/. For a non-bidirectional request, we need to consume the request before writing any response. In UploadPack, we write "shallow"/"unshallow" responses before parsing "have" lines. This has happened not to be a problem most of the time in the smart HTTP protocol because the underlying InputStream has a 32 KiB buffer in SmartOutputStream. Change-Id: I7c61659e7c4e8bd49a8b17e2fe9be67bb32933d3 Signed-off-by: Masaya Suzuki <masayasuzuki@google.com>	2017-04-04 10:52:49 -07:00
KB Sriram	4a985f5aa8	Make diff locations more consistent DiffAlgorithms can return different edit locations for inserts or deletes, if they can be "shifted" up or down repeating blocks of lines. This causes the 3-way merge to apply both edits, resulting in incorrectly removing or duplicating lines. Augment an existing "tidy-up" stage in DiffAlgorithm to move all shiftable edits (not just the last INSERT edit) to a consistent location, and add test cases for previously incorrect merges. Bug: 514095 Change-Id: I5fe150a2fc04e1cdb012d22609d86df16dfb0b7e Signed-off-by: KB Sriram <kbsriram@google.com>	2017-04-03 16:45:13 -07:00
Matthias Sohn	b65a764b6b	Remove unused import from ManifestParser Change-Id: Ie60ef9c7bc6ce0fdf017949ebfb9a21753e70506 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-31 00:38:36 +02:00
Han-Wen Nienhuys	f32d65759c	Document the intended use of RepoCommand#setURI() Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Change-Id: I4a59dd8278b7b0026094692127b7f55e89c10bae	2017-03-29 16:54:29 +02:00
Han-Wen Nienhuys	6e652846f6	Noop changes to ManifestParser * Parse the base URL in ManifestParser construction. This will signal errors earlier. * Simplify stripping of trailing slashes. Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Change-Id: I4a86f68c9d7737f71cf20352cfe26288fbd2b463	2017-03-29 13:51:37 +02:00
Han-Wen Nienhuys	27b05c7d71	Consistently use 'path' for the path to a subrepo in RepoCommand Signed-off-by: Han-Wen Nienhuys <hanwen@google.com> Change-Id: I79ea7eb7b4d319e0100e3121aca5ef82eb8ad92a	2017-03-27 17:36:56 -04:00
Matthias Sohn	251abbfcd1	Merge branch 'stable-4.6' * stable-4.6: Only mark packfile invalid if exception signals permanent problem Don't flag a packfile invalid if opening existing file failed Prepare 4.5.2-SNAPSHOT builds Change-Id: Ife4efad1135d3870a5a0fb71e60b9524fb8777ab Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-27 22:45:59 +02:00
David Pursehouse	7f013924a8	Merge branch 'stable-4.5' into stable-4.6 * stable-4.5: Only mark packfile invalid if exception signals permanent problem Don't flag a packfile invalid if opening existing file failed Prepare 4.5.2-SNAPSHOT builds Change-Id: I20b50981adc54c426666015ff04fe3bb1db9abd9 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-03-27 10:14:50 +09:00
Matthias Sohn	aaf3c5154e	Only mark packfile invalid if exception signals permanent problem Add NoPackSignatureException and UnsupportedPackVersionException to explicitly mark permanent unrecoverable problems with a pack Assume problem with a pack is permanent only if we are sure the exception signals a non-transient problem we can't recover from: - AccessDeniedException: we lack permissions - CorruptObjectException: we detected corruption - EOFException: file ended unexpectedly - NoPackSignatureException: pack has no pack signature - NoSuchFileException: file has gone missing - PackMismatchException: pack no longer matches its index - UnpackException: unpacking failed - UnsupportedPackIndexVersionException: unsupported pack index version - UnsupportedPackVersionException: unsupported pack version Do not attempt to handle Errors since they are thrown for serious problems applications should not try to recover from. Change-Id: I2c416ce2b0e23255c4fb03a3f9a0ee237f7a484a Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-26 11:40:47 +02:00
Luca Milanesio	363a3657b1	Don't flag a packfile invalid if opening existing file failed A packfile random file open operation may fail with a FileNotFoundException even if the file exists, possibly due to a temporary lack of resources. Instead of managing the FileNotFoundException as any generic IOException it is best to rethrow the exception but prevent the packfile from being flagged as invalid until it is actually opened and read successfully or unsuccessfully. Bug: 514170 Change-Id: Ie37edba2df77052bceafc0b314fd1d487544bf35 Signed-off-by: Luca Milanesio <luca.milanesio@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-25 01:33:18 +01:00
Matthias Sohn	11a12ceb0b	Prepare 4.5.2-SNAPSHOT builds Change-Id: I8485de1f3f63dc9ec445b8fb08093ca144aedc59 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-25 01:21:58 +01:00
David Pursehouse	5f902f07cc	PullCommand: Add support for recursing into submodules Add a new API method to set the recurse mode, and pass the mode into the fetch command. Extend the existing FetchCommandRecurseSubmodulesTest to also perform the same tests for pull. Rename the test class accordingly. Change-Id: I12553af47774b4778f7011e1018bd575a7909bd0 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-03-24 00:02:45 +01:00
Matthias Sohn	61f830d3a2	Explain in error message how to recover from lock failure Bug: 483897 Change-Id: I70f8d9c82c1efe2928f072a2fb69461160f7c5f7 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-22 18:17:01 -04:00
David Pursehouse	2d0ce094b4	Remove Buck build Buck will be replaced with Bazel Change-Id: I3cf07d7aaaa2a58bac34e16c50af5416693254ac Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-22 01:41:21 +01:00
Matthias Sohn	a9a84b7235	JGit v4.5.1.201703201650-r Change-Id: I88de7c9f52abbc4921a82208ed74d22aa19fb3cd Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-20 21:44:47 +01:00
Jonathan Nieder	bc5014faec	bazel: Add explicit targets for library dependencies This provides a place to declare visibility restrictions and transitive dependencies for each library. Other targets should only declare dependencies on what they directly use, making dependencies easier to maintain. Trim the dependencies of org.eclipse.jgit:jgit to follow that rule. It declares dependencies on Apache httpcomponents and the servlet API but doesn't use them. Tested: * 'bazel build //...' succeeds * applying the change https://gerrit-review.googlesource.com/90843 to a copy of Gerrit, following the instructions there, and running 'bazel test //...' in that copy of Gerrit still succeeds Change-Id: I3ab958ce8b3227019cdbe4cc81e0f042e1541034	2017-03-19 18:51:03 -07:00
David Ostrovsky	7e4258113c	Move SHA1 compress/recompress files to resource folder This fixes Bazel build: in srcs attribute of java_library rule //org.eclipse.jgit:jgit: file '//org.eclipse.jgit:src/org/eclipse/jgit/util/sha1/SHA1.recompress' is misplaced here (expected .java, .srcjar or .properties). Another option that was considered is to exclude the non source files. Change-Id: I7083f27a4a49bf6681c85c7cf7b08a83c9a70c77 Signed-off-by: David Ostrovsky <david@ostrovsky.org>	2017-03-18 16:46:58 +01:00
Matthias Sohn	50ac852551	Merge "Merge branch 'stable-4.6'"	2017-03-15 19:50:04 -04:00
Matthias Sohn	dab8e0e7cb	Merge branch 'stable-4.6' * stable-4.6: Don't remove pack when FileNotFoundException is transient Change-Id: I82941a98385cda27c89e1e6750b7b6db4e39f414 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-16 00:29:43 +01:00
Matthias Sohn	405fdf76d5	Merge branch 'stable-4.5' into stable-4.6 * stable-4.5: Don't remove pack when FileNotFoundException is transient Change-Id: Ic17c542d78a4cad48ff1ed77dcdc853a4ef2dc06 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-16 00:26:37 +01:00
Luca Milanesio	4c558225dc	Don't remove pack when FileNotFoundException is transient The FileNotFoundException is typically raised in four conditions: 1. file doesn't exist 2. incompatible read vs. read/write open modes 3. filesystem locking 4. temporary lack of resources (e.g. too many open files) 1. is already managed, 2. would never happen as packs are not overwritten while with 3. and 4. it is worth logging the exception and retrying to read the pack again. Log transient errors using an exponential backoff strategy to avoid flooding the logs with the same error if consecutive retries to access the pack fail repeatedly. Bug: 513435 Change-Id: I03c6f6891de3c343d3d517092eaa75dba282c0cd Signed-off-by: Luca Milanesio <luca.milanesio@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-15 23:43:39 +01:00
Andrey Loskutov	a4b9c73391	Don't try to strip new line if the message buffer is empty Bug: 513726 Change-Id: I0e7c19f8883b93bad1b9de166f671d28f3e9c240 Signed-off-by: Andrey Loskutov <loskutov@gmx.de>	2017-03-15 20:29:21 +01:00
David Pursehouse	2fe1a3abbe	FetchCommand: Fix detection of submodule recursion mode The submodule.name.fetchRecurseSubmodules value was being read from the configuration of the submodule, but it should be read from the config of the parent repository. Also, the fetch.recurseSubmodules value from the parent repository's configuration was not being considered at all. Fix both of these and add tests. Now the precedence of the recurse mode is determined as follows: 1. Value passed to the API 2. Value configured in submodule.name.fetchRecurseSubmodules 3. Value configured in fetch.recurseSubmodules 4. Default to "on demand" Change-Id: Ic23b7c40b5f39135fb3fd754c597dd4bcc94240c	2017-03-10 13:17:39 +09:00
Matthias Sohn	79f85d1cf2	Prepare 4.6.2-SNAPSHOT builds Change-Id: I8835f79145e6a989787d47322c3d8cb9baf0624a Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-07 20:49:24 +01:00
Matthias Sohn	258dc5a715	JGit v4.6.1.201703071140-r Change-Id: I842dc95313e5b47b0b7ec983c4a0a91915ed4183 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-07 17:40:32 +01:00
David Pursehouse	503d59044f	FetchCommand: Add basic support for recursing into submodules Extend FetchCommand to expose a new method, setRecurseSubmodules(mode), which allows setting the mode to YES, NO or ON_DEMAND. After fetching a repository, its submodules are recursively fetched: - When the mode is YES, submodules are always fetched. - When the mode is NO, submodules are not fetched. - When the mode is ON_DEMAND, submodules are only fetched when the parent repository receives an update of the submodule and the new revision is not already in the submodule. The mode is determined in the following order of precedence: - Value specified in the API call using setRecurseSubmodules. - Value specified in the repository's config under the key submodule.name.fetchRecurseSubmodules - Defaults to ON_DEMAND if neither of the previous is set. Extend FetchResult to recursively include results for submodules, as a map of the submodule path to an instance of FetchResult. Test setup is based on testCloneRepositoryWithNestedSubmodules. Change-Id: Ibc841683763307cb76e78e142e0da5b11b1add2a Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-03-04 09:31:16 +09:00
David Pursehouse	d4895c7160	Remove unnecessary @SuppressWarnings("nls") Change-Id: Idc5f82af17ecc944b5657b02823412ea46b38413 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-03-04 09:27:14 +09:00
Thomas Wolf	1f3e74ed9f	Make Repository.normalizeBranchName less strict This operation was added recently with the goal to provide some way to auto-correct invalid user input, or to provide a correction suggestion to the user -- EGit now uses it that way. But the initial implementation was very restrictive; it removed all non-ASCII characters and even slashes. Understandably end users were not happy with that. Git has no such restriction to ASCII-only; nor does JGit. Branch names should be meaningful to the end user, and if a user-supplied branch name is invalid for technical reasons, a "normalized" name should still be meaningful to the user. Rewrite to attempt a minimal fix such that the result will pass isValidRefName. * Replace all Unicode whitespace by underscore. * Replace troublesome special characters by dash. * Collapse sequences of underscores, dots, and dashes. * Remove underscores, dots, and dashes following slashes, and collapse sequences of slashes. * Strip leading and trailing sequences of slashes, dots, dashes, and underscores. * Avoid the ".lock" extension. * Avoid the Windows reserved device names. * If input name is null return an empty String so callers don't need to check for null. This still allows branch names with single slashes as separators between components, avoids some pitfalls that isValidRefName() tests for, and leaves other characters untouched and thus allows non-ASCII branch names. Also move the function from the bottom of the file up to where isValidRefName is implemented. Bug: 512508 Change-Id: Ia0576d9b2489162208c05e51c6d54e9f0c88c3a7 Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-04 00:23:42 +01:00
Jonathan Nieder	45f62576de	Merge "SHA-1: collision detection support"	2017-03-02 13:26:45 -05:00
Shawn Pearce	83ad74b6b9	SHA-1: collision detection support Update SHA1 class to include a Java port of sha1dc[1]'s ubc_check, which can detect the attack pattern used by the SHAttered[2] authors. Given the shattered example files that have the same SHA-1, this modified implementation can identify there is risk of collision given only one file in the pair: $ jgit ... [main] WARN org.eclipse.jgit.util.sha1.SHA1 - SHA-1 collision 38762cf7f55934b34d179ae6a4c80cadccbb7f0a When JGit detects probability of a collision the SHA1 class now warns on the logger, reporting the object's SHA-1 hash, and then throws a Sha1CollisionException to the caller. From the paper[3] by Marc Stevens, the probability of a false positive identification of a collision is about 14 * 2^(-160), sufficiently low enough for any detected collision to likely be a real collision. git-core[4] may adopt sha1dc before the system migrates to an entirely new hash function. This commit enables JGit to remain compatible with that move to sha1dc, and help protect users by warning if similar attacks as SHAttered are identified. Performance declined about 8% (detection off), now: MessageDigest 238.41 MiB/s MessageDigest 244.52 MiB/s MessageDigest 244.06 MiB/s MessageDigest 242.58 MiB/s SHA1 216.77 MiB/s (was ~240.83 MiB/s) SHA1 220.98 MiB/s SHA1 221.76 MiB/s SHA1 221.34 MiB/s This decline in throughput is attributed to the step loop unrolling in compress(), which was necessary to easily fit the UbcCheck logic into the hash function. Using helper functions s1-s4 reduces the code explosion, providing acceptable throughput. 
With detection enabled (default): SHA1 detectCollision 180.12 MiB/s SHA1 detectCollision 181.59 MiB/s SHA1 detectCollision 181.64 MiB/s SHA1 detectCollision 182.24 MiB/s sha1dc (native C) ~206.28 MiB/s sha1dc (native C) ~204.47 MiB/s sha1dc (native C) ~203.74 MiB/s Average time across 100,000 calls to hash 4100 bytes (such as a commit or tree) for the various algorithms available to JGit also shows SHA1 is slower than MessageDigest, but by an acceptable margin: MessageDigest 17 usec SHA1 18 usec SHA1 detectCollision 22 usec Time to index-pack for git.git (217982 objects, 69 MiB) has increased: MessageDigest SHA1 w/ detectCollision ------------- ----------------------- 20.12s 25.25s 19.87s 25.48s 20.04s 25.26s avg 20.01s 25.33s +26% Being implemented in Java with these additional safety checks is clearly a penalty, but throughput is still acceptable given the increased security against object name collisions. [1] https://github.com/cr-marcstevens/sha1collisiondetection [2] https://shattered.it/ [3] https://marc-stevens.nl/research/papers/C13-S.pdf [4] https://public-inbox.org/git/20170223230621.43anex65ndoqbgnf@sigill.intra.peff.net/ Change-Id: I9fe4c6d8fc5e5a661af72cd3246c9e67b1b9fee6	2017-02-28 16:38:43 -08:00
Matthias Sohn	9d2a7de65e	Silence API error caused by changed return type of digest() Change-Id: Ic0810ed7fea837c45cbc9a4649ca51d140bad6e6 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-03-01 00:34:59 +01:00
Magnus Vigerlöf	2a5d20c138	Correct the boolean logic for filtering paths The TreeWalk filtering classes need to support the three different meanings of the return value the path comparison generates. A new path comparison method (isPathMatch) is created with three distinct return values (isPathPrefix uses value '0' to encode two of these), which makes it possible for the logical operators (especially NOT) to aggregate a correct verdict. A filter like: AND(Path("path"), NOT(Path("path/to/other"))) Should filter out 'path/to/other/file', but not 'path/to/my/file'. The path-limiting feature, when testing path/to/my/file, results in tests being run for the following paths: path path/to path/to/my path/to/my/file isPathPrefix('path/to/other') will return '0' for the first two and since there is no way for NOT to distinguish between an exact match and a match indicating that the tested path is a 'parent', it will incorrectly return false and thus remove everything below 'path' immediately. isPathMatch has a distinguished value for 'parent' matches that will be preserved through the logic operators and should not cause an over-eager removal of paths. The functionality of isPathPrefix is required by other parts and is untouched. Unit tests are included to ensure that the logical functionality is correct and can be preserved. Change-Id: Ice2ca9406f09f1b179569e99b86a0e5d77baa20d Signed-off-by: Magnus Vigerlöf <magnus.vigerlof@gmail.com>	2017-02-28 23:56:33 +01:00
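The three-valued logic described in the commit above can be illustrated with a small sketch. It is not the JGit implementation: `Verdict`, `PathLogic`, and the string-based comparison are hypothetical and only show why a distinct "parent" value lets NOT and AND reach the right verdict.

```java
/** Hypothetical three-valued verdict; names are illustrative, not JGit's. */
enum Verdict { NO_MATCH, PARENT, MATCH }

final class PathLogic {
    /** Compares the path currently being walked against a filter path. */
    static Verdict path(String filter, String walked) {
        if (walked.equals(filter) || walked.startsWith(filter + "/")) {
            return Verdict.MATCH;    // walked path is the filter path or lies below it
        }
        if (filter.startsWith(walked + "/")) {
            return Verdict.PARENT;   // walked path is a parent directory of the filter path
        }
        return Verdict.NO_MATCH;
    }

    /** NOT keeps PARENT undecided instead of flipping it. */
    static Verdict not(Verdict v) {
        switch (v) {
            case MATCH:
                return Verdict.NO_MATCH;
            case NO_MATCH:
                return Verdict.MATCH;
            default:
                return Verdict.PARENT;
        }
    }

    /** AND rejects on any NO_MATCH, accepts on two MATCHes, else stays undecided. */
    static Verdict and(Verdict a, Verdict b) {
        if (a == Verdict.NO_MATCH || b == Verdict.NO_MATCH) {
            return Verdict.NO_MATCH;
        }
        if (a == Verdict.MATCH && b == Verdict.MATCH) {
            return Verdict.MATCH;
        }
        return Verdict.PARENT;
    }

    /** The filter from the commit message: AND(Path("path"), NOT(Path("path/to/other"))). */
    static Verdict example(String walked) {
        return and(path("path", walked), not(path("path/to/other", walked)));
    }
}
```

In this sketch `example("path")` and `example("path/to")` stay `PARENT`, so the walk keeps descending; `example("path/to/my/file")` is `MATCH` and `example("path/to/other/file")` is `NO_MATCH`.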
Shawn Pearce	1bf7d3f290	SHA1: support reset() and reuse instances Allow SHA1 instances to be reused to compute another hash value, and resume caching them in ObjectInserter and PackParser. This shaves a small amount of running time off parsing git.git's pack file: before after ------ ------ 25.25s 25.55s 25.48s 25.06s 25.26s 24.94s Almost noise (small difference), but recycling the instances reduces some stress on the memory allocator finding two 80 word message block arrays needed for hashing and collision detection. Change-Id: I4af88a720e81460293bc5c5d1d3db1a831e7e228	2017-02-26 15:26:53 -08:00
Shawn Pearce	0f25f64d48	Switch to pure Java SHA1 for ObjectId Generate names for objects using only the pure Java SHA1 implementation, but continue using MessageDigest in tests. This opens the possibility of changing the hashing function to incorporate additional safety measures, such as those used in sha1dc[1]. Since MessageDigest has higher throughput, continue using MessageDigest for computing pack, idx and DirCache trailers. These are less likely to be sensitive to SHAttered[2] types of attacks, as Git uses them to detect random bit flips during transfer, and not for content identity. [1] https://github.com/cr-marcstevens/sha1collisiondetection [2] https://shattered.it/ Change-Id: If6da98334201f7f20cb916e46f782c45f373784e	2017-02-26 11:16:19 -08:00
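As background for the commit above, a Git object name is the SHA-1 of a short header followed by the object content. The sketch below computes a blob name with the JDK's `MessageDigest` purely for illustration; it is not the pure Java SHA1 class the commit switches to, and `BlobName` is a hypothetical helper.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class BlobName {
    /** Returns the 40-character hex name Git assigns to a blob with this content. */
    static String blobName(byte[] content) {
        MessageDigest md;
        try {
            md = MessageDigest.getInstance("SHA-1");
        } catch (NoSuchAlgorithmException e) {
            // SHA-1 is a required algorithm on every Java platform.
            throw new IllegalStateException(e);
        }
        // Header: object type, a space, the decimal length, then a NUL byte.
        md.update(("blob " + content.length + "\0").getBytes(StandardCharsets.US_ASCII));
        md.update(content);
        StringBuilder hex = new StringBuilder(40);
        for (byte b : md.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }
}
```

Calling `BlobName.blobName(new byte[0])` gives `e69de29bb2d1d6434b8b29ae775ad8c2e48c5391`, the well-known name of the empty blob.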
Shawn Pearce	982f5d1bf1	Pure Java SHA-1 This implementation is derived straight from the description written in RFC 3174. On Mac OS X with Java 1.8.0_91 it offers similar throughput as MessageDigest SHA-1: system 239.75 MiB/s system 244.71 MiB/s system 245.00 MiB/s system 244.92 MiB/s sha1 234.08 MiB/s sha1 244.50 MiB/s sha1 242.99 MiB/s sha1 241.73 MiB/s This is the fastest implementation I could come up with. Common SHA-1 implementation tricks such as unrolling loops creates a method too large for the JIT to effectively optimize, resulting in lower overall hashing throughput. Using a preprocessor to perform the register renaming of A-E also didn't help, as again the method was too large for the JIT to effectively optimize. Fortunately the fastest version is a naive, straight-forward implementation very close to the description in RFC 3174. Change-Id: I228b05c4a294ca2ad51386cf0e47978c68e1aa42	2017-02-26 11:16:19 -08:00
David Pursehouse	3b4448637f	Enable and fix warnings about redundant specification of type arguments Since the introduction of generic type parameter inference in Java 7, it's not necessary to explicitly specify the type of generic parameters. Enable the warning in Eclipse, and fix all occurrences. Change-Id: I9158caf1beca5e4980b6240ac401f3868520aad0 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-20 22:47:23 +01:00
Shawn Pearce	07fdc50c07	Fix bad test fix from `0bff481` "Limit receive commands" In `0bff481d45` to accurately use the two limits it was necessary to move the LimitedInputStream out of the PacketLineIn and further down to the PackParser. Unfortunately this didn't survive review, as a buggy test failed and the "fix" was to drop this part of the code. The maxPackSizeLimit should apply to the pack stream, not the pkt-line framing used to send commands to control the ReceivePack instance. The commands are controlled using a different limit. The failing test allowed too many bytes in the pack and was only failing because it was including the command framing. The correct fix for the test was simply to drop the limit lower, to more closely match the actual pack size. Change-Id: I47d3885b9d7d527e153df7ac9c62fc2865ceecf4	2017-02-20 10:51:27 -08:00
David Pursehouse	fceac7e44d	Add some more missing @Override annotations Change-Id: Ic13160920b986edde87c928c473240cc9c034f50 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-20 11:32:22 +09:00
David Pursehouse	7ac182f4e4	Enable and fix 'Should be tagged with @Override' warning Set missingOverrideAnnotation=warning in Eclipse compiler preferences which enables the warning: The method <method> of type <type> should be tagged with @Override since it actually overrides a superclass method Justification for this warning is described in: http://stackoverflow.com/a/94411/381622 Enabling this causes in excess of 1000 warnings across the entire code-base. They are very easy to fix automatically with Eclipse's "Quick Fix" tool. Fix all of them except 2 which cause compilation failure when the project is built with mvn; add TODO comments on those for further investigation. Change-Id: I5772061041fd361fe93137fd8b0ad356e748a29c Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-19 20:05:08 -04:00
Thomas Wolf	0a4cf573d3	Fix typo in @since Change-Id: I266b0c72d2827bcf2b86ddc6c1892d1a46c548eb Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	2017-02-19 16:46:44 +01:00
David Pursehouse	1cda4faed4	PullCommand: Allow to set tag behavior Add a new method setTagOpt which sets the annotated tag behavior during fetch. Pass the option to the fetch command. No explicit tests are added; the fetch with tags functionality is already covered by the tests of the fetch command. Change-Id: I131e1f68d8fcced178d8fa48abf7ffab17f8e173 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-18 15:21:26 +01:00
Naoki Takezoe	1448ec37f9	Set commit time to ZipArchiveEntry Archived zip files for the same commit have different MD5 hashes because mdate and mdate in the header of zip entries are not specified. In this case, Commons Compress sets the time of archiving. The original git implementation sets the commit time: `e2b2d6a172/archive.c (L378)` With this fix, the archive command sets the commit time on ZipArchiveEntry when a RevCommit is given as the archiving target. Change-Id: I30dd8710e910cdf42d57742f8709e9803930a123 Signed-off-by: Naoki Takezoe <takezoe@gmail.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-18 10:47:27 +01:00
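The effect described in the commit above, that pinning the entry time makes archives of the same content reproducible, can be sketched with the JDK's `java.util.zip` classes. This swaps in `ZipEntry` for Commons Compress's `ZipArchiveEntry`, and `DeterministicZip` is a hypothetical helper, so treat it as an illustration of the idea rather than the code in the change.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

final class DeterministicZip {
    /** Builds a one-entry zip whose entry timestamp is fixed to the given time. */
    static byte[] archive(String name, String content, long commitTimeMillis) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(out)) {
            ZipEntry entry = new ZipEntry(name);
            // Without this call the entry would carry the time of archiving,
            // so two runs over the same content would produce different bytes.
            entry.setTime(commitTimeMillis);
            zip.putNextEntry(entry);
            zip.write(content.getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }
}
```

Two calls with the same name, content, and timestamp return identical byte arrays, so any checksum over the archive is stable.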
David Turner	d3962fef6b	GC: don't loosen doomed objects If the pruneexpire config is set to "now", then any unreferenced loose objects are immediately eligible for gc. So there is no need to actually write the loose objects. Users who run hosting services which sometimes accept large, entirely garbage packs might set the following configurations: gc.pruneExpire = now gc.prunePackExpire = 2.weeks Then garbage objects will be kept around in packs, but after two weeks the packs themselves will get deleted. For client-side users of jgit, the default settings will loosen garbage objects, and, after an hour, delete the old packs in which they resided. Change-Id: I8f686ac60b40181b1ee92ac6c313c3f33b55c44c Signed-off-by: David Turner <dturner@twosigma.com>	2017-02-17 11:26:09 -05:00
Jonathan Nieder	b537e372c9	Update name of InsecureCipherMode error-prone pattern Without this, using bazel 0.4.4 to build fails: ERROR: jgit/org.eclipse.jgit/BUILD:29:1: Java compilation in rule '//org.eclipse.jgit:insecure_cipher_factory' failed: Worker process sent response with exit code: 1. jgit/src/org/eclipse/jgit/transport/InsecureCipherFactory.java:63: error: [InsecureCryptoUsage] Insecure usage of a crypto API: the transformation is not a compile-time constant expression. return Cipher.getInstance(algo); ^ (see http://errorprone.info/bugpattern/InsecureCryptoUsage) Change-Id: I7f9a3a5117e42cb68544674f5312df0368aa3674	2017-02-15 16:01:42 -08:00
Zhen Chen	87d81a7301	Add missing skip garbage pack logic in DfsReader * Missing garbage pack check in getObjectSize(AnyObjectId, int) * Missing `last` pack check in has(AnyObjectId) and open(AnyObjectId, int) Change-Id: Idd1b9dd8db34c92d7da546fef1936ec9b2728718 Signed-off-by: Zhen Chen <czhen@google.com>	2017-02-15 15:40:04 -08:00
Zhen Chen	ff852dad51	Skip first pack if avoid garbage is set and it is a garbage pack At the beginning of the OBJECT_SCAN loop, it first checks whether the object exists in the last pack; however, it forgot to skip a garbage pack on the first iteration. Change-Id: I8a99c0f439218d19c49cd4dae891b8cc4a57099d Signed-off-by: Zhen Chen <czhen@google.com>	2017-02-13 20:54:35 -04:00
Zhen Chen	8dd5b644dc	Refactor skip garbage pack logic into a method There are multiple places in DfsReader that skip a garbage pack if both of the following conditions are satisfied: * AvoidUnreachable flag is set * The pack is a garbage pack Refactor them into a shared private method. Change-Id: I67d6bb601db55f904437c807c6a3c36f0a723265 Signed-off-by: Zhen Chen <czhen@google.com>	2017-02-13 15:33:23 -08:00
Shawn Pearce	0bff481d45	Limit receive commands Place a configurable upper bound on the amount of command data received from clients during `git push`. The limit is applied to the encoded wire protocol format, not the JGit in-memory representation. This allows clients to flexibly use the limit; shorter reference names allow for more commands, longer reference names permit fewer commands per batch. Based on data gathered from many repositories at $DAY_JOB, the average reference name is well under 200 bytes when encoded in UTF-8 (the wire encoding). The new 3 MiB default receive.maxCommandBytes allows about 11,155 references in a single `git push` invocation. A Gerrit Code Review system with six-digit change numbers could still encode 29,399 references in the 3 MiB maxCommandBytes limit. Change-Id: I84317d396d25ab1b46820e43ae2b73943646032c Signed-off-by: David Pursehouse <david.pursehouse@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-11 00:20:36 +01:00
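The two capacity figures quoted in the commit above can be reproduced with simple arithmetic if each command is assumed to cost two 40-character hex object ids, two separator spaces, and the reference name. That per-command layout is an assumption chosen because it matches the quoted numbers; the commit message does not spell it out, and `CommandBudget` is a hypothetical helper.

```java
final class CommandBudget {
    /** The 3 MiB default mentioned for receive.maxCommandBytes. */
    static final long MAX_COMMAND_BYTES = 3L * 1024 * 1024;

    /** Assumed fixed cost per command: old id, space, new id, space. */
    static final int FIXED_BYTES = 40 + 1 + 40 + 1;

    /** Number of commands that fit in the limit for a given reference name length. */
    static long maxCommands(int refNameBytes) {
        return MAX_COMMAND_BYTES / (FIXED_BYTES + refNameBytes);
    }
}
```

Under these assumptions `maxCommands(200)` is 11155 and `maxCommands(25)` is 29399, matching the counts given for 200-byte names and for short Gerrit change references.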
David Pursehouse	1834421a7f	BlameGenerator: Annotate #getRenameDetector as Nullable The renameDetector member returned by this method will be null when following file renames has been disabled by previously calling: setFollowFileRenames(false). Annotate it as @Nullable and update the Javadoc to explicitly document the null return. Change-Id: I9bdf443a64cf3c45352d3ab023051a2e11f7426d Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-09 22:40:56 +01:00
David Pursehouse	d9d8c507a4	RefLeaseSpec: Fix Eclipse errors - Remove unused import - Remove unused private constructor - Add Javadoc for public constructor Change-Id: I1253e9fe863ca0f63182461ee87357fbf726ea2e Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-09 15:10:15 +09:00
Shawn Pearce	8fce17a995	Merge "push: support per-ref force-with-lease"	2017-02-08 22:27:06 -05:00
David Turner	46d35a8502	push: support per-ref force-with-lease When rebasing, force-pushing has a race condition: someone else might have pushed a commit since the one you just rewrote. The force-with-lease option prevents this by ensuring that the ref's old value is the one that you expected. Change-Id: I97ca9f8395396c76332bdd07c486e60549ca4401 Signed-off-by: David Turner <dturner@twosigma.com>	2017-02-08 19:42:33 -05:00
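The lease check described in the commit above is essentially a compare-and-set on the ref: the update succeeds only if the ref still holds the value the pusher expected. A minimal in-memory sketch, using a hypothetical `LeasedRefs` class rather than any JGit API:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

final class LeasedRefs {
    private final ConcurrentMap<String, String> refs = new ConcurrentHashMap<>();

    void set(String ref, String id) {
        refs.put(ref, id);
    }

    String get(String ref) {
        return refs.get(ref);
    }

    /**
     * Overwrites the ref only if it still points at expectedOld.
     * Returns false when someone else has moved the ref in the meantime.
     */
    boolean forceWithLease(String ref, String expectedOld, String newId) {
        return refs.replace(ref, expectedOld, newId);
    }
}
```

If a second pusher still expects the old value after the first push has landed, `forceWithLease` returns `false` and the ref keeps the first pusher's commit.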
Shawn Pearce	6450d956bc	Assume GC_REST and GC_TXN also attempted deltas during packing In a DFS repository the DfsGarbageCollector will typically attempt delta compression while creating the three main pack files: GC, GC_REST and GC_TXN. Include all of these in the wasDeltaAttempted() decision so that future packers can bypass delta compression of non-delta objects. Change-Id: Ic2330c69fab0c494b920b4df0a290f3c2e1a03d7	2017-02-08 15:34:00 -08:00
Shawn Pearce	d67b183537	Prefer smaller GC files during DFS garbage collection In `8ac65d33ed` PackWriter changed its behavior to always prefer the last object representation presented to it by the ObjectReuseAsIs implementation. This was a fix to avoid delta chain cycles. Unfortunately it can lead to suboptimal compression when concurrent GCs are run on the same repository. One case is automatic GC running (with default settings) in parallel to a manual GC that has disabled delta reuse in order to generate new smaller deltas for the entire history of the repository. Running GC with no-reuse generally requires more CPU time, which also translates to a longer running time. This can lead to a race where the automatic GC completes before the no-reuse GC, leaving the repository in a state such as: no-reuse GC: size 1 GiB, mtime = 18:45 auto GC: size 8 GiB, mtime = 17:30 With the default sort ordering, the smaller no-reuse GC pack is sorted earlier in the pack list, due to its more recent mtime. During object reuse in a future GC, these smaller representations are considered first by PackWriter, but are all discarded when the auto GC file from 17:30 is examined second (due to its older mtime). Work around this in two ways. Well formed DFS repositories should have at most 1 GC pack. If 2 or more GC packs exist, break the sorting tie by selecting the smaller file earlier in the pack list. This allows all normal read code paths to favor the smaller file, which places less pressure on the DfsBlockCache. If any GC race happens, readers serving clone requests will prefer the file that is smaller. During object reuse, flip this ordering so that the smaller file is last. This allows PackWriter to see smaller deltas last, replacing larger representations that were previously considered from other pack files. Change-Id: I0b7dc8bb9711c82abd6bd16643f518cfccc6d31a	2017-02-08 14:37:12 -08:00
Shawn Pearce	61d4922928	Fix missing deltas near type boundaries Delta search was discarding discovered deltas if an object appeared near a type boundary in the delta search window. This has caused JGit to produce larger pack files than other implementations of the packing algorithm. Delta search works by pushing prior objects into a search window, an ordered list of objects to attempt to delta compress the next object against. (The window size is bounded, avoiding O(N^2) behavior.) For implementation reasons multiple object types can appear in the input list, and the window. PackWriter commonly passes both trees and blobs in the input list handed to the DeltaWindow algorithm. The pack file format requires an object to only delta compress against the same type, so the DeltaWindow algorithm must stop doing comparisons if a blob would be compared to a tree. Because the input list is sorted by object type and the window is recently considered prior objects, once a wrong type is discovered in the window the search algorithm stops and uses the current result. Unfortunately the termination condition was discarding any found delta by setting deltaBase and deltaBuf to null when it was trying to break the window search. When this bug occurs, the state of the DeltaWindow looks like this: current \| \ / input list: tree0 tree1 blob1 blob2 window: blob1 tree1 tree0 / \ \| res.prev As the loop iterates to the right across the window, it first finds that blob1 is a suitable delta base for blob2, and temporarily holds this in the bestDelta/deltaBuf fields. It then considers tree1, but tree1 has the wrong type (blob != tree), so the window loop must give up and fall through the remaining code. Moving the condition up and discarding the window contents allows the bestDelta/deltaBuf to be kept, letting the final file delta compress blob1 against blob0. The impact of this bug (and its fix) on real world repositories is likely minimal.
The boundary from blob to tree happens approximately once in the search, as the input list is sorted by type. Only the first window size worth of blobs (e.g. 10 or 250) were failing to produce a delta in the final file. This bug fix does produce significantly different results for small test repositories created in the unit test suite, such as when a pack may contain 6 objects (2 commits, 2 trees, 2 blobs). Packing test cases can now better sample different output pack file sizes depending on delta compression and object reuse flags in PackConfig. Change-Id: Ibec09398d0305d4dbc0c66fce1daaf38eb71148f	2017-02-08 14:36:24 -08:00
Shawn Pearce	12c8462602	Merge "Reintroduce garbage pack coalescing when ttl > 0."	2017-02-08 00:23:40 -05:00
Thirumala Reddy Mutchukota	006f4d4d29	Reintroduce garbage pack coalescing when ttl > 0. Disabling the garbage pack coalescing when garbageTtl > 0 can result in a lot of garbage packs if they are created within the garbageTtl time. To avoid a large number of garbage packs, reintroduce garbage pack coalescing for the packs that are created within a single calendar day when the garbageTtl is more than one day or one third of the garbageTtl. Change-Id: If969716aeb55fb4fd0ff71d75f41a07638cd5a69 Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	2017-02-07 20:34:31 -08:00
David Pursehouse	5336a07386	Merge "Branch normalizer should not normalize already valid branch names"	2017-02-07 07:31:06 -05:00
Matthias Sohn	08480c948c	[infer] Fix ObjectWalk leak in PackWriter.preparePack() Change-Id: I5d2455404e507faa717e9d916e9b6cd80aa91473 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-07 00:50:09 +01:00
Matthias Sohn	f8d232213c	Branch normalizer should not normalize already valid branch names Change-Id: Ib746655e32a37c4ad323f1d12ac0817de8fa56cf Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-07 00:24:39 +01:00
Bo Zhang	d4bd09b78d	Follow redirects in transport Bug: 465167 Change-Id: I6da19c8106201c2a1ac69002bd633b7387f25d96 Signed-off-by: Bo Zhang <zhangbodut@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-02 21:20:23 -04:00
Matthias Sohn	566794d001	Merge branch 'stable-4.6' * stable-4.6: GC: delete empty directories after purging loose objects GC.prune(Set<ObjectId>): return early if objects directory is empty Change-Id: I3d6cacf80d3b4c69ba108e970855963bd9f6ee78 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-02 23:36:28 +01:00
Matthias Sohn	18cda3888c	GC: delete empty directories after purging loose objects In order to limit the number of directories we check for emptiness only consider fanout directories which contained unreferenced loose objects we deleted in the same gc run. Change-Id: Idf8d512867ee1c8ed40bd55752122ce83a98ffa2 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-02-01 23:44:07 +01:00
David Pursehouse	b20f7d610e	Organize imports Change-Id: I97044f69d220fc2d3f9fe890fdfec542454f02d2 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-02-01 14:31:44 +09:00
Hongkai Liu	a33663fd4e	Detect stale-file-handle error in causal chain Cover the case where the exception is wrapped up as a cause, e.g., PackIndex#open(File). Change-Id: I0df5b1e9c2ff886bdd84dee3658b6a50866699d1 Signed-off-by: Hongkai Liu <hongkai.liu@ericsson.com>	2017-01-30 22:36:59 -04:00
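Checking the causal chain, as the commit above describes, amounts to walking `getCause()` until a matching exception is found. The sketch below is an assumption-laden illustration: `StaleHandle` is a hypothetical class, and matching on the message text "stale file handle" is a guess at how such an error might be recognised, since the exact message depends on the operating system.

```java
import java.io.IOException;
import java.util.Locale;

final class StaleHandle {
    /** Assumed check: an IOException whose message mentions a stale file handle. */
    static boolean isStaleFileHandle(Throwable t) {
        return t instanceof IOException
                && t.getMessage() != null
                && t.getMessage().toLowerCase(Locale.ROOT).contains("stale file handle");
    }

    /** Applies the check to the throwable and to each of its causes. */
    static boolean isStaleFileHandleInCausalChain(Throwable t) {
        // The depth bound guards against a cyclic cause chain.
        for (int depth = 0; t != null && depth < 32; depth++, t = t.getCause()) {
            if (isStaleFileHandle(t)) {
                return true;
            }
        }
        return false;
    }
}
```

A wrapped error such as `new RuntimeException(new IOException("Stale file handle"))` is detected by the chain-walking variant but not by the direct check.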
David Pursehouse	62411453f1	Merge branch 'stable-4.6' * stable-4.6: Clean up orphan files in GC Change-Id: I4fb6b4cd03d032535a9c04ede784bea880b4536b Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-31 09:31:10 +09:00
David Pursehouse	25ab5b4d9b	Merge "Don't rely on default locale when using toUpperCase() and toLowerCase()"	2017-01-30 07:32:32 -05:00
Hector Caballero	27b710c394	Make GC cancellable when called programmatically Sometimes, it is necessary to cancel a garbage collection operation. When GC is called using the standalone executable, i.e., from a command line, pressing Control-C does the trick. When calling GC programmatically, though, there is no mechanism to do it. Add checks in the GC process so that a custom cancellable progress monitor can be passed in order to cancel the operation at specific points. In this case, the calling process sets the cancel flag in the progress monitor and the GC process will throw an exception that can be caught and handled by the caller accordingly. Change-Id: Ieaecf3dbdf244539ec734939c065735f6785aacf Signed-off-by: Hector Caballero <hector.caballero@ericsson.com>	2017-01-29 20:14:37 -04:00
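The cancellation pattern in the commit above, where the caller sets a flag and the long-running work checks it at defined points and throws, can be sketched as follows. `CancellableWork`, `Monitor`, and `CancelledException` are hypothetical names for illustration and do not mirror the JGit progress monitor API.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicBoolean;

final class CancellableWork {
    /** Thrown when the monitor was cancelled before the work finished. */
    static final class CancelledException extends RuntimeException {
        CancelledException(String message) {
            super(message);
        }
    }

    /** Minimal monitor: the caller flips the flag, the worker polls it. */
    static final class Monitor {
        private final AtomicBoolean cancelled = new AtomicBoolean();

        void cancel() {
            cancelled.set(true);
        }

        boolean isCancelled() {
            return cancelled.get();
        }
    }

    /** Runs the steps in order, checking for cancellation before each one. */
    static int runSteps(List<Runnable> steps, Monitor monitor) {
        int done = 0;
        for (Runnable step : steps) {
            if (monitor.isCancelled()) {
                throw new CancelledException("cancelled after " + done + " steps");
            }
            step.run();
            done++;
        }
        return done;
    }
}
```

Cancellation only takes effect at the check points, so a step that is already running completes before the exception is raised.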
Matthias Sohn	a11bb03127	GC.prune(Set<ObjectId>): return early if objects directory is empty Change-Id: Id56b102604c4e0437230e3e7c59c0a3a1b676256 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-30 00:55:38 +01:00
Hongkai Liu	8fd500e20c	Clean up orphan files in GC An orphan file is either a bitmap or an idx file in pack folder, and its corresponding pack file is missing. Change-Id: I3c4cb1f7aa99dd7b398bdb8d513f528d7761edff Signed-off-by: Hongkai Liu <hongkai.liu@ericsson.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-30 00:55:36 +01:00
David Pursehouse	acc94c475a	RepoCommand#readFile: Don't call Git#getRepository() in try-with-resource Using try-with-resource means that close() will automatically be called on the Repository object. However, according to the javadoc of Git#close(): If the repository was opened by a static factory method in this class, then this method calls Repository#close() on the underlying repository instance. This means that Repository#close() is called twice, by Git.close() and in the outer try-with-resource, leading to a corrupt use count. Change-Id: I37ba517eb2cc67d1cd36813598772c70208d0bc9 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-28 17:46:28 +01:00
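The double close described in the commit above can be reproduced with a toy reference-counted resource. `CountedRepository` and `GitFacade` are hypothetical stand-ins for the real classes; the only point is that listing both objects in one try-with-resources statement decrements the count twice.

```java
/** Hypothetical repository that tracks how many users still hold it open. */
final class CountedRepository implements AutoCloseable {
    private int useCount = 1;

    int useCount() {
        return useCount;
    }

    @Override
    public void close() {
        useCount--;
    }
}

/** Hypothetical facade that owns its repository and closes it on close(). */
final class GitFacade implements AutoCloseable {
    private final CountedRepository repository = new CountedRepository();

    CountedRepository getRepository() {
        return repository;
    }

    @Override
    public void close() {
        repository.close();
    }
}
```

With `try (GitFacade git = new GitFacade(); CountedRepository repo = git.getRepository()) { }` the repository is closed once directly and once through the facade, leaving a use count of -1. Declaring only the facade as a resource keeps the count at 0.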
Matthias Sohn	a4feeb0194	Don't rely on default locale when using toUpperCase() and toLowerCase() Otherwise these methods may produce unexpected results if used for strings that are intended to be interpreted locale independently. Examples are programming language identifiers, protocol keys, and HTML tags. For instance, "TITLE".toLowerCase() in a Turkish locale returns "t\u0131tle", where '\u0131' is the LATIN SMALL LETTER DOTLESS I character. See https://docs.oracle.com/javase/8/docs/api/java/lang/String.html#toLowerCase-- http://blog.thetaphi.de/2012/07/default-locales-default-charsets-and.html Bug: 511238 Change-Id: Id8d8f37d84d62239c918b81f8d883ed798d87656 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-28 15:06:15 +01:00
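The Turkish-locale pitfall mentioned in the commit above is easy to reproduce with the JDK alone. The sketch passes an explicit locale instead of relying on the default; `LocaleCase` is a hypothetical helper, and `Locale.ROOT` is used here as one locale-independent choice.

```java
import java.util.Locale;

final class LocaleCase {
    /** Locale-independent lower-casing, suitable for protocol keys and identifiers. */
    static String lowerInvariant(String s) {
        return s.toLowerCase(Locale.ROOT);
    }

    /** Lower-casing under Turkish rules, where 'I' maps to dotless 'ı' (U+0131). */
    static String lowerTurkish(String s) {
        return s.toLowerCase(Locale.forLanguageTag("tr"));
    }
}
```

`LocaleCase.lowerTurkish("TITLE")` returns `"t\u0131tle"`, while `LocaleCase.lowerInvariant("TITLE")` returns `"title"` regardless of the JVM's default locale.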
David Pursehouse	2eb1bebd60	Repository: Include repository name when logging corrupt use count Logging the repository name makes it easier to track down what is incorrectly closing a repository. Change-Id: I42a8bdf766c0e67f100adbf76d9616584e367ac2 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-27 15:59:09 +09:00
Thirumala Reddy Mutchukota	c9f55032a2	Record the estimated size of the pack files. The Compacter and Garbage Collector will record the estimated size of the compact, gc or garbage packs they are about to create. This information can be used by the clients to better decide how to actually store the pack based on the approximate expected size. Added a new protected method DfsObjDatabase.newPack(PackSource packSource, long estimatedPackSize), so that the clients can override this method to make use of the estimatedPackSize while creating a new PackDescription object. The default implementation of this method is equivalent to newPack(packSource).setEstimatedPackSize(estimatedPackSize). I didn't make it abstract because that would force all the existing subclasses of DfsObjDatabase to implement this method. Due to this default implementation, the estimatedPackSize is added to DfsPackDescription using a setter instead of a constructor parameter (even though a constructor parameter would be a better choice as this value is set only during the object creation). Change-Id: Iade1122633ea774c2e842178a6a6cbb4a57b598b Signed-off-by: Thirumala Reddy Mutchukota <thirumala@google.com>	2017-01-26 12:01:59 -08:00
Lars Vogel	71edc8bd6f	Fixes Javadoc error in org.eclipse.jgit created with I59539ac Adds the param information to the private method. These are generated via tooltip to resolve the compile errors. Bug: 511043 Change-Id: I9ba551978eab750326d1a067b296e3ae93925871 Signed-off-by: Lars Vogel <Lars.Vogel@vogella.com>	2017-01-25 12:40:59 -04:00
Jonathan Nieder	061d24f6d5	Remove @since tags from internal packages These packages don't use @since tags because they are not part of the stable public API. Some @since tags snuck in, though. Remove them to make the convention easier to find for new contributors and the expectations clearer for users. Change-Id: I6c17d3cfc93657f1b33cf5c5708f2b1c712b0d31	2017-01-24 14:41:24 -08:00
David Turner	8bec98cec0	gc: loosen unreferenced objects An unreferenced object might appear in a pack. This could only happen because it was previously referenced, and then later that reference was removed. When we gc, we copy the referenced objects into a new pack, and delete the old pack. This would remove the unreferenced object. Now we first create a loose object from any unreferenced object in the doomed pack. This kicks off the two-week grace period for that object, after which it will be collected if it's not referenced. This matches the behavior of regular git. Change-Id: I59539aca1d0d83622c41aa9bfbdd72fa868ee9fb Signed-off-by: David Turner <dturner@twosigma.com> Signed-off-by: Jonathan Nieder <jrn@google.com>	2017-01-24 14:22:45 -08:00
Matthias Sohn	d3c4c0622f	[infer] Mark ManifestParse.getFilteredProjects non-null Change-Id: I05653df7a0337443d2c8e53f47f4e95ec9ca1a9c Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-23 19:55:20 +01:00
Matthias Sohn	b686c8468c	[infer] Fix potential NPE in DiffFormatter Change-Id: Ia33e2af9ce3393d9173ca0dc7efefd86c965d8c8 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-23 19:55:18 +01:00
Matthias Sohn	423a583fcc	[infer] Fix potential NPE in CloneCommand Change-Id: Ie7eeba3ae719ff207c7535d535a9e0bd6c9e99e6 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-23 19:55:12 +01:00
David Pursehouse	dd5e500a57	Format Bazel files with buildifier Change-Id: I934114315d2c7cab917f1011b8e55c52367d429f Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-22 22:34:11 +01:00
Shawn Pearce	131b09106f	Change StreamGobbler to Runnable to avoid unused Future It can be considered a programming error to create a Future<T> but do nothing with that object. There is an async computation happening and without holding and checking the Future for done or exception the caller has no idea if it has completed. FS doesn't really care about these StreamGobblers finishing. Instead use Runnable with execute(Runnable), which doesn't return a Future. Change-Id: I93b66d1f6c869e66be5c1169d8edafe781e601f6	2017-01-21 09:44:14 +01:00
Matthias Sohn	f503a9f5b7	Add missing @since tags on new API constants Change-Id: Ia8b861da07fba99644ccc9eb5578a46cc39600a1 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-19 23:56:25 +01:00
James Melvin	91132bb05e	gc: Add options to preserve and prune old pack files The new --preserve-oldpacks option moves old pack files into the preserved subdirectory instead of deleting them after repacking. The new --prune-preserved option prunes old pack files from the preserved subdirectory after repacking, but before potentially moving the latest old packfiles to this subdirectory. These options are designed to prevent stale file handle exceptions during git operations, which can happen for users of NFS repos when repacking is done on them. The strategy is to keep old pack files around until the next repack, in the hope that they will become unreferenced by then and not cause any exceptions to running processes when they are finally deleted (pruned). Change-Id: If3f729f0d9ce920ee2c3e6acdde46f2068be61d2 Signed-off-by: James Melvin <jmelvin@codeaurora.org>	2017-01-19 11:00:18 +01:00
David Ostrovsky	e92a0c3adc	Implement initial framework of Bazel build The initial implementation only builds the packages consumed by Gerrit Code Review. Test build and execution is not implemented. We prefer to consume maven_jar custom rule from bazlets repository, for the same reasons as in the Gerrit project: * Caching artifacts across different clones and projects * Exposing source classifiers and neverlink artifact TEST PLAN: $ bazel build :all $ unzip -t bazel-genfiles/all.zip Archive: bazel-genfiles/all.zip testing: libjgit-archive.jar OK testing: libjgit-servlet.jar OK testing: libjgit.jar OK testing: libjunit.jar OK No errors detected in compressed data of bazel-genfiles/all.zip. Change-Id: Ia837ce95d9829fe2515f37b7a04a71a4598672a0 Signed-off-by: David Ostrovsky <david@ostrovsky.org> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-18 19:13:16 -04:00
Wim Jongman	b667c182cb	Normalizer creating a valid branch name from a string Generic normalization method for a possible invalid branch name. The method compresses dividers between spaces, then replaces spaces and non word characters with underscores. This method is needed in preparation for subsequent EGit changes. Bug: 509878 Change-Id: Ic0d12f098f90f912a45bcc5693d6accf751d4e58 Signed-off-by: Wim Jongman <wim.jongman@remainsoftware.com>	2017-01-18 22:05:28 +01:00
Christian Halstrick	8a46b60371	Merge "Fix StashApplyCommand for stashes containing untracked changes."	2017-01-16 03:45:00 -05:00
Thomas Wolf	46af7192a2	Fix StashApplyCommand for stashes containing untracked changes. If there are untracked changes, apply only the untracked tree after a successful merge. The merge tree from merging untracked with HEAD would also contain files already reset before (changes in tracked files) and try to reset those again, leading to false checkout conflicts. Bug: 505804 Change-Id: Iaced4d277623334d11e3d1cca5969590d7c5093e Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	2017-01-15 21:54:12 +01:00
Marc Strapetz	1c4b3f8c45	Fix possible InvalidObjectIdException in ObjectDirectory ObjectDirectory.getShallowCommits should throw an IOException instead of an InvalidArgumentException if invalid SHAs are present in .git/shallow (as this file is usually edited by a human). Change-Id: Ia3a39d38f7aec4282109c7698438f0795fbec905 Signed-off-by: Marc Strapetz <marc.strapetz@syntevo.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2017-01-15 15:05:51 +01:00
Zhen Chen	d6b354f60f	Skip pack header bytes in DfsPackFile The 12 bytes `PACK...` header is written in PackWriter before reading CachedPack files. In DfsPackFile#copyPackBypassCache, the header was not skipped when the first block is not in cache. Change-Id: Ibbe2e564d36b79922a936657f286addb1044d237 Signed-off-by: Zhen Chen <czhen@google.com>	2017-01-13 22:10:42 -08:00
Dariusz Luksza	0e187f1484	Add LfsPointerFilter TreeFilter Add new variation of TreeFilter in order to detect LFS pointer files in the repository. Additionally, update LfsPointer to support the legacy version URL [1] as described in [2], and to allow arbitrary fields in the pointer file. [1] https://hawser.github.com/spec/v1 [2] https://github.com/git-lfs/git-lfs/blob/master/docs/spec.md Change-Id: I621eb058619fb1b78888a54c4b60bb110a722fc3 Signed-off-by: Dariusz Luksza <dariusz@luksza.org> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2017-01-10 00:13:24 +01:00
Shawn Pearce	db77610256	Pack refs/tags/ with refs/heads/ This fixes a nasty performance issue for repositories that have many objects referenced through refs/tags/, but not in refs/heads/. Situations like this can arise when a project has made releases like refs/tags/v1.0, and then decides to orphan history and start over for version 2. The v1.0 objects are not reachable from master anymore, but are still live due to the v1.0 tag. When tags are packed in the GC_OTHER pack, bitmaps are not able to cover the repository's contents. This may cause very slow counting times during git clone, as the server must enumerate the ancient history under refs/tags/ to respond to the client. Clients by default always ask for all tags when asking for all heads during clone. This has been true since git-core commit 8434c2f1afedb (Apr 27 2008), when clone was converted to a builtin. Including tags in the main GC pack should still allow servers to benefit from the fast full pack reuse path when serving a clone to a client. Change-Id: I22e29517b5bc6fa3d6b19a19f13bef0c68afdca3	2017-01-03 14:46:41 -08:00
Marc Strapetz	6087031469	Get rid of javax.servlet API dependency for core org.eclipse.jgit Change-Id: I57d5d4fab7e0b1bd4cf5f1850e8569c8ac5def88 Signed-off-by: Marc Strapetz <marc.strapetz@syntevo.com> Signed-off-by: Andrey Loskutov <loskutov@gmx.de>	2017-01-03 18:50:55 +01:00
Matthias Sohn	5dc30db56e	[findBugs] PackWriter.NONE should be final Change-Id: I4b5621bcb4db82e0560408b3cde6f18b0cc55b29 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-30 01:19:58 +01:00
Matthias Sohn	29ddbf7fcd	[findBugs] Remove reliance on default encoding in Base64 Change-Id: I6901da975a86c460ce7c783a519669d8be8e23bb Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-29 19:50:29 +01:00
Matthias Sohn	f63267134f	[findBugs] Fix potential NPE in GC Change-Id: I59cda76b2c5039e08612f394ee4f7f1788578c49 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-29 00:59:33 +01:00
James Melvin	d980a3fa85	Fix keep pack filename Previously it was looking for a keep file with the name of a pack file (extension included) appended with a '.keep'. However, the keep file name should be the pack file name with a '.keep' extension. Change-Id: I9dc4c7c393ae20aefa0b9507df8df83610ce4d42 Signed-off-by: James Melvin <jmelvin@codeaurora.org>	2016-12-27 14:08:56 -07:00
Matthias Sohn	5fee071f6a	Prepare 4.7.0-SNAPSHOT builds Change-Id: I20754d13007e6591d36aae5766f3a9a82b24e120 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-27 01:45:50 +01:00
Matthias Sohn	3857c3168f	Prepare 4.6.1-SNAPSHOT builds Change-Id: I6b05a6f6c3f92365c272e1bdaf76093ca01f2d58 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-24 15:51:54 +01:00
Matthias Sohn	73a4325149	JGit v4.6.0.201612231935-r Change-Id: Iaa88fe1b195dfe6be99a7b4cb064684e75563715 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-24 01:42:38 +01:00
Matthias Sohn	5274da3c3c	Merge branch 'stable-4.5' * origin/stable-4.5: Fix one case of missing object Change-Id: Ia6384f4be71086d5a0a8c42c7521220f57dfd086 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-24 00:30:00 +01:00
Matthias Sohn	1fb2319c18	[infer] Fix resource leak in IndexDiff We only need the tree id to add it to a TreeWalk so change tree's type to AnyObjectId. Bug: 509385 Change-Id: I98dd5fef15cd173fe1fd84273f0f48e64e12e608 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-21 23:51:50 +01:00
Matthias Sohn	325cb35ccd	[infer] Fix resource leak in ObjectChecker Bug: 509385 Change-Id: I6b6ff5b721d959eb0708003a40c8f97d6826ac46 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-21 00:50:21 +01:00
Matthias Sohn	f30fe13ac9	[infer] Fix a resource leak in PackWriter Bug: 509385 Change-Id: Ic8a82895fa39be73f1bd8427cfe9437be6fc4e3e Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-19 08:38:41 +01:00
Matthias Sohn	6cbc99d3ee	[infer] Fix resource leaks in DfsInserter Bug: 509385 Change-Id: Id5dc40bb3fb9da97ea0795cca1f2bcdcde347767 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-19 00:02:43 +01:00
Matthias Sohn	a498a2865e	[infer] Fix resource leak in ManifestParser Bug: 509385 Change-Id: Icfe58ac2e5344546448a55ad14ec082356be968c Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 23:02:47 +01:00
Matthias Sohn	e78626f414	[infer] Fix resource leak in RepoCommand Bug: 509385 Change-Id: I30c427f0dd2fc1fceb6b003dfdee0a05efaefca9 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 23:02:45 +01:00
Matthias Sohn	1779fb4a57	[infer] Fix resource leak in DirCache Bug: 509385 Change-Id: I5f914c910ef3a7583594fb31c7757d3dddf6a05e Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 13:49:37 +01:00
Matthias Sohn	aa199ff648	[infer] Fix SubmoduleWalk leaks in submodule commands Bug: 509385 Change-Id: I4cba81d8ea596800a40799dc9cb763fae01fe508 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 13:49:36 +01:00
Matthias Sohn	fbcc2cb4ca	[infer] Fix resource leaks in SubmoduleAddCommand Bug: 509385 Change-Id: I9d25cf117cfb19df108f5fe281232193fd898474 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 13:49:33 +01:00
Matthias Sohn	82344bd7a2	[infer] Fix resource leaks in RebaseCommand Bug: 509385 Change-Id: I9fbdfda59f7bc577aab55dc92ff897b00b5cb050 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 10:38:24 +01:00
Matthias Sohn	05e8cdf563	[infer] Fix resource leak in BlameCommand Bug: 509385 Change-Id: Ic57fd3bf940752229e35102e7761823f7d3d8732 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-18 10:38:23 +01:00
Hector Oswaldo Caballero	4ddd4a3d1b	Fix one case of missing object When a repository is being GCed and a concurrent push is received, there is the possibility of having a missing object. This is due to the fact that after the list of objects to delete is built, there is a window of time when an unreferenced and ready to delete object can be referenced by the incoming push. In that case, the object would be deleted because there is no way to know it is no longer unreferenced. This will leave the repository in an inconsistent state and most of the operations fail with a missing tree/object error. Given the incoming push change the last modified date for the now referenced object, verify this one is still a candidate to delete before actually performing the delete operation. Change-Id: Iadcb29b8eb24b0cb4bb9335b670443c138a60787 Signed-off-by: Hector Oswaldo Caballero <hector.caballero@ericsson.com>	2016-12-13 10:47:05 -05:00
Christian Halstrick	11d24e6844	Fix FileSnapshot.isModified FileSnapshot.isModified may have reported a file to be clean although it was actually dirty. Imagine you have a FileSnapshot on file f. lastModified and lastRead are both t0. Now time is t1 and you 1) modify the file 2) update the FileSnapshot of the file (lastModified=t1, lastRead=t1) 3) modify the file again 4) wait 3 seconds 5) ask the FileSnapshot whether the file is dirty or not. It erroneously answered it's clean. Any file which has been modified more than 2.5 seconds ago was reported to be clean. As the test shows that's not always correct. The real-world problem fixed by this change is the following: * A gerrit server using JGit to serve git repositories is processing fetch requests while simultaneously a native git garbage collection runs on the repo. * At time t1 native git writes temporary files in the pack folder setting the mtime of the pack folder to t1. * A fetch request causes JGit to search for new packfiles and JGit remembers this scan in a FileSnapshot on the packs folder. Since the gc is not finished JGit doesn't see any new packfiles. * The fetch is processed and the gc ends while the filesystem timer is still t1. GC writes a new packfile and deletes the old packfile. * 3 seconds later another request arrives. JGit does not yet know about the new packfile but is also not rescanning the pack folder because it cached that the last scan happened at time t1 and pack folder's mtime is also t1. Now JGit will not be able to resolve any object contained in this new pack. This behavior may be persistent if objects referenced by the refs/meta/config branch are affected so gerrit can't read permissions stored in the refs/meta/config branch anymore and will not allow any pushes anymore. The pack folder will not change its mtime and therefore no rescan will take place. Change-Id: I3efd0ccffeb97b01207dc3e7a6b85c6b06928fad Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-12-13 11:28:12 +01:00
Zhen Chen	d621305588	Decide whether to "Accept-Encoding: gzip" on a request-by-request basis When the reply is already compressed (e.g. a packfile fetched using dumb HTTP), "Content-Encoding: gzip" wastes bandwidth relative to sending the content raw. So don't "Accept-Encoding: gzip" for such requests. Change-Id: Id25702c0b0ed2895df8e9790052c3417d713572c Signed-off-by: Zhen Chen <czhen@google.com>	2016-12-09 16:24:50 -08:00
David Pursehouse	654ae82970	Replace usage of deprecated EWAHCompressedBitmap.add(long) The add(long) method was deprecated in favor of addWord(long) in the 0.8.3 release of JavaEWAH [1]. [1] https://github.com/lemire/javaewah/commit/e443cf5e Change-Id: I89c397ed02e040f57663d04504399dfdc0889626 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-12-07 22:23:31 -04:00
Christian Halstrick	930cd43553	Fix merge-base calculation Fix JGit's merge-base calculation in case of inconsistent commit times. JGit was potentially failing to compute correct merge-bases when the commit times were inconsistent (a parent commit was younger than a child commit). The code in MergeBaseGenerator was aware of the fact that sometimes the discovery of a merge base x can occur after the parents of x have been seen (see comment in #carryOntoOne()). But in the light of inconsistent commit times it was possible that these parents of a merge-base have already been returned as a merge-base. This commit fixes the bug by buffering all commits generated by MergeBaseGenerator. It is expected that this buffer will be small because the number of merge-bases will be small. Additionally a new flag is used to mark the ancestors of merge-bases. This allows filtering out the unwanted commits. Bug: 507584 Change-Id: I9cc140b784c3231b972bd2c3de61a789365237ab	2016-11-28 09:38:19 +01:00
Grace Wang	fe329f5db4	Specify RevisionSyntaxException message in Repository#resolve This does not address all cases where no message is specified, only cases where Repository#isValidRefName returns false. Change-Id: Ib88cdabfdcdf37be0053e06949b0e21ad87a9575 Signed-off-by: Grace Wang <gracewang92@gmail.com>	2016-11-24 03:56:01 -04:00
Matthias Sohn	c6cfe500b5	Add missing @since tags for new API Change-Id: I900d745195f58c067fadf209bb92cd3c852c59f4 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-11-23 23:29:41 +01:00
Zhen Chen	8803718493	dumb HTTP: Avoid being confused by Content-Length of a gzipped stream TransportHttp sets 'Accept-Encoding: gzip' to allow the server to compress HTTP responses. When fetching a loose object over HTTP, it uses the following code to read the response: InputStream in = openInputStream(c); int len = c.getContentLength(); return new FileStream(in, len); If the content is gzipped, openInputStream decompresses it and produces the correct content for the object. Unfortunately the Content-Length header contains the length of the compressed stream instead of the actual content length. Use a length of -1 instead since we don't know the actual length. Loose objects are already compressed, so the gzip encoding typically produces a longer compressed payload. The value from the Content-Length is too high, producing EOFException: Short read of block. Change-Id: I8d5284dad608e3abd8217823da2b365e8cd998b0 Signed-off-by: Zhen Chen <czhen@google.com> Helped-by: Jonathan Nieder <jrn@google.com>	2016-11-22 21:21:24 -04:00
Zhen Chen	5af3f9bd63	Close input stream after use The InputStream in FileStream in downloadPack is never closed. Change-Id: I59975d0b8d51f4b3e3ba9d4496b254d508cb936d Signed-off-by: Zhen Chen <czhen@google.com>	2016-11-22 12:00:42 -08:00
Shawn Pearce	81f9c18433	Define MonotonicClock interface for advanced timestamps MonotonicClock can be implemented to provide more certainty about time than the standard System.currentTimeMillis() can provide. This can be used by classes such as PersonIdent and Ketch to rely on more certainty about time moving in a strictly ascending order. Gerrit Code Review can also leverage this interface through its embedding of JGit and use MonotonicClock and ProposedTimestamp to provide stronger assurance that NoteDb time is moving forward. Change-Id: I1a3cbd49a39b150a0d49b36d572da113ca83a786	2016-11-21 11:34:14 -08:00
Dave Borowitz	5bb434e01f	Update JavaEWAH to 1.1.6 Use Oxygen M3 Orbit repository which provides the bundles built using the new orbit-recipe based build. CQ: 11658 Change-Id: I7f3dcc966732b32830c75d5daa55383bd028d182 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-11-17 00:26:44 +01:00
Matthias Sohn	52fa09c8d4	Add missing @since tags for new API Change-Id: Iaf83f66637d6a13e4a6d096ba8529553af7e30ed Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-11-14 16:10:40 -08:00
Shawn Pearce	2685f4b101	Fix cryptoVer constant name to CRYPTO_VER Change-Id: I46c39f2eceb4d58e49bd6273b87711f35250ab5c	2016-11-14 15:52:43 -08:00
Shawn Pearce	3b2248c5cf	RepositoryCache: simplify code The type parameters can now be inferred when creating ConcurrentHashMap. A for loop over the keys of a ConcurrentHashMap doesn't need to use an Iterator<Map.Entry>; loop syntax handles this just fine over keySet(). Change-Id: I1f85bb81b77f7cd1caec77197f2f0bf78e4a82a1	2016-11-14 15:51:55 -08:00
Shawn Pearce	ca4ef2d24b	Add missing @Override found by ErrorProne Change-Id: I585242507815aad4aa0103fd55a6c369e342ab33	2016-11-14 15:46:28 -08:00
Shawn Pearce	8208da2f59	Deprecate SafeBufferedOutputStream Java 8 fixed the silent flush during close issue by FilterOutputStream (base class of BufferedOutputStream) using try-with-resources to close the stream, getting a behavior matching what JGit's SafeBufferedOutputStream was doing: try { flush(); } finally { out.close(); } With Java 8 as the minimum required version to run JGit it is no longer necessary to override close() or have this class. Deprecate the class, and use the JRE's version of close. Change-Id: Ic0584c140010278dbe4062df2e71be5df9a797b3	2016-11-14 15:33:54 -08:00
David Pursehouse	3e52252622	Merge "Support {get,set}GitwebDescription on InMemoryRepository"	2016-11-14 17:54:17 -05:00
Shawn Pearce	71ea0fe567	Support {get,set}GitwebDescription on InMemoryRepository This simplifies testing for Gerrit Code Review where application code is updating the repository description and the test harness uses InMemoryRepository. Change-Id: I9fbcc028ae24d90209a862f5f4f03e46bfb71db0	2016-11-14 14:40:21 -08:00
David Pursehouse	a45cfee7a3	Organize imports Change-Id: I7c545d06b1bced678c020fab9af1382bc4416b6e Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-11-14 13:25:20 -08:00
Shawn Pearce	1c70dd6d21	Add {get,set}GitwebDescription to Repository This method pair allows the caller to read and modify the description file that is traditionally used by gitweb and cgit when rendering a repository on the web. Gerrit Code Review has offered this feature for years as part of its GitRepositoryManager interface, but it's fundamentally a feature of JGit and its Repository abstraction. git-core typically initializes a repository with a default value inside the description file. During getDescription() this string is converted to null as it is never a useful description. Change-Id: I0a457026c74e9c73ea27e6f070d5fbaca3439be5	2016-11-14 11:14:35 -08:00
Shawn Pearce	eeb0705ef3	Merge "Don't serialize internal hash collision chain link"	2016-11-14 13:27:11 -05:00
Shawn Pearce	a0bac65233	Merge "Reduce synchronized scope around ConcurrentHashMap"	2016-11-14 13:25:38 -05:00
Jonathan Nieder	7b8a0a28bf	Merge "StreamCopyThread: Remove unused AtomicInteger import"	2016-11-13 19:05:50 -05:00
Jonathan Nieder	f21233fd0e	StreamCopyThread: Remove unused AtomicInteger import I forgot to do this in `97f3baa0d3` (StreamCopyThread: Remove unnecessary flushCount, 2016-11-13). Change-Id: Iaed9f345848cf0f854c9d0debcf94bc831d53054	2016-11-13 16:01:16 -08:00
Matthias Sohn	707e4538c2	Merge "Extract insecure Cipher factory"	2016-11-13 19:00:27 -05:00
Jonathan Nieder	2185d84c1a	Merge "Get rid of SoftReference in RepositoryCache"	2016-11-13 18:43:57 -05:00
Shawn Pearce	9df75b755f	Extract insecure Cipher factory Bazel runs ErrorProne by default and ErrorProne rightly complains that allowing the user to specify any Cipher can lead to insecure code (in particular, getCipher("AES") operates in ECB mode). Unfortunately this is required to support existing repositories insecurely stored on S3. Extract the insecure factory code to its own class so this can be built as a java_library() with this check disabled. Change-Id: I34f381965bdaa25d5aa8ebf6d8d5271b238334e0	2016-11-13 19:28:45 -04:00
Jonathan Nieder	96941550de	StreamCopyThread: flush cannot interrupt a write Because flush calls interrupt with writeLock held, it cannot interrupt a write. Simplify by no longer defending against that. Change-Id: Ib0b39b425335ff7b0ea1b1733562da5392576a15	2016-11-13 13:35:16 -08:00
Jonathan Nieder	97f3baa0d3	StreamCopyThread: Remove unnecessary flushCount StreamCopyThread#run consistently interrupts itself whenever it discovers it has been interrupted by StreamCopyThread#flush while not reading. The flushCount is not needed to avoid lost flushes. All in-tree users of StreamCopyThread never flush. As a nice side benefit, this avoids the expense of atomic operations that have no purpose for those users. Change-Id: I1afe415cd09a67f1891c3baf712a9003ad553062	2016-11-13 13:32:08 -08:00
Shawn Pearce	6aa126ec42	Merge "Switch JSchSession to simple isolated OutputStream"	2016-11-13 16:13:04 -05:00
Hugo Arès	dea47b9363	Get rid of SoftReference in RepositoryCache Now that RepositoryCache has a time based eviction strategy, get rid of the strategy to evict cache entries if heap memory is running low, i.e. soft references. The main reason why time based eviction was implemented was to offer an alternative to the unpredictable soft references. Relying on soft references is not working, especially with a large heap. The JVM GC will consider collecting soft references as a last resort before throwing an out of memory error. For example, for an application like Gerrit configured with a 128GB heap, GC will wait until all 128GB is filled before collecting the soft references, so the application will already have been suffering long pauses caused by GC for a long time. In other words, you will have to restart the application because it's unusable before JVM eviction kicks in. Keeping the SoftReference in RepositoryCache is causing more harm than good. If you use the time based eviction (which is the default strategy) and want to tune the JVM to release soft references more aggressively, it will release repositories from the cache even though they are not expired, which defeats the purpose of the repository cache. Gerrit uses the Lucene library, which uses soft references, and this is causing a "memory leak" unless you configure the JVM to release soft references more aggressively, which has the nasty side effect of evicting non expired repositories from the cache. Change-Id: I9940bd800464c7f007696d0ccde52ea617b2ebce Signed-off-by: Hugo Arès <hugo.ares@ericsson.com>	2016-11-13 16:03:02 -04:00
Shawn Pearce	659cd813a9	Switch JSchSession to simple isolated OutputStream Work around issues with JSch not handling interrupts by isolating the JSch interactions onto another thread. Run write and flush on a single threaded Executor using simple Callable operations wrapping the method calls, waiting on the future to determine the outcome before allowing the caller to continue. If any operation was interrupted the state of the stream becomes fuzzy at close time. The implementation tries to interrupt the pending write or flush, but this is very likely to corrupt the stream object, so exceptions are ignored during such a dirty close. Change-Id: I42e3ba3d8c35a2e40aad340580037ebefbb99b53	2016-11-13 11:02:29 -08:00
Shawn Pearce	92eab1867d	WalkEncryption: Cleanup Java 8 support Java 8 is now the minimum for JGit, so Java 7 only paths are not necessary. Change-Id: I0151625fed4d0da95321ebed5cca648b8c29d5f1	2016-11-13 12:17:20 -04:00
Philipp Marx	df6f2d6860	Reduce synchronized scope around ConcurrentHashMap Change-Id: I982a78070efb6bc2d3395330456d62e0d5ce6da7 Signed-off-by: Philipp Marx <smigfu@googlemail.com>	2016-11-12 11:11:19 +01:00
Philipp Marx	8adbfe4da6	Check that DfsBlockCache#blockSize is a power of 2 In case a value is used which isn’t a power of 2 there will be a high chance of java.lang.ArrayIndexOutOfBoundsException and org.eclipse.jgit.errors.CorruptObjectException due to a mismatching assumption for the DfsBlockCache#blockSizeShift parameter. Change-Id: Ib348b3704edf10b5f93a3ffab4fa6f09cbbae231 Signed-off-by: Philipp Marx <smigfu@googlemail.com>	2016-11-11 10:43:09 +01:00
Matthias Sohn	f8ac03459a	Fix loop in auto gc * GC.tooManyLooseObjects() always responded true since the loop failed to advance the iterator so it always incremented until the threshold was exceeded. * Also fix loop exit criterion which was off by 1. * Add some tests. Change-Id: I70976dfaa026efbcf3c46bd45941f37277a18e04 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-11-07 22:31:10 +01:00
Jonathan Nieder	881e6b2cbb	StreamCopyThread: Do not drop data when flush is observed before writing StreamCopyThread.flush was introduced in 61645b938bc934fda3b0624c5bac1e3495634750 (Add timeouts to smart transport clients, 2009-06-19) to support timeouts on write in JSch. The commit message from that change explains: JSch made a timeout on write difficult because they explicitly do a catch for InterruptedException inside of their OutputStream. We have to work around that by creating an additional thread that just shuttles data between our own OutputStream and the real JSch stream. The code that runs on that thread is structured as follows: while (!done) { int n = src.read(buf); dst.write(buf, 0, n); } with src being a PipedInputStream representing the data to be written to JSch. To add flush support, that change wanted to add an extra step if (wantFlush) dst.flush(); but to handle the case where the thread is blocked in the read() call waiting for new input, it needs to interrupt the read. So that is how it works: the caller runs pipeOut.write(some data); pipeOut.flush(); copyThread.flush(); to write some data and force it to flush by interrupting the read. After the pipeOut.flush(), the StreamCopyThread reads the data that was written and prepares to copy it out. If the copyThread.flush() call interrupts the copyThread before it acquires writeLock and starts writing, we throw away the data we just read to fulfill the flush. Oops. Noticed during the review of `e67d59df3f` (StreamCopyThread: Do not let flush interrupt a write, 2016-11-04), which introduced this bug. Change-Id: I4aceb5610e1bfb251046097adf46bca54bc1d998	2016-11-04 19:33:02 -04:00
Jonathan Nieder	e67d59df3f	StreamCopyThread: Do not let flush interrupt a write flush calls interrupt() to interrupt a pending read and trigger a flush. Unfortunately that interrupt() call can also interrupt a pending write, putting JSch in a bad state and triggering "Short read of block" errors. Add locking to ensure the flush only interrupts reads as intended. Change-Id: Ib105d9e107ae43549ced7e6da29c22ee41cde9d8	2016-11-04 13:00:08 -07:00
Zhen Chen	feefcb02b0	Fix flush call race condition in StreamCopyThread If there was a new flush() call while flushing previous bytes, we need to catch it in order to process the new bytes between the two flush() calls instead of going to the last catch IOException clause and ending the thread. Change-Id: Ibc58a1fa97559238c13590aedbb85e482d85e465 Signed-off-by: Zhen Chen <czhen@google.com>	2016-10-31 14:31:48 -07:00
Thomas Wolf	d0023c3c8f	Don't serialize internal hash collision chain link ObjectId is serializable, and so are its subtypes. Ensure that serialization does not follow the hash collision chain internal to the ObjectIdOwnerMap, otherwise completely unrelated objects may get serialized when a RevObject is serialized. Note that serializing a RevCommit or RevTag may serialize quite a few objects due to the parent/object links they contain. A user has no real control over how many objects will be written when a RevCommit is serialized. Cf. [1]. This change does not resolve that, but in any case this internal hash collision chain link should not participate in serialization. [1] https://github.com/gitblit/gitblit/pull/1141 Change-Id: Ice331a9dc80a59ca360fcc04adaff8b5e750d847 Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	2016-10-29 11:39:36 +02:00
Matthias Sohn	d1bc809cce	Add missing @since tag for new protected field in ObjectReader Change-Id: I93d67d7fd2fde55be39480944d9d7072dbb6c600 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-24 16:05:24 +02:00
Kevin Corcoran	fa0a93119c	Make streamFileThreshold configurable Previously, the streamFileThreshold, the threshold at which a file would be streamed rather than loaded entirely into memory, was only configurable on a global basis. This commit makes this threshold configurable on a per-loader basis. Bug: 490404 Change-Id: I492c18c3155dbf56eedda9044a61d76120fd75f9 Signed-off-by: Kevin Corcoran <kevin.corcoran@puppetlabs.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-24 14:00:02 +09:00
Christian Halstrick	f30c05fc74	Move constants used for config-files to ConfigConstants Change-Id: I7d8db4bfa1a851afd599bb8eaa8f8273204d2e1d	2016-10-23 01:39:32 +02:00
Matthias Sohn	64a404803e	Implement auto gc With the auto option, gc checks whether any housekeeping is required; if not, it exits without performing any work. Some JGit commands run gc --auto after performing operations that could create many loose objects. Housekeeping is required if there are too many loose objects or too many packs in the repository. If the number of loose objects exceeds the value of the gc.auto option jgit's GC consolidates all existing packs into a single pack (equivalent to -A option), whereas git-core would combine all loose objects into a single pack using repack -d -l. Setting the value of gc.auto to 0 disables automatic packing of loose objects. If the number of packs exceeds the value of gc.autoPackLimit, then existing packs (except those marked with a .keep file) are consolidated into a single pack by using the -A option of repack. Setting gc.autoPackLimit to 0 disables automatic consolidation of packs. Like git the following jgit commands run auto gc: - fetch - merge - rebase - receive-pack The auto gc for receive-pack can be suppressed by setting the config option receive.autogc = false Change-Id: I68a2a051b39ec2c53cb7c4b8f6c596ba65eeba5d Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-23 01:34:31 +02:00
David Pursehouse	03046d0f60	CheckoutCommand: Add method to add multiple paths The new method addPaths(List<String>) allows callers to add multiple paths without having to iterate over several calls to addPath(String). Change-Id: I2c3746a97ead7118fb0ed5543a2c843224719031 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-22 10:11:54 +09:00
Marc Strapetz	c6459a6167	Fix possible SIOOBE in RefDirectory.parsePackedRefs This SIOOBE happens reproducibly when trying to access a repository containing Cygwin symlinks Change-Id: I25f103fcc723bac7bfaaeee333a86f11627a92c7 Signed-off-by: Marc Strapetz <marc.strapetz@syntevo.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-21 18:13:04 +09:00
Thomas Meyer	4ab06388ad	TransportBundleFile: Resolve remote repository locally Remove the assumption that the local repository is a file based one. Change-Id: I8f10fe7a54e9fc07f2a23d7901e52b65aa570d45 Signed-off-by: Thomas Meyer <thomas.mey@web.de> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-21 00:24:52 +02:00
David Turner	e346873511	TreeFormatter: disallow empty filenames in trees Git barfs on these (and they don't make any sense), so we certainly shouldn't write them. Change-Id: I3faf8554a05f0fd147be2e63fbe55987d3f88099 Signed-off-by: David Turner <dturner@twosigma.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-19 22:24:24 +09:00
Philipp Marx	ccc899773e	Add "concurrencyLevel" option to DfsBlockCache Allow for higher concurrency on DfsBlockCache by adding a configuration for number of estimated concurrent requests. Change-Id: Ia65e58ecb2c459b6d9c9697a2f715d933270f7e6 Signed-off-by: Philipp Marx <smigfu@googlemail.com>	2016-10-19 21:45:30 +09:00
David Pursehouse	a963273d85	Merge branch 'stable-4.5' * stable-4.5: Config: do not add spaces before units Change-Id: I54185f54e6d78d7aac873ee5f990f09582318857 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-19 11:45:11 +09:00
David Turner	a66b4c29a8	Config: do not add spaces before units Adding a space before the unit ('g', 'm', 'k') causes git to fail with the error: fatal: bad numeric config value Change-Id: I57f11d3a1cdcca4549858e773af1a2a80fc0369f Signed-off-by: David Turner <dturner@twosigma.com> Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-19 10:58:52 +09:00
David Pursehouse	bdf3e43d76	FS: Fix lazy initialization of non-volatile static field The 'factory' field is lazy initialized in the detect() method. According to FindBugs: Because the compiler or processor may reorder instructions, threads are not guaranteed to see a completely initialized object, if the method can be called by multiple threads. Fix this by declaring the member as 'volatile'. Change-Id: Ib32663bb28c9564584256e01f625b4e7875e6223 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-18 14:15:53 +09:00
David Pursehouse	e9107e853f	PackOutputStream: Add comment for intentional use of non-short-circuit logic To avoid that people try to "fix" it. Change-Id: Ib4b35e357e4c068a17243ebd2d57b058c54d5834 Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-10-18 14:15:53 +09:00
Matthias Sohn	ba7ba7a816	Merge branch 'stable-4.5' * stable-4.5: Unconditionally close repositories in RepositoryCache.clear() Fix eviction of repositories with negative usage count Adapt to parameter removed from RepositoryCache.unregisterAndCloseRepository(). Change-Id: I7087667056ced401a3b3a027977f2715cd77a1c5 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-14 00:24:00 +02:00
Matthias Sohn	535f0afd13	Unconditionally close repositories in RepositoryCache.clear() Earlier we tried to close the repository before removing it from the cache, so close only reduced refcount but didn't close it. Now that we no longer leak usage count on purpose and the usage count is now ignored anyway, there is no longer a need to run the removal twice. Change-Id: I8b62cec6d8a3e88c096d1f37a1f7f5a5066c90a0 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-13 23:39:12 +02:00
Matthias Sohn	cc0d58f9a0	AmazonS3: ensure that errorStream is closed Change-Id: I2abde5dbd4b785d70b7bc0b77188c0a075130eeb Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-12 15:11:52 +02:00
Matthias Sohn	59212bbb47	Merge "Add toString() to CherryPickCommand and ResetCommand"	2016-10-12 08:38:33 -04:00
Hugo Arès	5a1e147eaa	Fix eviction of repositories with negative usage count If the repository close method was called twice (or more) for one open, the usage count became negative and the repository was never evicted from the cache because the method checking if repository is expired was not considering negative usage count. Change-Id: I18a80c415c54c37d1b9def2b311ff2d0afa455ca Signed-off-by: Hugo Arès <hugo.ares@ericsson.com>	2016-10-12 07:07:03 -04:00
Thomas Wolf	26832a00e4	Fix symlink content comparison on MacOS in tree walk Symlinks on MacOS are written as UTF-8 NFD, but readSymbolicLink().toString() converts to NFC with potentially fewer bytes. May occur in particular if the link target has non-ASCII characters for which the NFC and NFD encodings differ. This may lead to an EOFException: Short read of block. This causes all kinds of weird effects in EGit, ranging from failing rebases (which report the exception to the user) to EGit decorations in the navigator silently disappearing (and never coming back). * Rename readContentAsNormalizedString() to readSymlinkTarget() as it's called only for symlinks. Also make it protected. * Fix by allowing the read to succeed even if less than the expected number of bytes are returned by the entry's input stream. * Override in FileTreeIterator to use fs.readSymlink() directly. Includes a new MacOS-only test. Change-Id: I264c5972d67b1cbb1ed690580f5706e671b9affd Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-11 00:20:23 +02:00
Christian Halstrick	295e5a4f1e	Merge "Fix CheckoutCommand to return updated files even on NONDELETED status"	2016-10-10 10:19:09 -04:00
Matthias Sohn	f6df5cf26e	Add toString() to CherryPickCommand and ResetCommand Change-Id: Ie2dd87943350e3b0a2df72a70e5219139d110a8f Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-10-09 23:46:39 +02:00
Zhen Chen	0212a623f8	Delay inserter.flush in WalkFetchConnection Spawn an ObjectReader from the ObjectInserter, so the flush can be delayed at the end of the fetch. Change-Id: I35fe8c8370c06c25262645202aec2b1318057c19 Signed-off-by: Zhen Chen <czhen@google.com>	2016-10-02 00:41:33 +02:00
Rüdiger Herrmann	1bafc304c8	Remove unused code from class Repository The package-private method Repository::gitInternalSlash() is not referenced from anywhere within the package. Last uses were removed with `0f8743d4` "Remove deprecated Tree, TreeEntry, FileTreeEntry and friends" `6e9fdce9` "Kill GitIndex" Signed-off-by: Rüdiger Herrmann <ruediger.herrmann@gmx.de> Change-Id: I514bf684ad0da808f6523e9e46db9674a25e1fb5	2016-09-30 15:27:42 +02:00
Christian Halstrick	83e43f7960	Fix CheckoutCommand to return updated files even on NONDELETED status CheckoutCommand was not returning updated and removed files in case of an overall status of NONDELETED. That's a status which occurs especially on the Windows platform when Checkout wanted to delete files but the filesystem doesn't allow this. The situation is rarer on linux/mac because open filehandles don't stop a deletion attempt and checkout succeeds more often. Change-Id: I4828008e58c09bd8f9edaf0f7eda0a79c629fb57	2016-09-27 15:58:24 +02:00
Matthias Sohn	8ed16fa100	Merge branch 'stable-4.5' * stable-4.5: Fix carrying over flags during a RevWalk Change-Id: Ibf4573c5664271dfa7a6ecc3ede6eaad749f89d8 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-24 10:51:14 +02:00
Shawn Pearce	6257e27d49	Merge "Fix carrying over flags during a RevWalk" into stable-4.5	2016-09-24 01:16:49 -04:00
Christian Halstrick	dd585b779f	Fix carrying over flags during a RevWalk There was a bug when carrying over flags from a merge commit to its non-first parents. The first parent of a merge commit was handled differently and correctly but the non-first parents are handled by a recursive algorithm. Flags should be copied from the root merge commit to parent-2, to grandparent-2, ... up to the limit of STACK_DEPTH==500 parents-levels. But the recursive algorithm was always copying only to the direct parents of the merge commit and not the grand*-parents. This seems to be no problem when commits are handled in a strict date order because then copying only one level is no problem if children are handled before parents. But when commits are not separated anymore by distinctive correct dates (e.g. because all commits have the same date) then it may happen that a merge-parent is handled before the merge commit and when dealing later with the merge commit one has to copy flags down to more than one level Bug: 501211 Change-Id: I2d79a7cf1e3bce21a490905ccd9d5e502d7b8421	2016-09-23 11:15:08 +02:00
David Pursehouse	64de9beadb	RepositoryCache#unregisterAndCloseRepository: Remove unused db parameter Change-Id: Ibee48cba85089324dc8db45066d311ad5db4f3ee Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-09-23 09:59:17 +02:00
David Pursehouse	93c788952e	DfsBlockCache: Suppress warnings about unchecked conversion of Ref<T> There are already suppressions for the same warnings in other parts of this class. Change-Id: Ic3b45525c6c8200cba975d14c7650cedb4409a4d Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-09-22 09:46:27 +02:00
David Pursehouse	01935f8f50	Suppress deprecation warnings related to UploadPackLogger UploadPackLogger is deprecated but will not be removed until JGit version 5.0. Suppress the unavoidable deprecation warnings on usages of the interface that are kept for backwards compatibility. Add a TODO so that we don't forget to remove it in 5.0. Change-Id: Id248002b9bdf23db192427196d54c722a012106c Signed-off-by: David Pursehouse <david.pursehouse@gmail.com>	2016-09-22 09:46:25 +02:00
Matthias Sohn	b03be9a3ea	Fix wrong @since tags introduced by `1beb3ccd` `1beb3ccd1e` was submitted in 4.6.0-SNAPSHOT Change-Id: I33e027dbd0995ba36491bd9e2d4b3c84f9f0abd6 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-21 23:37:18 +02:00
Matthias Sohn	1b11f4fdda	Merge branch 'stable-4.5' * stable-4.5: Turn off doclint also during Maven site generation Prepare 4.5.1-SNAPSHOT builds JGit v4.5.0.201609210915-r Unconditionally close repository in unregisterAndCloseRepository Change-Id: Ibfd11669cd74d2e62b014c18fd39b646b200c8c5 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-21 18:08:00 +02:00
Matthias Sohn	cb617e7ffc	Prepare 4.5.1-SNAPSHOT builds Change-Id: I3305e8a09a3fb06f25a316cff2bdbb551d3ade68 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-21 17:05:27 +02:00
Matthias Sohn	e7142a3937	JGit v4.5.0.201609210915-r Change-Id: Idc02a1a1d74f84605d764c239803f0cfbad94eb7 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-21 15:06:18 +02:00
Saša Živkov	b7df7d1fd3	Unconditionally close repository in unregisterAndCloseRepository Repository.close() method is used when reference counting and expiration needs to be honored. The RepositoryCache.unregisterAndCloseRepository method should close the repository unconditionally. This is also indicated from its javadoc. Change-Id: I19392d1eaa17f27ae44b55eea49dcff05a52f298	2016-09-21 13:22:51 +02:00
Shawn Pearce	6cd6ccb65a	Merge "Add ReceivedPackStatistics for PackParser"	2016-09-20 15:29:27 -04:00
Matthias Sohn	ab79ce91c1	Change JGit minimum execution environment to JavaSE-1.8 Bug: 500059 Change-Id: I47f3f6749a67da52029f84e002d9b155ed56d2b7 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-20 11:32:36 +02:00
Christian Halstrick	e4e39a6d05	Move AtomicObjectOutputStream to lfs/internal The class AtomicObjectOutputStream should be available to all lfs related classes, not only to the server side. Move the class from org.eclipse.jgit.lfs.server.fs to org.eclipse.jgit.lfs.internal to achieve that. Change-Id: I028e1c9ec7c21f316340b21d558b9a6b77e2060d	2016-09-20 10:58:20 +02:00
Christian Halstrick	bcb5a431a5	Add built-in LFS smudge filter for local case Adds a JGit built-in implementation of the "git lfs smudge" filter. This filter should do the same as the one described in [1] besides that it only supports the local case when the lfs objects are already present in the media directory. Remote cases where download of LFS objects from an LFS server is needed will be done in a later commit. [1] https://github.com/github/git-lfs/blob/master/docs/man/git-lfs-smudge.1.ronn Change-Id: I8ff661d4edd3667ef7f86f3b4fa33e568eb4c8f4	2016-09-20 10:58:05 +02:00
Christian Halstrick	b70f3a7457	Add configuration parameter to enable built-in hooks/filters If the configuration parameter filter.<filterDriverName>.useJGitBuiltin is set to true then for all corresponding filters JGit will try to execute the built-in filter instead of the filter-command which is defined in git configuration. It will fall back to the non-built-in filters if no built-in filters are registered or if constructing them leads to exceptions. If set to false JGit will not try to execute built-in filters for the specified filter driver. Example: The configuration contains the following lines [filter "lfs"] clean = git-lfs clean -- %f smudge = git-lfs smudge -- %f useJGitBuiltin = true Additionally the .gitattributes file in the root of the working tree contains: *.bin filter=lfs In this case when new content is added similar to "git add 1.bin" then the following will happen: - jgit will check whether a built-in command factory was registered for the command "jgit://builtin/lfs/clean". If that is true the factory is used to create a built-in filter command and that command is used to filter the content - Otherwise jgit will call the external program "git lfs clean ..." to do the filtering Change-Id: Idadb1db06b1e89e7031d7ed6319904973c367d38	2016-09-20 10:06:13 +02:00
Christian Halstrick	d97248467a	Add support for built-in smudge filters JGit supports smudge filters defined in repository configuration. The filters are implemented as external programs filtering content by accepting the original content (as seen in git's object database) on stdin and which emit the filtered content on stdout. This content is then written to the file in the working tree. To run such a filter JGit has to start an external process and pump data into/from this process. This commit adds support for built-in smudge filters which are implemented in Java and which are executed by jgit's main thread. When a filter is defined in the configuration as "jgit://builtin/<filterDriverName>/smudge" then JGit will lookup in a static map whether a builtin filter is registered under this name. If found such a filter is called to do the filtering. The functionality in this commit requires that a program using JGit explicitly calls the JGit API to register built-in implementations for specific smudge filters. In follow-up commits configuration parameters will be added which trigger such registrations. Change-Id: Ia743aa0dbed795e71e5792f35ae55660e0eb3c24	2016-09-20 10:02:20 +02:00
Christian Halstrick	b5bec73715	Add support for built-in clean filters JGit supports clean filters defined in repository configuration. The filters are implemented as external programs filtering content by accepting the original content (as seen in the working tree) on stdin and which emit the filtered content on stdout. To run such a filter JGit has to start an external process and pump data into/from this process. This commit adds support for clean filters which are implemented in Java and which are executed by jgit's main thread. When a filter is defined in the configuration as "jgit://builtin/<filterDriverName>/clean" then JGit will lookup in a static map whether a filter is registered under this name. If found such a filter is called to do the filtering. The functionality in this commit requires that a program using JGit explicitly calls the JGit API to register built-in implementations for specific clean filters. In follow-up commits configuration parameters will be added which trigger such registrations. Other commits will add implementations for lfs filters. Change-Id: I0344d3c54801c9a46e5a606c5df17e5f2e17b2be	2016-09-20 09:57:48 +02:00
Masaya Suzuki	1beb3ccd1e	Add ReceivedPackStatistics for PackParser This is like PackStatistics, but for PackParser. Change-Id: I854215c0956fd0b36843d631780be303e021b8be Signed-off-by: Masaya Suzuki <masayasuzuki@google.com>	2016-09-19 15:53:50 -07:00
Matthias Sohn	9d4ffcc1ab	Prepare 4.6.0-SNAPSHOT builds Change-Id: Id2eafc331ee32c332c2a9b867b05c260beb0d10f Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2016-09-19 17:06:42 +02:00
Thomas Wolf	aadbb158e1	Handle all values of branch.[name].rebase BranchConfig treated this config property as a boolean, but git also allows the values "preserve" and "interactive". Config property pull.rebase also allows the same values. Replace private enum PullCommand.PullRebaseMode by new public enum BranchConfig.BranchRebaseMode and adapt all uses. Add a new setter to PullCommand. Note: PullCommand will treat "interactive" like "true", i.e., as a non-interactive rebase. Not sure how "interactive" should be handled. At least it won't balk on it. Bug: 499482 Change-Id: I7309360f5662b2c2efa1bd8ea6f112c63cf064af Signed-off-by: Thomas Wolf <thomas.wolf@paranor.ch>	2016-09-15 00:44:23 +02:00
Andrey Loskutov	619329c84e	Ignore trailing spaces in directory rule patterns Bug: 500967 Change-Id: I7fabc2654af97011c62f46d5c30ee992341e45e2 Signed-off-by: Andrey Loskutov <loskutov@gmx.de>	2016-09-14 07:43:52 -04:00
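
The "Implement auto gc" commit (64a404803e) listed above describes when housekeeping is considered necessary: the loose-object count exceeds gc.auto, or the pack count exceeds gc.autoPackLimit, with a value of 0 disabling the respective check. The following is a minimal sketch of that decision only, not JGit's implementation. The class name `AutoGcSketch`, its method names, and the sample limits 6700 and 50 are assumptions chosen for illustration, and the counts are passed in as plain integers rather than read from a repository.

```java
// Hypothetical illustration of the auto gc threshold rule from the
// commit message; none of these names come from JGit itself.
final class AutoGcSketch {

    // A limit of 0 disables the check, as the commit message states.
    // Treating negative limits as disabled too is an assumption.
    static boolean exceeds(int count, int limit) {
        return limit > 0 && count > limit;
    }

    // Housekeeping is required if there are too many loose objects
    // or too many packs.
    static boolean needsGc(int looseObjects, int packs,
                           int gcAuto, int gcAutoPackLimit) {
        return exceeds(looseObjects, gcAuto)
                || exceeds(packs, gcAutoPackLimit);
    }

    public static void main(String[] args) {
        // Sample limits, assumed for the example.
        int gcAuto = 6700;
        int gcAutoPackLimit = 50;

        System.out.println(needsGc(7000, 1, gcAuto, gcAutoPackLimit));
        System.out.println(needsGc(100, 51, gcAuto, gcAutoPackLimit));
        System.out.println(needsGc(100, 1, gcAuto, gcAutoPackLimit));
        // With both limits set to 0, neither check can trigger.
        System.out.println(needsGc(7000, 51, 0, 0));
    }
}
```

The comparison is strict ("exceeds"), so a count equal to the limit does not trigger housekeeping in this sketch.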

3877 Commits