motiejus/jgit - jgit - gitea: Gitea Service

motiejus

jgit

Author	SHA1	Message	Date
Shawn O. Pearce	d1e47df0da	ObjectIdSubclassMap: Micro-optimize wrapping at end of table During a review of the class, Josh Bloch pointed out we can use "i = (i + 1) & mask" to wrap around at the end of the table, instead of a conditional with a branch. This is generally faster due to one less branch that will be mis-predicted by the CPU. Change-Id: Ic88c00455ebc6adde9708563a6ad4d0377442bba Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-10 10:09:58 -08:00
Shawn O. Pearce	42f0b11153	Merge changes I0d797533,I128522af,I6dd076eb,Ief6f81b9,I83d01e5c * changes: ObjectIdSubclassMap: Avoid field loads in inner loops ObjectIdSubclassMap: Manually inline index() ObjectIdSubclassMap: Change initial size to 2048 ObjectIdSubclassMap: Grow before insertions ObjectIdSubclassMap: Use & rather than % for hashing	2011-03-10 13:02:59 -05:00
Shawn Pearce	09d2b9f0ed	Merge "Cache gitPrefix in FS_Win32"	2011-03-10 13:02:24 -05:00
Matthias Sohn	561aa98041	Fix Bundle-Version of jgit source bundle Bug: 339033 Change-Id: Idaf965cb684d5ed3f3634b0f3d256c92182d7c58 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-03-10 14:25:17 +01:00
Marc Strapetz	6f066dec02	Cache gitPrefix in FS_Win32 readPipe() may consume rather much time, so gitPrefix should be cached. If the git executable changes, users should run FS.detect() again to get a new instance of FS_Win32.	2011-03-10 13:17:57 +01:00
Shawn O. Pearce	c11756ca4e	ObjectIdSubclassMap: Avoid field loads in inner loops Ensure the JIT knows the table cannot be changed during the critical inner loop of get() or insert() by loading the field into a final local variable. This shouldn't be necessary, but the instance member is declared non-final (to resizing) and it is not very obvious to the JIT that the table cannot be modified by AnyObjectId.equals(). Simplify the JIT's decision making by making it obvious, these values cannot change during the critical inner loop, allowing for better register allocation. Change-Id: I0d797533fc5327366f1207b0937c406f02cdaab3 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 16:55:10 -08:00
Shawn O. Pearce	df7b192e26	ObjectIdSubclassMap: Manually inline index() This method is trivial in definition, and is called in only 3 places. Inline the method manually to ensure its really going to be inlined by the JIT at runtime. Change-Id: I128522af8167c07d2de6cc210573599038871dda Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 16:55:09 -08:00
Shawn O. Pearce	16350bf9e4	ObjectIdSubclassMap: Change initial size to 2048 32 is way to small for the map. Most applications using the map will need to load more than 16 objects just from the root refs being read from the Repository. Default the initial size to 2048. This cuts out 6 expansions in the early life of the table, reducing garbage and rehashing time. Change-Id: I6dd076ebc0b284f1755855d383b79535604ac547 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 14:34:27 -08:00
Shawn O. Pearce	da548dfd2b	ObjectIdSubclassMap: Grow before insertions If the table needs to be grown, do it before the current insertion rather than after. This is a tiny micro-optimization that allows the compiler to reuse the result of "++size" to compare against previously pre-computed size at which the table should rehash itself. Change-Id: Ief6f81b91c10ed433d67e0182f558ca70d58a2b0 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 14:32:43 -08:00
Shawn O. Pearce	47c2a3a98d	ObjectIdSubclassMap: Use & rather than % for hashing Bitwise and is faster than integer modulus operations, and since the table size is always a power of 2, this is simple to use for index operation. Change-Id: I83d01e5c74fd9e910c633a98ea6f90b59092ba29 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 14:30:43 -08:00
Shawn O. Pearce	ff6ac0aaef	ObjectIdSubclassMap: Fix non-standard naming conventions obj_hash doesn't match our naming conventions, camelCaseNames are the preferred format. Change-Id: I72da199daccb60a98d17b6af1e498189bf149515 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-09 14:28:14 -08:00
Jesse Greenwald	c68aba2a48	Fixed ordering of Config.getSubsections(...) A standard HashSet was being used to store the list of subsections as they were being parsed. This was changed to use a LinkedHashSet so that iterating over the set would return values in the same order as they are listed in the config file. Change-Id: I4251f95b8fe0ad59b07ff563c9ebb468f996c37d	2011-03-09 10:00:24 -08:00
Matthias Sohn	c7e9f013b7	[findbugs] ProgressReportingFilter can be a static inner class Change-Id: I628b1f25f04c9297655d5ac451ae5a133db53896 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-03-09 00:05:36 +01:00
Matthias Sohn	a02e8e3d26	[findbugs] Avoid futile attempt to change max pool size Javadoc for ScheduledThreadPoolExecutor says [1]: While ScheduledThreadPoolExecutor inherits from ThreadPoolExecutor, a few of the inherited tuning methods are not useful for it. In particular, because it acts as a fixed-sized pool using corePoolSize threads and an unbounded queue, adjustments to maximumPoolSize have no useful effect. [1] http://download.oracle.com/javase/6/docs/api/java/util/concurrent/ScheduledThreadPoolExecutor.html Change-Id: I8eccb7d6544aa6e27f5fa064c19dddb2a706523f Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-03-08 23:41:47 +01:00
Shawn O. Pearce	f67e5602af	PackWriter: Reduce GC during enumeration Instead of resizing an ArrayList until all objects have been added, append objects into a specialized List type that uses small arrays of 1024 entries for each 1024 objects added. For a large repository like linux-2.6, PackWriter will now allocate 1,758 smaller arrays to hold the object list, without creating any garbage from the intermediate states due to list expansion. 1024 was chosen as the block size (and initial directory size) as this is a reasonable balance for the PackWriter code. Each block uses approximately 4096 bytes in a 32 bit JVM, as does the default top level block directory. The top level directory doesn't expand until 1 million items have been added to the list, which for linux-2.6 won't yet occur as the lists are per-object-type and are thus bounded to about 1/3 of 1.8 million. Change-Id: If9e4092eb502394c5d3d044b58cf49952772f6d6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 18:23:39 -08:00
Shawn O. Pearce	ef822fe3cd	Remove deprecated Repository.mapTree The mapTree() routines have been deprecated for a long time, and their sibilings for mapCommit() and mapTag() were already removed from the main Repository API. Remove mapTree(). Application callers who only need the tree's name can use resolve("^{tree}") syntax to resolve to the tree ObjectId, or fail if the input is not a tree. Applications that want to read a tree should use DirCache or TreeWalk. Change-Id: I85726413790fc87721271c482f6636f81baf8b82 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:36:51 -08:00
Shawn O. Pearce	1f57061684	Remove deprecated TreeVisitor This type and its associated methods has been deprecated for a while now. Time to remove it. Applications can use a TreeWalk instead to access the elements of any tree-like object. Change-Id: I047e552ac77b77e2de086f63cb4fb318da57c208 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:30:04 -08:00
Shawn O. Pearce	6c3badea7a	Remove deprecated TreeIterator This interface has been deprecated for a while now. Applications can use a TreeWalk instead. Change-Id: I751d6e919e4b501c36fc36e5f816b8a8c5379cb9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:26:51 -08:00
Shawn O. Pearce	5ecc6e32cd	Remove deprecated IndexTreeVisitor This has been deprecated for some time now. Applications should instead use DirCache within a TreeWalk. Change-Id: I8099d93f07139c33fe09bdeef8d739782397da17 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:23:15 -08:00
Shawn O. Pearce	c88d34b067	Remove deprecated WriteTree This class has been deprecated for a long time now. Time to remove it. Applications can use the newer DirCache.writeTree() as a replacement. Change-Id: I91dc9507668d8a3ecadd6acd4f1c8b7bd7760cc3 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:20:44 -08:00
Shawn O. Pearce	d72b932853	Remove deprecated WorkDirCheckout This class has been deprecated for a long time now. Time to remove it. Applications can use the newer DirCacheCheckout class as a replacement. Change-Id: Id66d29fcca5a7286b8f8838303d83f40898918d2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:18:12 -08:00
Shawn O. Pearce	9013e9e993	Remove deprecated Treeish interface This interface has been deprecated for a long time now. Time to remove it. Change-Id: I29a938657e4637b2a9d0561940b38d70866613f7 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-07 12:12:20 -08:00
Tomasz Zarna	cda64073fd	Allow to amend a commit with CommitCommand Bug: 339088 Change-Id: I57dc727688c4bb6968ac076b176661c857c05afa Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-03-07 10:19:20 -06:00
Shawn O. Pearce	4e187d898a	PackFile: Fix copy as-is for small objects When I disabled validation I broke the code that handled copying small objects whose contents were below 8192 bytes in size but spanned over the end of one window and into the next window. These objects did not ever populate the temporary write buffer, resulting in garbage writing into the output stream instead of valid object contents. Change-Id: Ie26a2aaa885d0eee4888a9b12c222040ee4a8562 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-04 18:56:16 -08:00
Shawn Pearce	b9df3e6019	Merge "Fix DirCache re-read."	2011-03-04 10:19:06 -05:00
Shawn O. Pearce	a78b79cc30	Don't auto follow non-annotated tags in fetch When fetch TagOpt is AUTO_FOLLOW do not follow refs/tags/ names that point directly to commits which are on unreleated side branches. Change-Id: Iea6eee5a05ae7402a7f256fd9c1e3d3b5ccb58dd Reported-by: Slawomir Ginter <sginter@atlassian.com> Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-04 07:14:50 -08:00
Robin Rosenberg	3947bd25d9	Fix DirCache re-read. During unit tests and most likely elsewhere, updates come too fast for a simple timestamp comparison (with one seconds resolution) to work. I.e. DirCache thinks it hasn't changed. Use FileSnapshot instead which has more advanced logic. Change-Id: Ib850f84398ef7d4b8a8a6f5a0ae6963e37f2b470 Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com>	2011-03-04 16:00:25 +01:00
Shawn O. Pearce	b21c82fdb0	resolve(): Fix wrong parsing of branch "foo-gbed2-dev" When parsing a string such as "foo-gbed2" resolve() was assuming the suffix was from git describe output. This lead to JGit trying to find the completion for the object abbreviation "bed2", rather than using the current value of the reference. If there was only one such object in the repository, JGit might actually use the wrong value here, as resolve() would return the completion of the abbreviation "bed2" rather than the current value of the reference "refs/heads/foo-gbed2". Move the parsing of git describe abbreviations out of the operator portion of the resolve() method and into the simple portion that is supposed to handle only object ids or reference names, and only do the describe parsing after all other approaches have already failed to provide a resolution. Add new unit tests to verify the behavior is as expected by users. Bug: 338839 Change-Id: I52054d7b89628700c730f9a4bd7743b16b9042a9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-03 16:17:29 -08:00
Shawn O. Pearce	3ee3588b86	RemoteRefUpdate: Accept Ref and ObjectId arguments for source Applications may already have a Ref or ObjectId on hand that they want the remote to be updated to. Instead of converting these into a String and relying on the parsing rules of resolve(), allow the application to supply the Ref or ObjectId directly. Bug: 338839 Change-Id: If5865ac9eb069de1c8f224090b6020fc422f9f12 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-03 15:13:33 -08:00
Shawn O. Pearce	a468cb57c2	PackWriter: Validate reused cached packs If object reuse validation is enabled, the output pack is going to probably be stored locally. When reusing an existing cached pack to save object enumeration costs, ensure the cached pack has not been corrupted by checking its SHA-1 trailer. If it has, writing will abort and the output pack won't be complete. This prevents anyone from trying to use the output pack, and catches corruption before it can be carried any further. Change-Id: If89d0d4e429d9f4c86f14de6c0020902705153e6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-02 12:49:00 -08:00
Shawn O. Pearce	1b2062fe37	PackWriter: Avoid CRC-32 validation when feeding IndexPack There is no need to validate the object contents during copyObjectAsIs if the result is going to be parsed by unpack-objects or index-pack. Both programs will compute the SHA-1 of the object, and also validate most of the pack structure. For git daemon like servers, this work is already done on the client end of the connection, so the server doesn't need to repeat that work itself. Disable object validation for the 3 transport cases where we know the remote side will handle object validation for us (push, bundle creation, and upload pack). This improves performance on the server side by reducing the work that must be done. Change-Id: Iabb78eec45898e4a17f7aab3fb94c004d8d69af6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-02 12:23:58 -08:00
Shawn O. Pearce	bd6853e90a	PackWriter: Position tags after commits Annotated tags need to be parsed by many viewing tools, but putting them at the end of the pack hurts because kernel prefetching might not have loaded them, since they are so far from the commits they reference. Position tags right behind the commits, but before the trees. Typically the annotated tag set for a repository is very small, so the extra prefetch burden it puts on tools that don't need annotated tags (but do need commits and trees) is fairly low. Change-Id: Ibbabdd94e7d563901c0309c79a496ee049cdec50 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-01 10:07:08 -08:00
Shawn O. Pearce	26dffbe04d	PackWriter: Refactor object writing loop This simple refactoring makes it easier to pre-process each of the object lists before its handed into the actual write routine. Change-Id: Iea95e5ecbc7374f6bcbb43d1c75285f4f564d09d Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-01 10:07:08 -08:00
Shawn O. Pearce	751c329b35	PackWriter: Don't reuse commit or tag deltas JGit doesn't generate deltas for commit or tag objects when it packs a repository from scratch. This is an explicit design decision that is (mostly) justified by the fact that these objects do not delta compress well. Annotated tags are made once on stable points of the project history, it is unlikely they will ever appear again with sufficient common text to justify using a delta over just deflating the raw content. JGit never tries to delta compress annotated tags and I take the stance that these are best stored as non-deltas given how frequently they might be accessed by repository viewers. Commits only have sufficient common text when they are cherry-picked to forward-port or back-port a change from one branch to another. Even in these cases the distance between the commits as returned by the log traversal has to be small enough that they would both appear in the delta search window at the same time in order to delta compress one of the messages against the other. JGit never tries to delta compress commits, as it requires a lot of CPU time but typically does not produce a smaller pack file. Avoid reusing deltas for either of these types when constructing a new pack. To avoid killing performance during serving of network clients, UploadPack disables this code change by allowing PackWriter to reuse delta commits. Repositories that were already repacked by C Git will not have their delta commits decompressed and recompressed on the fly during object writing, saving server-side CPU resources. Change-Id: I749407e7c5c677e05e4d054b40db7656cfa7fca8 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-01 10:07:08 -08:00
Shawn O. Pearce	67b064fc9f	PackWriter: Do not delta compress already packed objects This is a tiny optimization to how delta search works. Checking for isReuseAsIs() avoids doing delta compression search on non-delta objects already stored in packs within the repository. Such objects are not likely to be delta compressable, as they were already delta searched when their containing pack was generated and they were not delta compressed at that time. Doing delta compression now is unlikely to produce a different result, but would waste a lot of CPU. The isReuseAsIs() flag is checked before isDoNotDelta() because it is very common to reuse objects in the output pack. Most objects get reused, and only a handful have the isDoNotDelta() bit set. Moving the check earlier allows the loop to more quickly skip through objects that will never need to be considered. Change-Id: Ied757363f775058177fc1befb8ace20fe9759bac Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-01 10:07:08 -08:00
Shawn O. Pearce	bf1b970de1	Paper bag fix BatchingProgressMonitor alarm queue The alarm queue threads were started with an empty task body, which meant the thread started and terminated immediately, leaving the queue itself with no worker. Change-Id: I2a9b5fe9c2bdff4a5e0f7ec7ad41a54b41a4ddd6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-03-01 10:07:08 -08:00
Shawn O. Pearce	68ab451d39	ProgressMonitor: Refactor to use background alarms Instead of polling the system clock on every update(1) method call, use a scheduled executor to toggle a volatile once per second until the task is done. Check the volatile on each update(int), looking to see if output should occur. This limits progress output to either once per 1% complete, or once per second. To save time during update calls the timer isn't reset during each 1% of output, which means we may see one unnecessary output trigger if at least 1% completed during the one second of the alarm time. Change-Id: I8fdd7e31c37bef39a5d1b3da7105da0ef879eb84 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-28 19:36:56 -08:00
Matthias Sohn	2fba1e65e1	Fix NPE on checkout of remote tracking branch Checkout of remote tracking branch failed when no local branch existed. Also enhance RepositoryTestCase to enable checking index state of another test repository. Bug: 337695 Change-Id: Idf4c05bdf23b5161688818342b2bf9a45b49f479 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-03-01 00:21:14 +01:00
Shawn O. Pearce	03f78fc3bc	UnpackedObject: Fix readSome() when initial read is short JDK7 changed behavior slightly on some InputStream types, resulting in the first read being shorter than the count requested. That caused us to overwrite the earlier part of the buffer with later data, as the offset index wasn't updated in the loop. Fix the loop to increment offset by the number of bytes read in this iteration, so the next read appends to the buffer rather than doing an overwrite. Bug: 338119 Change-Id: I222fb2f993cd9b637b6b8d93daab5777ef7ec7a6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-25 17:20:14 -08:00
Chris Aniszczyk	7e1f18c079	Merge "RevWalk: Don't release during inMergeBase()"	2011-02-24 11:23:47 -05:00
Shawn Pearce	2c9a192505	Merge "Fix formatting of pom.xml"	2011-02-24 10:29:47 -05:00
Matthias Sohn	e0a8398f1f	FetchCommand: do not set a null credentials provider FetchCommand now does not set a null credentials provider on Transport because in this case the default provider is replaced with null and the default mechanism for providing credentials is not working. Change-Id: I44096aa856f031545df39d4b09af198caa2c21f6 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-02-24 16:05:05 +01:00
Matthias Sohn	8d000bd578	Fix formatting of pom.xml Change-Id: I508def09cb2d4e5bd27b412f4ad5d43984388749 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-02-24 16:04:33 +01:00
Shawn O. Pearce	e757975fcd	RevWalk: Don't release during inMergeBase() In `bc1af8459e` ("RevWalk: Don't reset ObjectReader when stopping") we stopped releasing the reader when the current log traversal is over. This should have also been applied to the merge base logic that is buried within MergeGenerator, but got missed. Change-Id: I8328f43f02cba06fd545e22134872e781b9d4d36 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-24 06:55:25 -08:00
Shawn Pearce	2902c7679b	Merge "Respect core.excludesfile to enable global ignore rules "	2011-02-23 18:08:50 -05:00
Matthias Sohn	e703b6c640	Respect core.excludesfile to enable global ignore rules Also use FS.resolve() to properly resolve files from path strings. Bug: 328428 (partial fix) Change-Id: I41d94694f220dcb85605c9acadfffb1fa23beaeb Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-02-23 23:44:50 +01:00
Shawn O. Pearce	7505b93546	PackWriter: Add missing timers to Statistics We did not record the time spent on the object reuse search or the object size lookup, both of which occur between the counting phase and the compressing phase. If there are enough objects involved, these times can be significant so its worth timing them and recording it. Change-Id: I89084acfc598bb6533d75d90cb8de459f0ed93be Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-23 11:56:19 -08:00
Sasa Zivkov	5f3d577e5a	Show notes in Log CLI command Support for --no-standard-notes and --show-notes=REF options is added to the Log command. The --show-notes option can be specified more than once if more than one notes branch should be used for showing notes. The notes are displayed from note branches in the order how the note branches are specified in the command line. However, the standard note, from the refs/notes/commits, is always displayed as first unless the --no-standard-notes options is given. Change-Id: I4e7940804ed9d388b625b8e8a8e25bfcf5ee15a6 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-23 09:32:23 -06:00
Shawn O. Pearce	977446e5da	PackWriter: Fix total delta count The total delta count is supposed to include reused deltas, not just newly created deltas. Change-Id: I98cbdcef80d59714a4f62ff322e7b709b08b6d26 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-22 17:00:05 -08:00
Shawn O. Pearce	561549d766	Merge "Create empty GIT_DIR/hooks directory"	2011-02-22 10:46:09 -05:00
Shawn Pearce	5bf3df5e1d	Merge "Fix potential NullPointerException in PlotCommit"	2011-02-22 10:45:51 -05:00
Shawn O. Pearce	6444e60d0e	Create empty GIT_DIR/hooks directory Bug: 337801 Change-Id: I5e0c4d838a211509fb4cc7e048dba6efaec15d5c Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-22 07:38:51 -08:00
Mathias Kinzler	9953e2a39e	Fix potential NullPointerException in PlotCommit Change-Id: Ib7f661a259561251e74337fa233036e041c42423 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-02-22 09:11:42 +01:00
Shawn O. Pearce	8f865bfffe	PackWriter: Hoist and cluster reference targets Many source browsers and network related tools like UploadPack need to find and parse the target of all branches and annotated tags within the repository during their startup phase. Clustering these together into the same part of the pack file will improve locality, reducing thrashing when an application starts and needs to load all of these into memory at once. To prevent bottlenecking basic log viewing tools that are scannning backwards from the tip of a current branch (and don't need tags) we place this cluster of older targets after 4096 newer commits have already been placed into the pack stream. 4096 was chosen as a rough guess, but was based on a few factors: - log viewers typically show 5-200 commits per page - users only view the first page or two - DHT can cram 2200-4000 commits per 1 MiB chunk thus these will fall into the second commit chunk (roughly) Unfortunately this placement hurts history tools that are scanning backwards through the commit graph and completely ignored tags or branch heads when they started. An ancient tagged commit is no longer positioned behind its first child (its now much earlier), resulting in a page fault for the parser to reload this cluster of objects on demand. This may be an acceptable loss. If a user is walking backwards and has already scanned through more than 4096 commits of history, waiting for the region to reload isn't really that bad compared to the amount of time already spent. If the repository is so small that there are less than 4096 commits, this change has no impact on the placement of objects. Change-Id: If3052e430d305e17878d94145c93754f56b74c61 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 18:15:26 -08:00
Shawn O. Pearce	19037e8cfc	PackWriter: Parse tag target objects in a batch If the underlying storage has a high latency per SHA-1 lookup (e.g. the DHT support we are working on), parsing each wanted annotated tag object back to its underlying commit is too slow, its a sequential lookup for each tag. With hundreds of tags in a repository this takes far too long. Instead queue up a list of the tags whose objects need to be found, and then locate all of those in one parseAny batch. This works for the common case of annotated tag to single tree or commit. For the less often used tag->tag->commit, it at least gets us one level parsed in the larger batch before we have to go back to sequential lookups. Change-Id: I94beef3f14281406f15c8cf9fa02d83faf102a19 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 18:15:25 -08:00
Shawn O. Pearce	388ba7e005	PackWriter: Correct total delta count when reusing pack If the CachedPack knows its delta count, we need to increment both the totalDeltas and reusedDeltas fields of the stats object. Change-Id: I70113609c22476ce7f1e4d9a92f486e9b0f59e44 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 17:36:45 -08:00
Shawn O. Pearce	3e64b928d5	PackWriter: Short-circuit counting on full cached pack reuse If one or more cached packs fully covers the request, don't bother with looking up the objects and trying to walk the graph. Just use the cached packs and return immediately. This helps clones of quiet repositories that have not been modified since their last repack, its likely the cached packs are accurate and no graph walking is required. Change-Id: I9062a5ac2f71b525322590209664a84051fd5f8a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 17:36:45 -08:00
Shawn O. Pearce	4275c4c1cf	PackWriter: Fix warning about untyped collection Change-Id: I44699d8ab9768844ba91f7224a7d4ee685c93ce6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 17:36:45 -08:00
Shawn O. Pearce	5fc0f1043b	BundleWriter: Always use OFS_DELTA CGit just learned to always use OFS_DELTA when writing out bundle files. This makes sense because bundle came about well after OFS_DELTA was established, so any version of CGit that can read a bundle file can also read OFS_DELTA. Since OFS_DELTA is smaller, always use it when writing bundles. Change-Id: I44f9921494798ea0c99e16eab58b87bebeb9aff5 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-18 17:36:45 -08:00
Chris Aniszczyk	ae0a1340ff	Merge "PackWriter: Sort commits by parse order to improve locality"	2011-02-18 14:30:19 -05:00
Tomasz Zarna	dcb7e477ee	Wrong constant used when configuring a repository Bug: 337546 Change-Id: Ib2f31d621caa5f8b24ce74ce82499889d4f30550	2011-02-18 11:41:19 +01:00
Shawn O. Pearce	733780e8a1	PackWriter: Sort commits by parse order to improve locality RevWalk in JGit and the revision code in C Git both parse commits out of the pack file in an order that differs from strict timestamp and topological sorting. Both implementations pop a commit from the head of a date queue, and then immediately parse all of its parents in order to insert those into the date queue at the proper positions as determined by their committer timestamp field. This implies that the parents are parsed when their most recent child is popped from the queue, and not where they are popped during traversal. Hoisting a parent commit to be immediately behind its child improves locality by making sure all parents of a merge are clustered together, and thus can be paged into the parser by the pack file buffering system (aka WindowCache in JGit) together. Change-Id: I80f9e64cafa2e8f082776b43845edf23065386a2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-17 21:32:03 -08:00
Shawn Pearce	681739b1c8	Merge "Changed TreeWalk.forPath(...) to work with recursive paths."	2011-02-18 00:21:59 -05:00
Jesse Greenwald	c5863e4d3b	Changed TreeWalk.forPath(...) to work with recursive paths. Previously, this method would not (always) work when a recursive path such as "a/b" was passed into it. Change-Id: I0752a1f5fc7fef32064d8f921b33187c0bdc7227 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-17 13:48:22 -06:00
Chris Aniszczyk	5f258d91c0	Add git-reset to the Git API Bug: 334764 Change-Id: Ice404629687d7f2a595d8d4eccf471b12f7e32ec Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-17 10:28:26 -06:00
Shawn Pearce	c13bf05754	Merge "Fix NullPointer when pulling from a deleted local branch"	2011-02-16 10:15:31 -05:00
Stefan Lay	68459b646e	Fix NullPointer when pulling from a deleted local branch A checked Exception is thrown instead. The reason for throwing an Exception is that the state of the repository is inconsistent in this case: There is a merge configuration containing a non-existing local branch. Ideally the deletion of a local branch should also delete the corresponding merge configuration. Bug: 337315 Change-Id: I71e56ffb90e11e6e3c1bbd964ad63972d67990c0 Signed-off-by: Stefan Lay <stefan.lay@sap.com>	2011-02-16 15:46:26 +01:00
Shawn O. Pearce	bd531eb998	smart-http: Support progress in ReceivePack As PackParser supports a progress meter for the "Resolving deltas" phase of its work, we should export this to smart HTTP clients so they know the server is still working on their (large) upload. However this isn't as simple as just dropping in a binding for the SmartOutputStream to flush when its told to. We want to avoid spurious flushes triggered by the use of sideband, or the status report formatting in the send-pack/receive-pack protocol. Change-Id: Ibd88022a298c5fed0edb23dfaf2e90278807ba8b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-15 16:32:51 -08:00
Shawn O. Pearce	14f99dc29d	PackWriter: Try for accurate delta reuse on cached pack If a cached pack is used, it might know how many deltas are contained within it. Record that count as part of our reusedDeltas field for the stats line we show clients. Change-Id: I1c61fb817305a95eeac654cccf132cba20b2339c Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-15 16:32:51 -08:00
Shawn O. Pearce	1f7982f642	UploadPack: Expose advertised refs to callers Like ReceivePack, callers that embed UploadPack within their service may wish to see the set of references that were sent to the client. We already have the map on hand, it just needs to be exposed with a getter. Change-Id: I123b23e475860d5bb968906bef59068985088b7b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-15 16:32:51 -08:00
Shawn O. Pearce	a06be83045	RepositoryBuilder: Allow callers to require repository exists The setMustExist() method allows callers to require the repository exists in order for build() to succeed. This is useful within a RepositoryResolver where existence is required. Change-Id: I6a1154551435cf0da6c2b4a7f4dce266abea5dff Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-15 18:17:46 -06:00
Chris Aniszczyk	22abbd338d	Merge "daemon: Use HTTP's resolver and factory pattern"	2011-02-15 12:09:39 -05:00
Shawn Pearce	3a00ffa528	Merge "Fix processing of broken symbolic references in RefDirectory"	2011-02-15 09:33:31 -05:00
Marc Strapetz	b297cf67a9	Fix processing of broken symbolic references in RefDirectory Change-Id: I1f85890fe718f38ef4b62ebe711f0668267873a2	2011-02-15 09:33:39 +01:00
Shawn O. Pearce	1b7a5a2960	daemon: Use HTTP's resolver and factory pattern Using a resolver and factory pattern for the anonymous git:// Daemon class makes transport.Daemon more useful on non-file storage systems, or in embedded applications where the caller wants more precise control over the work tasks constructed within the daemon. Rather than defining new interfaces, move the existing HTTP ones into transport.resolver and make them generic on the connection handle type. For HTTP, continue to use HttpServletRequest, and for transport.Daemon use DaemonClient. To remain compatible with transport.Daemon, FileResolver needs to learn how to use multiple base directories, and how to export any Repository instance at a fixed name. Change-Id: I1efa6b2bd7c6567e983fbbf346947238ea2e847e Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-14 18:28:21 -08:00
Chris Aniszczyk	8235b88a4b	Merge "UploadPack: Expose PackWriter activity to a logger"	2011-02-14 18:15:34 -05:00
Chris Aniszczyk	cb2a22a9a5	Merge "RevWalk: Avoid unnecessary re-parsing of commit bodies"	2011-02-14 18:14:59 -05:00
Chris Aniszczyk	e22cbd1847	Merge "RevWalk: Don't reset ObjectReader when stopping"	2011-02-14 18:13:39 -05:00
Chris Aniszczyk	100299f59b	Merge "UploadPack: Donate parsed commits to PackWriter"	2011-02-14 18:13:00 -05:00
Mathias Kinzler	9c84574e8f	CreateBranchCommand: Wrong existence check Bug: 337044 Change-Id: I3bc42fea1f552f10d4729999cab6fb4241b70325 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-02-14 15:48:43 +01:00
Shawn O. Pearce	c8c4524b6b	UploadPack: Expose PackWriter activity to a logger The UploadPackLogger interface allows applications that embed GitServlet or otherwise use UploadPack to service clients to track and log how PackWriter was used, and what it sent. This provides more granularity into the request activity than might be available from the HTTP server logs, helping administrators to better understand utilization and Git server performance. Change-Id: I1d36b060eb3385339d5f986e68192789ef70fc4e Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 13:43:11 -08:00
Shawn O. Pearce	24c1c530db	RevWalk: Avoid unnecessary re-parsing of commit bodies If the RevFilter doesn't actually require the commit body, we shouldn't reparse it if the body was disposed. This happens often inside of UploadPack during common ancestor negotation, the RevWalk is reset and re-run over roughly the same commit space, but the bodies are discarded because the commit message is not relevant to the process. Change-Id: I87b6b6a5fb269669867047698abf718d366bd002 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 13:43:11 -08:00
Shawn O. Pearce	bc1af8459e	RevWalk: Don't reset ObjectReader when stopping Applications like UploadPack reset() and reuse the same RevWalk multiple times in very rapid succession. Releasing the ObjectReader's internal state on each use, only to allocate it again on the next cycle kills performance if the ObjectReader has internal caches, or even if the Inflater gets returned and pulled from the InflaterCache too frequently. Making releasing the ObjectReader the application's responsibility when it is done with the RevWalk, which most already do by wrapping their loop in a try/finally block. Change-Id: I3ad188a719e8d7f6bf27d1a7ca16d465534713f4 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 13:43:11 -08:00
Shawn O. Pearce	5664fb3bfb	UploadPack: Donate parsed commits to PackWriter When UploadPack has computed the merge base between the client's have set and the want set, its already loaded and parsed all of the interesting commits that PackWriter needs to transmit to the client. Switching the RevWalk and its object pool over to be an ObjectWalk saves PackWriter from needing to re-parse these same commits from the ObjectDatabase, reducing the startup latency for the enumeration phase of packing. UploadPack doesn't want to use an ObjectWalk for the okToGiveUp() tests because its slower, during each commit popped it needs to cache the tree into the pendingObjects list, and during each reset() it discards a bunch of ObjectWalk specific state and reallocates some internal collections. ObjectWalk was never meant to be rapidly reset() like UploadPack does, so its perhaps somewhat cleaner to allow "upgrading" a RevWalk to an ObjectWalk. Bug: 301639 Change-Id: I97ef52a0b79d78229c272880aedb7f74d0f7532f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 13:43:11 -08:00
Chris Aniszczyk	3dcbf375a8	Setup the default remote and merge config in CloneCommand Bug: 336621 Change-Id: I8c889d7b42f6f121d096acad1fada8e3752d74f9 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-13 15:33:45 -06:00
Shawn O. Pearce	3271bcee2b	UploadPack: Rely on peeled ref data for include-tag The peeled reference information for tags is more efficient to work with than parsing the tag objects, as usually its coming from the packed-refs file, which stores the peeled information for us. Rely on the peeled information to decide if the tag should be included or not, instead of using our RevWalk to parse the object. Change-Id: I6714a8560a1c04b5578e9c5b469ea3c77188dff3 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 15:32:23 -06:00
Shawn O. Pearce	f9c9fe5226	UploadPack: Assume okToGiveUp is initially false When negotiate() starts there is at least one want, but no haves, and thus no common base exists. Its not ok to give up yet, the client should try to find a common base with the server. Avoid scanning our history along the want chains until we have found at least one commit in common with the client, this will trigger okToGiveUp to be set to null, enabling okToGiveUp() to perform the scan. Bug: 301639 Change-Id: I98a82a5424fd4c9995924375c7910f76ca4f03af Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-13 15:30:25 -06:00
Shawn O. Pearce	d6b7139cd8	UploadPack: Avoid walking the entire project history If the client presents a common commit on a side branch, and there is a want for a disconnected branch UploadPack was walking back on the entire history of the disconnected branch because it never would find the common commit. Limit our search back along any given want to be no earlier than the oldest common commit received via a "have" line from our client. This prevents us from looking at all of the project history. Bug: 301639 Change-Id: Iffaaa2250907150d6efa1cf2f2fcf59851d5267d Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-13 15:26:26 -06:00
Chris Aniszczyk	8949ea4789	Merge "UploadPack: Tag non-commits SATISIFIED earlier"	2011-02-13 16:23:35 -05:00
Chris Aniszczyk	d90b6aaa44	Merge "UploadPack: Don't discard COMMON, SATISIFIED flags"	2011-02-13 16:23:07 -05:00
Chris Aniszczyk	6a90776240	Merge "UploadPack: Fix want-is-satisfied test"	2011-02-13 16:21:11 -05:00
Chris Aniszczyk	7eca85f4eb	Merge "UploadPack: Avoid parsing want list on clone"	2011-02-13 16:20:27 -05:00
Matthias Sohn	f2c8eec57b	Qualify post 0.11 builds Change-Id: Ibcef4fc4c986c2cda01e943d16aa1c53eff99f25 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-02-12 03:30:05 +01:00
Matthias Sohn	857d151198	JGit 0.11.1 Change-Id: I9ac2fdfb4326536502964ba614d37d0bd103f524 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-02-11 23:25:34 +01:00
Chris Aniszczyk	b46e06bc74	Merge "Fix NPE on reading global config on MAC" into stable-0.11	2011-02-09 12:27:36 -05:00
Jens Baumgart	b82e4bf771	Fix NPE on reading global config on MAC Bug: 336610 Change-Id: Iefcb85e791723801faa315b3ee45fb19e3ca52fb Signed-off-by: Jens Baumgart <jens.baumgart@sap.com>	2011-02-09 15:12:31 +01:00
Jens Baumgart	c9e4a78555	Add isOutdated method to DirCache isOutdated returns true iff the memory state differs from the index file. Change-Id: If35db06743f5f588ab19d360fd2a18a07c918edb Signed-off-by: Jens Baumgart <jens.baumgart@sap.com>	2011-02-09 15:02:22 +01:00
Mathias Kinzler	724af77c65	PullCommand: use default remote instead of throwing Exception When pulling into a local branch that has no upstream configuration, pull should try to used the default remote ("origin") instead of throwing an Exception. Bug: 336504 Change-Id: Ife75858e89ea79c0d6d88ba73877fe8400448e34 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-02-08 08:56:19 +01:00
Shawn O. Pearce	0180946bc8	Remove quoting of command over SSH If the command contains spaces, it needs to be evaluated by the remote shell. Quoting the command breaks this, making it impossible to run a remote command that needs additional options. Bug: 336301 Change-Id: Ib5d88f0b2151df2d1d2b4e08d51ee979f6da67b5 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-06 14:04:39 -08:00
Shawn O. Pearce	0fe7eeba04	UploadPack: Tag non-commits SATISIFIED earlier This gets non-commits out of the wantSatisfied() main loop by making use of the cached SATISIFIED flag and its existing bypass. Anything that isn't a commit cannot be discovered by the have negotiation, so its always assumed to be SATISIFIED by the server. Bug: 301639 Change-Id: I1ef354fbf2e2ed44c9020a4069d7179f2159f19f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-06 01:08:41 -08:00
Shawn O. Pearce	b5da75bb87	UploadPack: Don't discard COMMON, SATISIFIED flags When the walker resets, its going to scrub the COMMON and SATISIFIED flags off a commit if the commit is contained within another commit the client wants. This is common if the client asks for both a 'maint' and 'master' branch, and 'maint' is also fully merged into 'master'. COMMON shouldn't be scrubbed during reset because its used to control membership of the commonBase collection, which is a List. commonBase should technically be a set, but membership is cheaper with a RevFlag. COMMON appears on a commit reachable from a WANT when there is also a PEER_HAS flag present, as this is a merge base. Scrubbing this off when another branch is tested isn't useful. SATISIFIED is a cache to tell us if wantSatisified() has already completed for this particular WANT. If it has, there isn't a need to recompute on that branch. Scrubbing it off 'maint' when we test 'master' just means we would later need to re-test 'maint', wasting CPU time on the server. Bug: 301639 Change-Id: I3bb67d68212e4f579e8c5dfb138f007b406d775f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-06 01:08:41 -08:00
Shawn O. Pearce	a35c793b2d	UploadPack: Fix want-is-satisfied test okToGiveUpImp() has been missing a ! for a long time. This loop over wantAll() is looking for an object where wantSatisfied() returns false, because there is no common merge base present. Unfortunately it was missing a !, causing the loop to break and return false after at least one want was satisified. Bug: 301639 Change-Id: Ifdbe0b22c9cd0a9181546d090b4990d792d70c82 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-06 01:08:41 -08:00
Shawn O. Pearce	c6423932bf	Fix JGit --upload-pack, --receive-pack options JGit did not use sh -c to run the receive-pack or upload-pack programs locally, which caused errors if these strings contained spaces and needed the local shell to evaluate them. Win32 support using cmd.exe /c is completely untested, but seems like it should work based on the limited information I could get through Google search results. Bug: 336301 Change-Id: I22e5e3492fdebbae092d1ce6b47ad411e57cc1ba Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-05 17:40:54 -08:00
Shawn O. Pearce	2096c749c3	UploadPack: Avoid parsing want list on clone If a client wants to perform a clone of the repository, it sends wants, but no haves. There is no point in parsing the want list within UploadPack, as there won't be a common merge base search. Instead just defer the parsing to PackWriter, which will do its own parsing and object enumeration. If the client does have a "have" set, defer parsing of the want list until the have list is also parsed, and parse them together in a single batch queue. This lets the underlying storage system use a larger lookup batch if there is significant latency involved when resolving an ObjectId to a RevObject. Change-Id: I9c30d34f8e344da05c8a2c041a6dc181d8e8bc19 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-04 09:10:23 -08:00
Shawn O. Pearce	a3620cbbe1	Reuse cached SHA-1 when computing from WorkingTreeIterator Change-Id: I2b2170c29017993d8cb7a1d3c8cd94fb16c7dd02 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2011-02-03 17:06:23 -06:00
Shawn O. Pearce	461b012e95	PackWriter: Support reuse of entire packs The most expensive part of packing a repository for transport to another system is enumerating all of the objects in the repository. Once this gets to the size of the linux-2.6 repository (1.8 million objects), enumeration can take several CPU minutes and costs a lot of temporary working set memory. Teach PackWriter to efficiently reuse an existing "cached pack" by answering a clone request with a thin pack followed by a larger cached pack appended to the end. This requires the repository owner to first construct the cached pack by hand, and record the tip commits inside of $GIT_DIR/objects/info/cached-packs: cd $GIT_DIR root=$(git rev-parse master) tmp=objects/.tmp-$$ names=$(echo $root \| git pack-objects --keep-true-parents --revs $tmp) for n in $names; do chmod a-w $tmp-$n.pack $tmp-$n.idx touch objects/pack/pack-$n.keep mv $tmp-$n.pack objects/pack/pack-$n.pack mv $tmp-$n.idx objects/pack/pack-$n.idx done (echo "+ $root"; for n in $names; do echo "P $n"; done; echo) >>objects/info/cached-packs git repack -a -d When a clone request needs to include $root, the corresponding cached pack will be copied as-is, rather than enumerating all of the objects that are reachable from $root. For a linux-2.6 kernel repository that should be about 376 MiB, the above process creates two packs of 368 MiB and 38 MiB[1]. This is a local disk usage increase of ~26 MiB, due to reduced delta compression between the large cached pack and the smaller recent activity pack. The overhead is similar to 1 full copy of the compressed project sources. With this cached pack in hand, JGit daemon completes a clone request in 1m17s less time, but a slightly larger data transfer (+2.39 MiB): Before: remote: Counting objects: 1861830, done remote: Finding sources: 100% (1861830/1861830) remote: Getting sizes: 100% (88243/88243) remote: Compressing objects: 100% (88184/88184) Receiving objects: 100% (1861830/1861830), 376.01 MiB \| 19.01 MiB/s, done. remote: Total 1861830 (delta 4706), reused 1851053 (delta 1553844) Resolving deltas: 100% (1564621/1564621), done. real 3m19.005s After: remote: Counting objects: 1601, done remote: Counting objects: 1828460, done remote: Finding sources: 100% (50475/50475) remote: Getting sizes: 100% (18843/18843) remote: Compressing objects: 100% (7585/7585) remote: Total 1861830 (delta 2407), reused 1856197 (delta 37510) Receiving objects: 100% (1861830/1861830), 378.40 MiB \| 31.31 MiB/s, done. Resolving deltas: 100% (1559477/1559477), done. real 2m2.938s Repository owners can periodically refresh their cached packs by repacking their repository, folding all newer objects into a larger cached pack. Since repacking is already considered to be a normal Git maintenance activity, this isn't a very big burden. [1] In this test $root was set back about two weeks. Change-Id: Ib87131d5c4b5e8c5cacb0f4fe16ff4ece554734b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-03 13:20:22 -08:00
Shawn O. Pearce	71f168fcd7	PackWriter: Display totals after sending objects CGit pack-objects displays a totals line after the pack data was fully written. This can be useful to understand some of the decisions made by the packer, and has been a great tool for helping to debug some of that code. Track some of the basic values, and send it to the client when packing is done: remote: Counting objects: 1826776, done remote: Finding sources: 100% (55121/55121) remote: Getting sizes: 100% (25654/25654) remote: Compressing objects: 100% (11434/11434) remote: Total 1861830 (delta 3926), reused 1854705 (delta 38306) Receiving objects: 100% (1861830/1861830), 386.03 MiB \| 30.32 MiB/s, done. Change-Id: If3b039017a984ed5d5ae80940ce32bda93652df5 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-02 17:17:57 -08:00
Shawn O. Pearce	04759f3274	RefAdvertiser: Avoid object parsing It isn't strictly necessary to validate every reference's target object is reachable in the repository before advertising it to a client. This is an expensive operation when there are thousands of references, and its very unlikely that a reference uses a missing object, because garbage collection proceeds from the references and walks down through the graph. So trying to hide a dangling reference from clients is relatively pointless. Even if we are trying to avoid giving a client a corrupt repository, this simple check isn't sufficient. It is possible for a reference to point to a valid commit, but that commit to have a missing blob in its root tree. This can be caused by staging a file into the index, waiting several weeks, then committing that file while also racing against a prune. The prune may delete the blob, since its modification time is more than 2 weeks ago, but retain the commit, since its modification time is right now. Such graph corruption is already caught during PackWriter as it enumerates the graph from the client's want list and digs back to the roots or common base. Leave the reference validation also for that same phase, where we know we have to parse the object to support the enumeration. Change-Id: Iee70ead0d3ed2d2fcc980417d09d7a69b05f5c2f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-02-02 17:16:32 -08:00
Chris Aniszczyk	f265a80d2e	Merge "Expose some constants needed for reading the Pull configuration"	2011-02-02 10:22:23 -05:00
Mathias Kinzler	13a406287e	Expose some constants needed for reading the Pull configuration Change-Id: I72cb1cc718800c09366306ab2eebd43cd82023ff Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-02-02 14:45:37 +01:00
Jens Baumgart	29ed09a44f	PushCommand: do not set a null credentials provider PushCommand now does not set a null credentials provider on Transport because in this case the default provider is replaced with null and the default mechanism for providing credentials is not working. Bug: 336023 Change-Id: I7a7a9221afcfebe2e1595a5e59641e6c1ae4a207 Signed-off-by: Jens Baumgart <jens.baumgart@sap.com>	2011-02-02 13:13:28 +01:00
Robin Stocker	b0245b548b	Don't print "into HEAD" when merging refs/heads/master When MergeMessageFormatter was given a symbolic ref HEAD which points to refs/heads/master (which is the case when merging a branch in EGit), it would result in a merge message like the following: Merge branch 'a' into HEAD But it should print the following (as C Git does): Merge branch 'a' The solution is to use the leaf ref when checking for refs/heads/master. Change-Id: I28ae5713b7e8123a0176fc6d7356e469900e7e97	2011-02-01 22:27:33 +01:00
Shawn O. Pearce	13bcf05a9e	PackWriter: Make thin packs more efficient There is no point in pushing all of the files within the edge commits into the delta search when making a thin pack. This floods the delta search window with objects that are unlikely to be useful bases for the objects that will be written out, resulting in lower data compression and higher transfer sizes. Instead observe the path of a tree or blob that is being pushed into the outgoing set, and use that path to locate up to WINDOW ancestor versions from the edge commits. Push only those objects into the edgeObjects set, reducing the number of objects seen by the search window. This allows PackWriter to only look at ancestors for the modified files, rather than all files in the project. Limiting the search to WINDOW size makes sense, because more than WINDOW edge objects will just skip through the window search as none of them need to be delta compressed. To further improve compression, sort edge objects into the front of the window list, rather than randomly throughout. This puts non-edges later in the window and gives them a better chance at finding their base, since they search backwards through the window. These changes make a significant difference in the thin-pack: Before: remote: Counting objects: 144190, done remote: Finding sources: 100% (50275/50275) remote: Getting sizes: 100% (101405/101405) remote: Compressing objects: 100% (7587/7587) Receiving objects: 100% (50275/50275), 24.67 MiB \| 9.90 MiB/s, done. Resolving deltas: 100% (40339/40339), completed with 2218 local objects. real 0m30.267s After: remote: Counting objects: 61549, done remote: Finding sources: 100% (50275/50275) remote: Getting sizes: 100% (18862/18862) remote: Compressing objects: 100% (7588/7588) Receiving objects: 100% (50275/50275), 11.04 MiB \| 3.51 MiB/s, done. Resolving deltas: 100% (43160/43160), completed with 5014 local objects. real 0m22.170s The resulting pack is 13.63 MiB smaller, even though it contains the same exact objects. 82,543 fewer objects had to have their sizes looked up, which saved about 8s of server CPU time. 2,796 more objects from the client were used as part of the base object set, which contributed to the smaller transfer size. Change-Id: Id01271950432c6960897495b09deab70e33993a9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Sigend-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-01 09:12:06 -06:00
Shawn O. Pearce	2fbcba41e3	PackWriter: Cleanup findObjectToPack method Some of this code predates making ObjectId.equals() final and fixing RevObject.equals() to match ObjectId.equals(). It was therefore more complex than it needs to be, because it tried to work around RevObject's broken equals() rules by converting to ObjectId in a different collection. Also combine setUpWalker() and findObjectsToPack() methods, these can be one method and the code is actually cleaner. Change-Id: I0f4cf9997cd66d8b6e7f80873979ef1439e507fe Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-01 09:03:24 -06:00
Shawn O. Pearce	8f63dface2	PackWriter: Correct 'Compressing objects' progress message The first 'Compressing objects' progress message is wrong, its actually PackWriter looking up the sizes of each object in the ObjectDatabase, so objects can be sorted correctly in the later type-size sort that tries to take advantage of "Linus' Law" to improve delta compression. Rename the progress to say 'Getting sizes', which is an accurate description of what it is doing. Change-Id: Ida0a052ad2f6e994996189ca12959caab9e556a3 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-01 09:01:58 -06:00
Chris Aniszczyk	eb5658e629	Merge "Add git-clone to the Git API"	2011-02-01 09:56:46 -05:00
Shawn O. Pearce	37a10e3006	PackWriter: Don't include edges in progress meter When compressing objects, don't include the edges in the progress meter. These cost almost no CPU time as they are simply pushed into and popped out of the delta search window. Change-Id: I7ea19f0263e463c65da34a7e92718c6db1d4a131 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-02-01 08:55:43 -06:00
Chris Aniszczyk	cc5295c4b4	Merge "Show resolving deltas progress to push clients"	2011-02-01 09:40:57 -05:00
Chris Aniszczyk	c1de63262e	Merge "ObjectWalk: Fix reset for non-commit objects"	2011-02-01 09:38:30 -05:00
Chris Aniszczyk	4112884ede	Add git-clone to the Git API Enhance the Git API to support cloning repositories. Bug: 334763 Change-Id: Ibe1191498dceb9cbd1325aed85b4c403db19f41e Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-31 16:56:56 -06:00
Shawn O. Pearce	168114fd39	Show resolving deltas progress to push clients CGit push clients 1.6.6 and later support progress messages on the side-band-64k channel during push, as this was introduced to handle server side hook errors reported over smart HTTP. Since JGit's delta resolution isn't always as fast as CGit's is, a user may think the server has crashed and failed to report status if the user pushed a lot of content and sees no feedback. Exposing the progress monitor during the resolving deltas phase will let the user know the server is still making forward progress. This also helps BasePackPushConnection, which has a bounded timeout on how long it will wait before assuming the remote server is dead. Progress messages pushed down the side-band channel will reset the read timer, helping the connection to stay alive and avoid timing out before the remote side's work is complete. Change-Id: I429c825e5a724d2f21c66f95526d9c49edcc6ca9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-31 12:31:52 -08:00
Shawn O. Pearce	c2ab3421a2	ObjectWalk: Fix reset for non-commit objects Non-commits are added to a pending queue, but duplicates are removed by checking a flag. During a reset that flag must be stripped off the old roots, otherwise the caller cannot reuse the old roots after the reset. RevWalk already does this correctly for commits, but ObjectWalk failed to handle the non-commit case itself. Change-Id: I99e1832bf204eac5a424fdb04f327792e8cded4a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-31 12:31:52 -08:00
Mathias Kinzler	b15b9d5df2	Proper handling of rebase during pull After consulting with Christian Halstrick, it turned out that the handling of rebase during pull was implemented incorrectly. Change-Id: I40f03409e080cdfeceb21460150f5e02a016e7f4 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-31 12:12:48 +01:00
Robin Rosenberg	9ffcf2a8b3	Merge changes I3a74cc84,I219f864f * changes: [findbugs] Do not ignore exceptional return value of createNewFile() Do not create files to be updated before checkout of DirCache entry	2011-01-29 17:52:12 -05:00
Tomasz Zarna	9fbda22392	Add setCredentialsProvider to PullCommand Bug: 335703 Change-Id: Id9713a4849c772e030fca23dd64b993264f28366 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-28 14:06:47 -06:00
Chris Aniszczyk	a880233d7f	Merge "ObjectIdSubclassMap: Support duplicate additions"	2011-01-28 12:45:39 -05:00
Shawn O. Pearce	17dc6bdafd	ObjectIdSubclassMap: Support duplicate additions The new addIfAbsent() method combines get() with add(), but does it in a single step so that the common case of get() returning null for a new object can immediately insert the object into the map. Change-Id: Ib599ab4de13ad67665ccfccf3ece52ba3222bcba Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-28 08:17:20 -08:00
Chris Aniszczyk	bf69401fee	Merge "Make PullCommand work with Rebase"	2011-01-28 10:52:38 -05:00
Chris Aniszczyk	0b2ac1e929	Merge "RebaseCommand: detect and handle fast-forward properly"	2011-01-28 10:38:27 -05:00
Shawn O. Pearce	065a0a8122	Revert "Teach PackWriter how to reuse an existing object list" This reverts commit `f5fe2dca3c`. I regret adding this feature to the public API. Caches aren't always the best idea, as they require work to maintain. Here the cache is redundant information that must be computed, and when it grows stale must be removed. The redundant information takes up more disk space, about the same size as the pack-*.idx files are. For the linux-2.6 repository, that's more than 40 MB for a 400 MB repository. So the cache is a 10% increase in disk usage. The entire point of this cache is to improve PackWriter performance, and only PackWriter performance, and only when sending an initial clone to a new client. There may be better ways to optimize this, and until we have a solid solution, we shouldn't be using a separate cache in JGit.	2011-01-28 07:20:26 -08:00
Mathias Kinzler	14ca80bc90	Make PullCommand work with Rebase Rebase must honor the upstream configuration branch.<branchname>.rebase Change-Id: Ic94f263d3f47b630ad75bd5412cb4741bb1109ca Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-28 15:04:52 +01:00
Mathias Kinzler	e8a1328d05	RebaseCommand: detect and handle fast-forward properly This bug was hidden by an incomplete test: the current Rebase implementation using the "git rebase -i" pattern does not work correctly if fast-forwarding is involved. The reason for this is that the log command does not return any commits in this case. In addition, a check for already merged commits was introduced to avoid spurious conflicts. Change-Id: Ib9898fe0f982fa08e41f1dca9452c43de715fdb6 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-28 15:03:02 +01:00
Mathias Kinzler	c544e96a4c	TransportHttp wrongly uses JDK 6 constructor of IOException IOException constructor taking Exception as parameter is new for JDK 6. Change-Id: Iec349fc7be9e9fbaeb53841894883c47a98a7b29 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-28 09:24:20 +01:00
Matthias Sohn	38eec8f4a2	[findbugs] Do not ignore exceptional return value of mkdir java.io.File.mkdir() and mkdirs() report failure as an exceptional return value false. Fix the code which silently ignored this exceptional return value. Change-Id: I41244f4b9d66176e68e2c07e2329cf08492f8619 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-01-28 01:11:12 +01:00
Matthias Sohn	9ec97688b9	Do not create files to be updated before checkout of DirCache entry DirCacheCheckout.checkoutEntry() prepares the new file content using a temporary file and then renames it to the file to be written during checkout. For files to be updated checkout() created each file before calling checkoutEntry(). Hence renaming the temporary file always failed which was corrected in exception handling by retrying to rename the file after deleting the just newly created file. Change-Id: I219f864f2ed8d68051d7b5955d0659964fa27274 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-01-28 01:11:12 +01:00
Shawn O. Pearce	f5fe2dca3c	Teach PackWriter how to reuse an existing object list Counting the objects needed for packing is the most expensive part of an UploadPack request that has no uninteresting objects (otherwise known as an initial clone). During this phase the PackWriter is enumerating the entire set of objects in this repository, so they can be sent to the client for their new clone. Allow the ObjectReader (and therefore the underlying storage system) to keep a cached list of all reachable objects from a small number of points in the project's history. If one of those points is reached during enumeration of the commit graph, most objects are obtained from the cached list instead of direct traversal. PackWriter uses the list by discarding the current object lists and restarting a traversal from all refs but marking the object list name as uninteresting. This allows PackWriter to enumerate all objects that are more recent than the list creation, or that were on side branches that the list does not include. However, ObjectWalk tags all of the trees and commits within the list commit as UNINTERESTING, which would normally cause PackWriter to construct a thin pack that excludes these objects. To avoid that, addObject() was refactored to allow this list-based enumeration to always include an object, even if it has been tagged UNINTERESTING by the ObjectWalk. This implies the list-based enumeration may only be used for initial clones, where all objects are being sent. The UNINTERESTING labeling occurs because StartGenerator always enables the BoundaryGenerator if the walker is an ObjectWalk and a commit was marked UNINTERESTING, even if RevSort.BOUNDARY was not enabled. This is the default reasonable behavior for an ObjectWalk, but isn't desired here in PackWriter with the list-based enumeration. Rather than trying to change all of this behavior, PackWriter works around it. Because the list name commit's immediate files and trees were all enumerated before the list enumeration itself starts (and are also within the list itself) PackWriter runs the risk of adding the same objects to its ObjectIdSubclassMap twice. Since this breaks the internal map data structure (and also may cause the object to transmit twice), PackWriter needs to use a new "added" RevFlag to track whether or not an object has been put into the outgoing list yet. Change-Id: Ie99ed4d969a6bb20cc2528ac6b8fb91043cee071 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-27 09:38:19 -08:00
Shawn O. Pearce	a017fdf112	Allow ObjectReuseAsIs to resort objects during writing It can be very handy for the implementation to resort the object list based on data locality, improving prefetch in the operating system's buffer cache. Export the list to the implementation was a proper List, and document that its mutable and OK to be modified. The only caller in PackWriter is already OK with these rules. Change-Id: I3f51cf4388898917b2be36670587a5aee902ff10 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-27 08:58:55 -08:00
Shawn O. Pearce	c218a0760d	PackWriter: Use TOPO order only for incremental packs When performing an initial clone of a repository there are no uninteresting commits, and the resulting pack will be completely self-contained. Therefore PackWriter does not need to honor C Git standard TOPO ordering as described in JGit commit `ba984ba2e0` ("Fix checkReferencedIsReachable to use correct base list"). Switching to COMMIT_TIME_DESC when there are no uninteresting commits allows the "Counting objects" phase to emit progress earlier, as the RevWalk will not buffer the commit list. When TOPO is set the RevWalk enumerates all commits first, before outputing any for PackWriter to mark progress updates from. Change-Id: If2b6a9903b536c7fb3c45f85d0a67ff6c6e66f22 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-27 08:58:44 -08:00
Shawn O. Pearce	559c4661c3	Remove getObjectsDirectory, openPack from base API These two methods are specific to the FileRepository implementation and should not be exposed as part of the base Repository API. Now that PackParser is generic and does not require these two methods to import a pack stream into a repostiory, it is safe to remove these and get them out of the public view. Change-Id: I8990004d08074657f467849dabfdaa7e6674e69a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2011-01-27 08:56:35 -08:00
Shawn Pearce	cc983454c0	Merge "Support for self signed certificate (HTTPS)"	2011-01-27 11:46:49 -05:00
Matthias Sohn	91af19de56	Hard reset should not report conflict on untracked file This problem surfaced since EGit Core ResetOperationTest is failing since change I26806d21. JGit detected checkout conflict for untracked files which never were tracked by the repository. "git reset --hard" in c git also doesn't remove such untracked files. Change-Id: Icc8e1c548ecf6ed48bd2979c81eeb6f578d347bd Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2011-01-27 17:20:04 +01:00
Roberto Tyley	afa7c7ab07	Rename PlotWalk.getTags() to getRefs() Change-Id: I170685e70d9ac09a010df69d26ec1c38bde60174 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-26 22:35:41 -06:00
Roberto Tyley	6ac8279ae7	Provide access to the Refs of a PlotCommit This information is generally useful - have followed the accessor pattern of 'children' and 'parents' Change-Id: I79b3ddd6f390152aa49e6b7a4c72a4aca0d6bc72 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-26 22:03:41 -06:00
Robin Rosenberg	24e7f0f6fa	Fix tests broken by fix for adding files in a network share The change Ie0350e032a97e0d09626d6143c5c692873a5f6a2 was not done properly. The renamed file was not write protected, and this broke a test. Bug: 335388 Change-Id: I41b2235b7677bc5fddc70dda2a56cdd2cb53ce5d Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com>	2011-01-26 13:52:58 -06:00
Mathias Kinzler	a5b36ae1ea	FetchCommand: allow to set "TagOpt" This is needed for implementing Fetch in EGit using the API. Change-Id: Ibdcc95906ef0f93e3798ae20d4de353fb394f2e2 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-26 20:03:02 +01:00
Christian Halstrick	0d7dd6625a	Make sure not to overwrite untracked not-ignored files When DirCacheCheckout was checking out it was silently overwriting untracked files. This is only ok if the files are also ignored. Untracked and not ignored files should not be overwritten. This fix adds checks for this situation. Because this change in the behaviour also broke tests which expected that a checkout will overwrite untracked files (PullCommandTest) these tests have to be modified also. Bug: 333093 Change-Id: I26806d2108ceb64c51abaa877e11b584bf527fc9 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-26 11:41:44 -06:00
Robin Rosenberg	c4c8d80fd3	Fix adding files in a network share We cannot always rename read-only files on network shares, so rename the temp file for a new loose object first, and then set it as read-only. Bug: 335388 Change-Id: Ie0350e032a97e0d09626d6143c5c692873a5f6a2 Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2011-01-26 11:26:55 -06:00
Chris Aniszczyk	509662653b	Merge "Refactor and comment complicated if statements"	2011-01-26 12:23:26 -05:00
Chris Aniszczyk	9b8ac0151e	Merge "MergeCommand should create missing branches"	2011-01-26 12:17:47 -05:00
Mathias Kinzler	414e0cd329	Make setCredentialsProvider more convenient to use Change-Id: I984836ea7d6a67fd2d1d05f270afa7c29f30971c Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2011-01-26 18:03:22 +01:00

1 2 3 4 5 ...

1099 Commits