motiejus/jgit - jgit - gitea: Gitea Service

motiejus

jgit

Author	SHA1	Message	Date
Chris Aniszczyk	a92bda5adf	Merge "Extract pack directory last modified check code"	2010-12-20 10:27:33 -05:00
Mathias Kinzler	645d262de6	Checkout: expose a CheckoutResult This is needed by callers to determine checkout conflicts and possible files that were not deleted during the checkout so that they can present the end user with a better Exception description and retry to delete the undeleted files later, respectively. Change-Id: I037930da7b1a4dfb24cfa3205afb51dc29e4a5b8 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-20 10:21:49 +01:00
Robin Rosenberg	94a2cbb407	Fix wrong javadoc comment in Repository Change-Id: I9fc084b48418884ce1ccf16d56e800f1d3594885 Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com>	2010-12-19 11:03:23 +01:00
Robin Rosenberg	33c6eb848e	Merge "Move TransferConfig to transport package"	2010-12-18 10:43:26 -05:00
Mathias Kinzler	73f36aa8f7	DirCacheCheckout: fix getToBeDeleted() This wrongly returns the same as getConflicts() Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com> Change-Id: Id37c625458fc5a9b3987f05b684620e24fdfe852	2010-12-16 08:41:36 +01:00
Shawn O. Pearce	34454465c2	Move TransferConfig to transport package This doesn't belong in the main lib package. Change-Id: Idb20bf5849138b34a7277250fe0795c2a1f22447 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 17:04:03 -08:00
Shawn Pearce	c19093bbad	Merge "Do not rely on filemode differences in case of symbolic links"	2010-12-15 18:55:59 -05:00
Shawn O. Pearce	3922e026e0	FileBasedConfig: Use FileSnapshot for isOutdated() Relying only on the last modified time for a file can be tricky. The "racy git" problem may cause some modifications to be missed. Use the new FileSnapshot code to track when a configuration file has been modified, and needs to be reloaded in memory. Change-Id: Ib6312fdd3b2403eee5af3f8ae711294b0e5f9035 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 15:14:05 -08:00
Shawn O. Pearce	c8db22f355	Extract pack directory last modified check code Pulling the last modified checking logic out of ObjectDirectory makes it possible to reuse this code for other files, such as the $GIT_DIR/config or $GIT_DIR/packed-refs files. Change-Id: If2f27a89fc3b7adde7e65ff40bbca5d55b98b772 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 15:14:05 -08:00
Shawn O. Pearce	013cb8de38	Reduce calls to Repository.getConfig Each time getConfig() is called on FileRepository, it checks the last modified time of both ~/.gitconfig and $GIT_DIR?config. If $GIT_DIR/config appears to have been modified, it is read back in from disk and the current config is wiped out. When mutating a configuration file, this may cause in-memory edits to disappear. To avoid that callers need to avoid calling getConfig until after the configuration has been saved to disk. Unfortunately the API is still horribly broken. Configuration should be modified only while a lock is held on the configuration file, very similar to the way a ref is updated via its locking protocol. But our existing API is really broken for that so we'll have to defer cleaning up the edit path for a future change. Change-Id: I5888dd97bac20ddf60456c81ffc1eb8df04ef410 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 15:14:05 -08:00
Shawn O. Pearce	86847ee322	Support GIT_SSH=tortoiseplink The tortoiseplink command does not understand -batch, even though it smells like the putty plink command that does use it. Don't add -batch if GIT_SSH is tortoiseplink. Change-Id: I638532a02faa2caf8c39d482094e7ff4f4ec7e78 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 10:18:03 -08:00
Shawn O. Pearce	8efbd378e1	Correct plink -batch option When GIT_SSH is set to use plink, the correct option name is "-batch" and not "--batch". This was a typo introduced when we added support for plink via GIT_SSH. Change-Id: I391660e38f5d208bba11e3f2a8f25922de2af878 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-15 10:17:01 -08:00
Philipp Thun	bab053afdd	Do not rely on filemode differences in case of symbolic links When checking whether a file in the working tree has been modified - WorkingTreeIterator.isModified() - we should not trust the filemode in case of symbolic links, but check the timestamp and also the content, if requested. Without this fix symlinks will always be shown in EGit as modified files on Windows systems. Change-Id: I367c807df5a7e85e828ddacff7fee7901441f187 Signed-off-by: Philipp Thun <philipp.thun@sap.com>	2010-12-14 11:31:41 +01:00
Shawn O. Pearce	5ac5871d16	Simplify NoteParser use of prefix.length() Sasa pointed out we only ever use the length here, so instead of holding onto the AbbreviatedObjectId, lets just hold onto the length as a primitive int. Change-Id: I2444f59f9fe5ddcaea4a3537d3f1064736ae3215 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Sasa Zivkov <zivkov@gmail.com>	2010-12-13 16:01:39 -06:00
Shawn O. Pearce	2bc13104a8	Fix HTTP digest authentication JGit's internal implementation of the HTTP digest authentication method wasn't conforming to RFC 2617 (HTTP Authentication: Basic and Digest Access Authentication), resulting in authentication failures when connecting to a digest protected site. The code now more accurately matches section 3.2.2 (The Authorization Request Header) from the standards document. Change-Id: If41b5c2cbdd59ddd6b2dea143f325e42cd58c395 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-13 10:11:03 -08:00
Matthias Sohn	c6ca443b61	File utilities for creating directories The java.io.File methods for creating directories report failure by returning false. To ease proper checking of return values provide utility methods wrapping mkdir() and mkdirs() which throw IOException on failure. Also fix the tests to store test data under a trash folder and cleanup after test. Change-Id: I09c7f9909caf7e25feabda9d31e21ce154e7fcd5 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-13 08:47:17 -06:00
Shawn O. Pearce	45a020fe6a	DiffFormatter: Use IndexDiffFilter to speed up working tree If DiffFormatter is asked to compare the index to the working tree, it can go faster by using the cached stat information to compare the two entries rather than relying on SHA-1 computation alone. Change-Id: Icb21c15b8279ee8cee382e5e179e0cf8903aee4d Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-10 17:17:22 -08:00
Mathias Kinzler	9b039b42e0	Rebase: abort on unknown/unsupported command in git-rebase-todo This is needed to ensure interoperability with the command line: if the git-rebase-todo file was created manually (by git rebase -i in the command line), and any commands other than pick are used (reword, edit, fixup, squash) JGit must abort as it does not understand these commands yet. The same is true if an unknown command is found (e.g. due to a typo); this is the same behavior as shown by the command line. Change-Id: I2322014f69460361f7fc09da223e8a5c31f100dd Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-10 09:44:51 +01:00
Shawn Pearce	93a7b2b24d	Merge "IndexPack: Remove blob-streaming size threshold"	2010-12-09 19:33:58 -05:00
roberto	941b3d8a81	IndexPack: Remove blob-streaming size threshold Always use streaming (for SHA-checksum & collision detection) when indexing whole blobs, regardless of their size. Positives: * benefits of bugfix #312868 will apply to all runtimes, without additional conf for mem-constrained JVMs (5MB huge for some) * no byte array allocation (re-uses readBuffer instead of allocating new full-size array) * mildly better overall performance (given the usual blob-does-not-need-collision-checking case) * removes unnecessary code Negative: * doubles the disk IO for a blob comparision (comparitively rare occurance) I perf-tested a range of threshold sizes against a random selection of packfiles I found on my harddrive, the results are here: https://spreadsheets.google.com/ccc?key=tLCQElyyd2RKN9QevfvgwGQ&hl=en_GB#gid=1 My interpretation of the results is that the streaming size threshold isn't beneficial (actually seems to be very slightly detrimental) -so we should just get rid of it. This tallies with some of the comments Shawn & I had for the default value of streamFileThreshold in the review for I862afd4c: http://egit.eclipse.org/r/#patch,sidebyside,2040,2,org.eclipse.jgit/src/org/eclipse/jgit/transport/IndexPack.java The perf-test code is here: https://gist.github.com/735402 It's a bit scruffy but basically does 10 runs (in randomised order) for each threshold size on various packfiles, waiting a second between each pack-indexing to allow GC to catch up. I know it's not perfect - proper perf testing is hard to do :-)	2010-12-09 23:46:47 +00:00
Chris Aniszczyk	a3475fb664	Merge "Add option to skip deletion of non-existing files"	2010-12-09 18:31:48 -05:00
Chris Aniszczyk	ec5116b09c	Merge "Simplify logic in StrategySimpleTwoWayInCore"	2010-12-09 18:30:41 -05:00
Matthias Sohn	cbd1ecff4d	Add option to skip deletion of non-existing files For convenience provide an option to skip deletion of non-existing files. Also add some tests for deletion methods in FileUtils. Change-Id: I33e355cfcdc19367d50208150ee49a4a06394890 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-10 00:21:23 +01:00
Shawn O. Pearce	33c670c1f0	Simplify logic in StrategySimpleTwoWayInCore Sasa and I were reviewing this code today and Sasa pointed out we can simplify the conflict logic, as the two cases (subtree and file) are logically identical. Change-Id: Ie0d40b2dd15605785eff453a846b1d20a2d021fc Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Sasa Zivkov <zivkov@gmail.com>	2010-12-09 10:55:43 -08:00
Mathias Kinzler	2a7cd0086b	Rebase: fix wrong update if original HEAD after Merge+Skip Rebase would update the original HEAD to the wrong commit when "skipping" the last commit after a merged commit. Includes a test for the specific situation. Change-Id: I087314b1834a3f11a4561f04ca5c21411d54d993 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-09 19:22:11 +01:00
Christian Halstrick	1783749e16	Add a performance optimized variant of the ANY_DIFF filter If a treewalk walks also over index and the workingtree then the IndexDiffFilter filter can be used which works much faster then the semantically equivalent ANY_DIFF filter. This is because this filter can better avoid computing SHA-1 ids over the content of working-tree files which is very costly. This fix will significantly improve the performance of e.g. EGit's commit dialog. Change-Id: I2a51816f4ed9df2900c6307a54cd09f50004266f Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Philipp Thun <philipp.thun@sap.com>	2010-12-09 18:51:33 +01:00
Mathias Kinzler	6bca46e168	Implement rebase --continue and --skip For --continue, the Rebase command asserts that there are no unmerged paths in the current repository. Then it checks if a commit is needed. If yes, the commit message and author are taken from the author_script and message files, respectively, and a commit is performed before the next step is applied. For --skip, the workspace is reset to the current HEAD before applying the next step. Includes some tests and a refactoring that extracts Strings in the code into constants. Change-Id: I72d9968535727046e737ec20e23239fe79976179 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com> Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-12-09 16:10:21 +01:00
Shawn O. Pearce	18abb8195a	IndexDiff: Remove unnecessary changesExist flag Instead of setting a boolean when a difference record is found, return false from diff() only if all of the collections are empty. When all of them are empty, no difference was found. Change-Id: I555fef37adb764ce253481751071c53ad12cf416 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	a66a7d90fd	IndexDiff: Use isModified() when comparing index-worktree The isModified() is more efficient because it can skip over files that are stat clean, without needing to scan them. This is useful to efficently work on paths that were already staged and thus differ between HEAD and the index, but not between the index and the working tree. Change-Id: I4418202e612f0571974e0898050d987c6c280966 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	d4bbb2e449	IndexDiff: Clean up tree-index compare for staged files When comparing the ObjectIds for two tree entries its faster to use the raw buffer compares over allocating ObjectIds and then performing equals on their contents. However, this also needs to consider the raw modes. It is possible for a path to change modes but not ObjectId (e.g. making a file executable), and in this case its still a staged change to report back to the caller. Change-Id: I1a267254c04b3273a97f63c71d1e6718cd9d2fa8 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	e6c3922764	IndexDiff: Fix getAssumeUnchanged() If the caller really needs the list of files that are flagged as assume-unchanged (aka assume-valid in the DirCache), we should give them the complete list and not just those that we wrongly identified as being modified during diff(). This change is necessary because diff() is slightly broken and is discovering differences on files that it shouldn't have considered. Change-Id: Ibe464c1a0e51c19dc287a4bc5348b7b07f4d840b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	72f87adce6	IndexDiff: Correct Javadoc for getUntracked() method Change-Id: I5f26c40dec5f0e4a47413af033dbedb0c252dd20 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	48e80698cf	IndexDiff: Remove always true not-subtree check The TreeWalk is configured to be recursive, which means subtrees are never presented to the application. Therefore the working tree file mode can never be a subtree/subdirectory at this point in the code. Change-Id: Ie842ddc147957d09205c0d2ce87b25c566862fd9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	ca9baa0ee2	IndexDiff: Always use TreeWalk.getPathString() Instead of asking the individual iterators for their path string, use the TreeWalk's generic getPathString() method. Its just as fast because it uses the path of the current matching iterator. Change-Id: I9b827fbbafce1c78f09d5527cdc64fbe9022a16e Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	f4e9c8890c	IndexDiff: Simplify allocation of filter list We add either 3 or 4 filters. If we are adding only 3 filters, allocating the array for 4 isn't a huge waste of memory, but it does simplify our code. Change-Id: I7df29b414f6d5cfcf533edb1405083e6fcec32cf Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:20 -08:00
Shawn O. Pearce	11fd0fe03a	Clarify WorkingTreeOptions and filemode usage To improve runtime performance, caching the WorkingTreeOptions inside of the Config object using the Config.SectionParser API allows the WorkingTreeOptions to be accessed more efficiently whenever a FileTreeIterator is constructed for the Repository. Instead of passing the filemode handling option into isModified(), the WorkingTreeIterator should always honor whatever setting has been configured in this repository, as defined by its own copy of the WorkingTreeOptions. This simplifies all of the callers as they no longer need to lookup core.filemode on their own. A few locations were changed from always using a hardcoded "true" on the file mode to passing what is actually configured in the repository. This is a behavior change, but corrects what should be considered to be bugs as the core.filemode variable wasn't always being used. Change-Id: Idb176736fa0dc97af372f1d652a94ecc72fb457c Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-08 10:03:19 -08:00
Shawn O. Pearce	c181e1ab8a	IndexPack: Use streaming for large whole blobs When indexing large blobs that are stored whole (non-delta form), avoid allocating the entire blob in memory and instead stream it through the SHA-1 checksum computation. This reduces the size of memory required by IndexPack when processing very big blobs, such as a 500 MiB uncompressable binary. If the large blob already exists in the local repository, its contents needs to be compared byte-for-byte after the entire pack has been indexed, to ensure there isn't an unexpected SHA-1 collision which may result in later data corruption. This compare is performed as a streaming compare, again avoiding the large object allocation. This change doesn't improve on memory utilization for large objects stored as deltas. The change also doesn't improve handling for any large commits, trees or annotated tags. There isn't much to be done here for those objects, because they need to be passed down to the ObjectChecker as a byte[]. Fortunately it isn't common for these object types to be that large, Bug: 312868 Change-Id: I862afd4cb78013ee033d4ec68c067b1774a05be8 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com> CC: Roberto Tyley <roberto.tyley@guardian.co.uk>	2010-12-08 11:30:11 -06:00
Chris Aniszczyk	bc1130c6aa	Merge "Refactor IndexPack to use InputStream for inflation"	2010-12-08 11:19:51 -05:00
Christian Halstrick	e3881de258	Removed unread parameters Some method parameters in WorkingTreeIterator are never used. Remove them. Especially the removal of the FS parameter in isModified() simplifies upcoming performance optimizations. Change-Id: I7c449589283a4a6b6e23f2586cd784febdca8bcd Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-08 10:15:48 -06:00
Shawn O. Pearce	a02be9725c	Remove empty iterator from TreeWalk Its confusing that a new TreeWalk() needs to have reset() invoked on it before addTree(). This is a historical accident caused by how TreeWalk was abused within ObjectWalk. Drop the initial empty tree from the TreeWalk and thus remove a number of pointless reset() operations from unit tests and some of the internal JGit code. Existing application code which is still calling reset() will simply be incurring a few unnecessary field assignments, but they should consider cleaning up their code in the future. Change-Id: I434e94ffa43491019e7dff52ca420a4d2245f48b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-07 16:49:51 -08:00
Shawn O. Pearce	c94efa8286	Refactor IndexPack to use InputStream for inflation By inflating with an InputStream like API, it is possible to stream through large objects rather than allocating the entire thing as a byte[]. This change only refactors the inflation code within IndexPack to use a streaming interface. Change-Id: I5a84b486901c2cf63fa6a3306dd5fb5c53b4056b Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Roberto Tyley <roberto.tyley@guardian.co.uk>	2010-12-07 16:19:48 -08:00
Matthias Sohn	45731756a5	[findbugs] Do not ignore exceptional return value java.io.File.delete() reports failure as an exceptional return value false. Fix the code which silently ignored this exceptional return value. Also remove some duplicate deletion helper methods. Change-Id: I80ed20ca1f07a2bc6e779957a4ad0c713789c5be Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-07 16:18:30 -08:00
Matthias Sohn	e22f9552a8	Provide file utilities for file deletion Provide file helper methods in a reusable utility class to replace many local implementations. java.io.File has some methods reporting failure by returning false. We prefer to throw IOException on failure so that callers can't forget checking the return value. Change-Id: I430c77b5d2cffcf8b47584326ad4817a7291845e Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-07 16:18:29 -08:00
Chris Aniszczyk	db8cc4c84e	Clean up Init API Static accessors should come before a constructor. Change-Id: Iee1051ce4f2038f19a08741e7a3a33f06a97a3c0 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-07 09:13:57 -06:00
Chris Aniszczyk	48b73efe1e	Merge "Rebase Interoperability third part: handle stop upon conflict"	2010-12-07 09:34:25 -05:00
Chris Aniszczyk	a51f44edb0	Merge "Rebase Interoperability second part: fix "pop steps""	2010-12-07 09:19:35 -05:00
Mathias Kinzler	ad96546ca0	Rebase Interoperability third part: handle stop upon conflict There are some files that need to exist so that the CLI can continue after the rebase has been stopped due to conflicts Change-Id: I3cb4dc98609c059bf0cf9fd5f9e47a9c681cea2d Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-07 13:34:44 +01:00
Shawn Pearce	6462be8350	Merge "LockFile.commit: retry renaming"	2010-12-06 18:55:18 -05:00
Chris Aniszczyk	a2469bb5d2	Merge "Add InitCommand"	2010-12-06 17:08:55 -05:00
Chris Aniszczyk	34554e4f1c	Merge "Add debugging toString to TreeFormatter"	2010-12-06 10:11:11 -05:00
Chris Aniszczyk	6eb6d7c77a	Merge "Add insert(TreeFormatter) to ObjectInserter"	2010-12-06 10:10:58 -05:00
Chris Aniszczyk	731f84559d	Merge "Add toByteArray to CommitBuilder, TreeBuilder"	2010-12-06 10:10:41 -05:00
Chris Aniszczyk	35d51d040c	Merge "Remove unused getTreeId from TreeFormatter"	2010-12-06 10:10:26 -05:00
Chris Aniszczyk	643de8323a	Merge "Remove result id from CommitBuilder, TagBuilder"	2010-12-06 10:09:59 -05:00
Jens Baumgart	cbf5ff6ac7	LockFile.commit: retry renaming Currently the following can happen in LockFile.commit: deletion of the original file succeeds but renaming fails afterwards. In this case the original file (e.g. branch file in refs/heads) is lost. To workaround the issue the same retry logic as for file deletion is applied to file renaming. Bug: 331890 Change-Id: I68620c07f2d3ab7f3279c71a91e184e8eac69832 Signed-off-by: Jens Baumgart <jens.baumgart@sap.com> Signed-off-by: Philipp Thun <philipp.thun@sap.com>	2010-12-06 13:40:07 +01:00
Chris Aniszczyk	90fbc1db3a	Merge "Honor GIT_SSH when opening SSH connections"	2010-12-05 20:14:46 -05:00
Chris Aniszczyk	f7a566c1aa	Add InitCommand Adds git-init support to the Git API. Change-Id: I1428b861f22cabe4d92cadf3d9114dddeec75b40 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-05 19:01:43 -06:00
Shawn O. Pearce	ed7e38b98d	Merge "Ensure stable tag sort in PlotWalk"	2010-12-05 18:10:12 -05:00
Chris Aniszczyk	ef11143ffe	Merge "Abstract SSH setup to support GIT_SSH"	2010-12-05 10:50:05 -05:00
Shawn O. Pearce	064ecc25ce	Fix findGitDir() with no ceiling directories Bug: 322866 Change-Id: I64205bb0315a725dfa523ccff1796de50f465162 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Ketan Padegaonkar <KetanPadegaonkar@gmail.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-05 15:42:57 +01:00
Matthias Sohn	c474813b0a	Merge "Correct CommitBuilder, TagBuilder method to be build()"	2010-12-05 08:19:58 -05:00
Robin Rosenberg	40c2f68382	Merge "Fix checking out large files"	2010-12-04 03:49:11 -05:00
Shawn O. Pearce	864091d982	Ensure stable tag sort in PlotWalk Because tags are more interesting here than local or remote branch heads, tags get sorted earlier in the array than heads or remotes do. Bug: 324939 Change-Id: Ifc3863461654df7f34fdecbd2abe1f4b5d2ffb8e Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Mathias Kinzler <mathias.kinzler@sap.com> CC: Stefan Lay <stefan.lay@sap.com>	2010-12-03 16:38:24 -08:00
Shawn O. Pearce	61db0e4787	Fix checking out large files DirCacheCheckout needs to use ObjectLoader.copyTo to avoid loading the complete content of a large file into the JVM heap. Bug: 321097 Change-Id: I967590b6f233fd1c83d873075db01d653208b3b9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Chris Aniszczyk <caniszczyk@gmail.com> CC: Christian Halstrick <christian.halstrick@sap.com>	2010-12-03 16:37:56 -08:00
Shawn O. Pearce	22e720ce77	Honor GIT_SSH when opening SSH connections If the environment variable GIT_SSH is set, use GIT_SSH for any remote protocol connections, instead of the local JSch library. Bug: 321062 Change-Id: Ia18ea49d58f3ed657430067f1f72ef788a2dae4c Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-03 16:33:46 -08:00
Shawn O. Pearce	04b289cc42	Abstract SSH setup to support GIT_SSH In order to honor GIT_SSH the TransportGitSsh class needs to run the process named by the GIT_SSH environment variable and use that as the pipes for connectivity to the remote peer. Refactor the current transport code to support a different type of pipe connectivity, so we can later add GIT_SSH. Bug: 321062 Change-Id: I9d8ee1a95f1bac5013b33a4a42dcf1f98f92172f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-03 16:14:46 -08:00
Matthias Sohn	6ca9fd2d95	Add missing license header Change-Id: Ibfd17951606f02283660befcff53ff9b73405dd9 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-03 22:37:46 +01:00
Shawn O. Pearce	8fd2335b70	Add debugging toString to TreeFormatter Displaying the current tree in the ls-tree style output makes it easier to see what entries are currently stored. Change-Id: If17c414db0d2e8d84e65de8bbcba7fd1b79aa311 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 13:11:39 -08:00
Shawn O. Pearce	8d4c95a645	Add insert(TreeFormatter) to ObjectInserter This makes usage of a TreeFormatter more similar to a CommitBuilder or a TagBuilder: populate the formatter and pass to the ObjectInserter. Change-Id: I5a45ef3a35cc73f4905a34bc6f6228510df8eb2c Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 13:03:12 -08:00
Shawn O. Pearce	9ad802c15b	Add toByteArray to CommitBuilder, TreeBuilder This better matches the existing API of TreeFormatter, but is just a simple delegation to build(). Change-Id: I188f43acc34455e773d63836724b05e18f5c7a84 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 12:57:41 -08:00
Shawn O. Pearce	807ee4797f	Remove unused getTreeId from TreeFormatter Change-Id: If5955757575d4c6053b6f8109e9dc2ecb0502446 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 12:47:37 -08:00
Shawn O. Pearce	cf52ef5531	Remove result id from CommitBuilder, TagBuilder These objects don't need to be updated with the resulting ObjectId of the formatted content, callers can get that from the ObjectInserter on their own. Change-Id: Idc5f097de9f7beafc5e54e597383d82daf9d7db4 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 12:38:31 -08:00
Shawn O. Pearce	f996fb1796	Correct CommitBuilder, TagBuilder method to be build() The correct names for these is build(), as that is what a Java developer will expect given the "builder" pattern. Bug: 323541 Change-Id: I35042bdc95a955beeaee29e54bde10e4240b2a71 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Reviewed-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-12-03 12:28:00 -08:00
Matthias Sohn	37001ddc8d	Fix jgit build broken by `deabacc4` Since `049827d7` MergeAlgorithm isn't static anymore. Change-Id: I3d704f663a776bb57e59f28a8200753fae5e9d25 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-12-03 09:24:31 +01:00
Chris Aniszczyk	39fe52ccc7	Merge "Rebase Interoperability first part: write "interactive" file"	2010-12-02 21:19:10 -05:00
Chris Aniszczyk	b5f9a9b4d3	Merge "Fixed Merge Algorithm regarding concurrent file creations"	2010-12-02 20:19:04 -05:00
Christian Halstrick	deabacc420	Fixed Merge Algorithm regarding concurrent file creations When in OURS and THEIRS a new file is created we want a conflict when the two contents differ. If on two branches the same file with the same content is created this should not be a conflict. But: the current merge algorithm is throwing NPEs in this case. Fix this by choosing an empty RawText as common base if the base is empty. Change-Id: I21cb23f852965b82fb82ccd66ec961c7edb3ac3d Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-12-02 13:15:59 +01:00
Shawn O. Pearce	e0a9961b78	Avoid unnecessary decoding of length in PackFile If the object type is a whole object and all we want is the type, there is no need to skip the length header. The type is already known and can be returned as-is. Instead skip the length header only for the two delta formats, where the delta base must itself be scanned. Change-Id: I87029258e88924b3e5850bdd6c9006a366191d10 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-12-01 09:59:55 -08:00
Shawn O. Pearce	d29b5db695	Remove unused 'shift' variable from PackFile This variable was not used for anything, but Eclipse's JDT failed to notice because of the "shift += " operation within the body of the while loop. Here we don't need the shift because we do not decode the length, but we do have to skip over the bytes that store the length to locate the delta base. Bug: 331319 Change-Id: I200a874fd7e39e3adf2640b8cd0f53dcf91ef4c9 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Remy Suen <remysuen@ca.ibm.com>	2010-12-01 09:57:16 -08:00
Mathias Kinzler	59e62ba7e1	Rebase Interoperability second part: fix "pop steps" If the CLI stops a rebase upon conflict, the current step is already popped from the git-rebase-todo and appended to the "done" file. The current implementation wrongly pops the step only after successful cherry-pick. Change-Id: I8640dda0cbb2a5271ecf75fcbad69410122eeab6 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-01 15:10:13 +01:00
Mathias Kinzler	7aa1b85821	Rebase Interoperability first part: write "interactive" file The Repository is then in state "Rebase interactive". Change-Id: I5d2de57f8670e1d4c71ed22509ab17f04e2561b5 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-12-01 15:08:07 +01:00
Stefan Lay	b4359cb829	Include list of assume unchanged files in IndexDiff The IndexDiff had not collected the info if the flag "assume-unchanged" is set. This information is useful for clients which may want to decide if specific actions are allowed on a file. Bug: 326213 Change-Id: I14bb7b03247d6c0b429a9d8d3f6b10f21d8ddeb1 Signed-off-by: Stefan Lay <stefan.lay@sap.com>	2010-11-30 10:51:21 -08:00
Stefan Lay	7bf0f5070e	Use the Set interface in declarations and as return value Change-Id: Ib273c4980036f75bd4dad3ffe1c29a37b2df932a Signed-off-by: Stefan Lay <stefan.lay@sap.com>	2010-11-30 11:05:42 +01:00
Shawn Pearce	a115b64f4b	Merge "Check assume unchanged flag in Add command"	2010-11-29 18:21:08 -05:00
Shawn Pearce	f968cbabcf	Merge "Fix DiffConfig to understand "copy" resp. "copies" for diff.renames property."	2010-11-29 17:59:15 -05:00
Stefan Lay	9225b88ae6	Check assume unchanged flag in Add command When the assume unchanged flag is set the Add command must not update the index for this file if any changes are present in the working directory. Bug: 331351 Change-Id: I255870f689225a1d88971182e0eb377952641b42 Signed-off-by: Stefan Lay <stefan.lay@sap.com>	2010-11-29 17:58:38 +01:00
Marc Strapetz	e147fbcd66	Fix DiffConfig to understand "copy" resp. "copies" for diff.renames property. Rename detection should be considered enabled if diff.renames config property is set to "copy" or "copies", instead of throwing IllegalArgumentException. Change-Id: If55d955e37235d4d00f5b0febd6aa10c0e27814e	2010-11-29 17:14:07 +01:00
Mathias Kinzler	12b6350435	RebaseCommand: trim line endings when reading files In order to enable interoperability with the command line, we need to remove line feeds when reading the files. Change-Id: Ie2f5799037a60243bb4fac52346908ff85c0ce5d Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-11-26 12:22:40 +01:00
Christian Halstrick	12a5c8d413	Change default diff algorithm to histogram and add tests The referenced bug showed that JGit produced different merge results compared to C Git. Unit test was added to reproduce the issue. The problem can be solved by switching to histogram diff algorithm. Bug: 331078 Change-Id: I54f30afb3a9fef1dbca365ca5f98f4cc846092e3 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Philipp Thun <philipp.thun@sap.com>	2010-11-26 00:44:05 +01:00
Christian Halstrick	049827d708	Make diff algorithm configurable The diff algorithm which is used by Merge, Cherry-Pick, Rebase should be configurable. A new configuration parameter "diff.algorithm" is introduced which currently accepts the values "myers" or "histogram". Based on this parameter for example the ResolveMerger will choose a diff algorithm. The reason for this is bug 331078. This bug shows that JGit is more compatible with C Git when histogram diff is in place. But since histogram diff is quite new we need an easy way to fall back to Myers diff. Bug: 331078 Change-Id: I2549c992e478d991c61c9508ad826d1a9e539ae3 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Philipp Thun <philipp.thun@sap.com>	2010-11-26 00:30:08 +01:00
Christian Halstrick	7e298c9ed5	Add more tests for rebase and externalized missing Strings Coverage tests showed that we are missing to test certain areas in the rebase command. Add the missing tests. Change-Id: Ia4a272d26cde7e1861dac30496e4b6799fc8187a Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-11-24 15:59:08 +01:00
Chris Aniszczyk	923443f94f	Add CheckoutCommand Add the ability to checkout a branch to the working tree. Bug: 330860 Change-Id: Ie06b9e799a9e1be384da0b8996efa7209b32eac3 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-22 15:53:35 -06:00
Matthias Sohn	34962b4700	Merge "Fix bug regarding handling of non-versioned files during merge"	2010-11-22 16:43:43 -05:00
Christian Halstrick	5adef23365	Fix bug regarding handling of non-versioned files during merge There was a bug introduced by commit `0e815fe`. For non-versioned files the merge algorithm detected an incoming deletion from THEIRS. Consequently such files were deleted. That's a severe bug which was fixed by more precisely detecting incoming deletions. Change-Id: I4385d3c990db11d62e371a385dc8ee89841db84a Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Philipp Thun <philipp.thun@sap.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-11-22 22:41:25 +01:00
Chris Aniszczyk	f7690cceef	Add RmCommand to Git API Bug: 330827 Change-Id: I0b74bb92254d0ee988139d25022d06d16ed89d58 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-22 11:02:28 -06:00
Mathias Kinzler	e5b96a7848	Initial implementation of a Rebase command This is a first iteration to implement Rebase. At the moment, this does not implement --continue and --skip, so if the first conflict is found, the only option is to --abort the command. Bug: 328217 Change-Id: I24d60c0214e71e5572955f8261e10a42e9e95298 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-22 09:58:36 -06:00
Shawn O. Pearce	bd98a0a9a5	Move WorkingTreeIterator inherited state into an object Instead of copying up to 4 fields from the parent iterator each time a child iterator is initialized and used, construct a single state object that contains the 4 fields, and pass that one state object through to the child. This makes it easier to add additional state fields that must be inherited, at the slight expense of an extra object allocation per TreeWalk, and an extra level of field indirection whenever the options, nameEncoder, or read buffer is required by the iterator. Change-Id: Ic4603c33b772d7a45f9c81140537d51945688fcb Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-18 17:06:12 -08:00
Shawn O. Pearce	3de186fbf0	Name TreeFilter and MergeFilter implementations Naming these inner classes ensures that stack traces which contain them will give us useful information about which filter is involved in the trace, rather than the generated names $1, $2, etc. This makes it much easier to understand a stack trace at a glance. Change-Id: Ia6a75fdb382ff6461e02054d94baf011bdeee5aa Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-18 16:50:14 -08:00
Chris Aniszczyk	2054c3fb8a	Add core.filemode to CoreConfig Let CoreConfig cache the value of core.filemode so clients like EGit can take advantage of it. Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-14 18:54:36 -06:00
Christian Halstrick	da1ea27fa2	Fixed checkouts when HEAD is ignored In the case where DirCacheCheckout was used to checkout a tree without taking HEAD into account (e.g. during a clone or hard reset) we didn't handle conflicts correctly. E.g. if there are conflicts (entries with stage != 0) in the index and we tried to hard reset we have been processing the conflicting pathes multiple times (once for every stage). With this fix we will update the index with the entry from the "merge" state (the state we want checkout) when we detect existing conflicts. Change-Id: Iffbddccaa588cf0d1460a5e44dabaf540d996e26 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-11-13 11:42:13 -06:00
Chris Aniszczyk	952c4e1f3d	Merge "Base64: Reformat to match JGit style"	2010-11-13 12:40:05 -05:00
Chris Aniszczyk	07cabc8c6f	Merge "Base64: Strip out code JGit doesn't use"	2010-11-13 12:39:48 -05:00
Chris Aniszczyk	f638679797	Merge "Remove unnecessary note fanout when removing notes"	2010-11-13 12:38:17 -05:00
Chris Aniszczyk	1b3abe75f8	Merge "Split note leaf buckets at 256 elements"	2010-11-13 12:37:30 -05:00
Chris Aniszczyk	9f2bde653f	Merge "Add internal API for note iteration"	2010-11-13 12:32:59 -05:00
Chris Aniszczyk	e9002a45ce	Merge "Allow writing a NoteMap back to the repository"	2010-11-13 12:31:58 -05:00
Chris Aniszczyk	56a802104a	Merge "Add in-memory updating support to NoteMap"	2010-11-13 12:31:02 -05:00
Chris Aniszczyk	43156bf045	Merge "Remember non-note tree entries when reading"	2010-11-13 12:29:31 -05:00
Shawn O. Pearce	51bf8ea2a4	Merge branch 'rename-detection' * rename-detection: RenameDetector: Only scan deletes if adds exist SimilarityRenameDetector: Initialize sizes to 0 SimilarityRenameDetector: Avoid allocating source index SimilarityRenameDetector: Only attempt to index large files once SimilarityIndex: Don't overflow internal counter fields SimilarityIndex: Accept files larger than 8 MB SimilarityIndex: Correct comment explaining the logic	2010-11-12 16:15:43 -08:00
Shawn O. Pearce	c35f98b226	Merge branch 'fs-fsync' * fs-fsync: Remove unnecessary flush calls from LockFile Remove unnecessary region locking from LockFile Support core.fsyncRefFiles option Support core.fsyncObjectFiles option Simplify LockFile write(ObjectId) case	2010-11-12 16:12:27 -08:00
Shawn O. Pearce	ef70a12fd1	Base64: Reformat to match JGit style Rewrite the initialization of the encoding tables to be more clear, but slightly slower to setup. We generally perfer a clear definition of the data over a slightly slower class load time. Change-Id: I0c7f89b6ab82dcf71525ffb69a388c312c195913 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 16:05:00 -08:00
Shawn O. Pearce	d2ce91199e	Base64: Strip out code JGit doesn't use Since we have already modified this class to localize an error message, we might as well strip it down to contain only the functionality we need, or might ever use. To keep this simple to review we don't adjust formatting right away, so code that was buried inside of an if or else block whose condition was removed might not have the correct indentation anymore. We can fix this with a later reformatting change. Change-Id: I2996aaa704e9d6182e5500c7a63240d5e9d722cc Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 16:01:05 -08:00
Christian Halstrick	484807e82b	Added one-tree constructor to DirCacheCheckout When DirCacheCheckout should be used to checkout only one tree (reset --hard, clone) then we had to use the standard constructor and specify null as value for head. This change adds explicit constructors not taking HEAD and documents that. Bug: 330021 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-11-13 00:45:50 +01:00
Shawn O. Pearce	e7e9a47b52	Remove unnecessary note fanout when removing notes Fanout level notes trees are combined back together into a flat leaf level tree if during a removal of a subtree there are less than 3/4 of the fanout subtrees still existing, and the size of the combined leaf is under the 256 split limit noted above. This rule is used because deletes are less common than insertions, and SHA-1's relatively uniform distribution suggests that with only 192 subtrees existing in the fanout, there should be approximately 192 names in the combined replacement leaf tree. Change-Id: Ia9d145ffd5454982509fc40906bc4dbbf2b13952 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 14:01:28 -08:00
Shawn O. Pearce	2b0df15f7f	Split note leaf buckets at 256 elements Leaf level notes trees are split into a new fan-out tree if an insertion occurs and the tree already contains >= 256 notes in it. The splitting may occur multiple times if all of the notes have the same prefix; in the worst case this produces a tree path such as "00/00/00/00/00/00/00/00/00/00/00/00/00/00/00/00/00/00/00/be" if all of the notes begin with zeros. Change-Id: I2d7d98f35108def9ec49936ddbdc34b13822a3c7 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 14:01:28 -08:00
Shawn O. Pearce	3728918d72	Add internal API for note iteration Some algorithms need to be able to iterate through all notes within a particular bucket, such as when splitting or combining a bucket. Exposing an Iterator<Note> makes this traversal possible. For a LeafBucket the iteration is simple, its over the sorted array of elements. For FanoutBucket its a bit more complex as the iteration needs to union the iterators of each fanout bucket, lazily loading any buckets that aren't already in-memory. Change-Id: I3d5279b11984f44dcf0ddb14a82a4b4e51d4632d Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 14:01:28 -08:00
Shawn O. Pearce	3e2b9b691e	Allow writing a NoteMap back to the repository This is necessary to allow applications to wrap the note tree in a commit and update the note branch with the new state. Change-Id: Idbd7ead4a1b16ae2b64a30a4a01a29cfed548cdf Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 14:01:28 -08:00
Shawn O. Pearce	faa0747cce	Add in-memory updating support to NoteMap NoteMap now supports editing in-memory, allowing applications to modify the NoteMap once it has been loaded from the branch. The ability to write the branch back to tree objects is not yet done, so the edits are strictly transient. Change-Id: I63448954abfca2a8e3e95369cd84c0d1176cdb79 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 14:01:24 -08:00
Shawn O. Pearce	2f6e79307d	Remove unnecessary flush calls from LockFile Change-Id: I144af9db4714acabd796880be73bd50d84b92efe Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 13:38:13 -08:00
Shawn O. Pearce	ed5fe8af9a	Remove unnecessary region locking from LockFile The lock file protocol relies on the atomic creation of a standardized name in the parent directory of the file being updated. Since the creation is atomic, at most one thread in any process can succeed on this creation, and all others will fail. While the lock file exists, that file is private to the thread that is writing it, and no others will attempt to read or modify the file. Consequently the use of the region level locks around the file are unnecessary, and may actually reduce performance when using NFS, SMB, or some other sort of remote filesystem that supports locking. Change-Id: Ice312b6fb4fdf9d36c734c3624c6d0537903913b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 13:38:06 -08:00
Shawn O. Pearce	e0e7fe531d	Support core.fsyncRefFiles option If core.fsyncRefFiles is set to true, fsync is used whenever a reference file is updated, ensuring the file contents are also written to disk. This can help to prevent empty ref files after a system crash when using a filesystem such as HFS+ where data writes may be delayed. Change-Id: Ie508a974da50f63b0409c38afe68772322dc19f1 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 13:38:04 -08:00
Shawn O. Pearce	24fccadeda	Support core.fsyncObjectFiles option Some repositories may be on really unstable filesystems, but still want to have good reliability when objects are written to disk. If core.fsyncObjectFiles is set to true, request the JVM to ensure the data is written before returning success to the caller of insert. The option defaults to false because it should be useless on any filesystem that orders writes and metadata, such as ext3 mounted with data=ordered (or data=journal). But it may be useful on some systems (especially HFS+) where file content may flush to the disk independently of filesystem structure changes. Because FileChannel.force(boolean) only claims to ensure data is written if it was written using the write(ByteBuffer) method of FileChannel, redirect all writes when using fsyncObjectFiles to go through the FileChannel interface instead of through the older style OutputStream interface. This may not be necessary on all JVMs, but its more portable to follow the definition than the common behavior. Change-Id: I57f6b6bb7e403c07fbae989dbf3758eaf5edbc78 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 13:37:27 -08:00
Shawn O. Pearce	bc9bca064d	RenameDetector: Only scan deletes if adds exist If there are only deletes, don't need perform rename or copy detection. There are no adds (aka destinations) for the deletes to match against. Change-Id: I00fb90c509fa26a053de561dd8506cc1e0f5799a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:57:02 -08:00
Shawn O. Pearce	05653bda04	SimilarityRenameDetector: Initialize sizes to 0 Setting the array elements to -1 is more expensive than relying on the allocator to zero the array for us first. Shifting the code to always add 1 to the size (so an empty file is actually 1 byte long) allows us to detect an unloaded size by comparing to 0, thus saving the array fill calls. Change-Id: Iad859e910655675b53ba70de8e6fceaef7cfcdd1 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:57:02 -08:00
Shawn O. Pearce	68baa3097e	SimilarityRenameDetector: Avoid allocating source index If the only file added is really small, and all of the deleted files are really big, none of the permutations will match up due to the sizes being too far apart to fit the current rename score. Avoid allocating the really big deleted SimilarityIndex by deferring its construction until at least one add along that row has a reasonable chance of matching it. This avoids expending a lot of CPU time looking at big deleted binary files when a small modified text file was broken due to a high percentage of changed lines. Change-Id: I11ae37edb80a7be1eef8cc01d79412017c2fc075 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:57:02 -08:00
Shawn O. Pearce	918e6e20f0	SimilarityRenameDetector: Only attempt to index large files once If a file fails to index the first time the loop encounters it, the file is likely to fail to index again on the next row. Rather than wasting a huge amount of CPU to index it again and fail, remember which destination files failed to index and skip over them on each subsequent row. Because this condition is very unlikely, avoid allocating the BitSet until its actually needed. This keeps the memory usage unaffected for the common case. Change-Id: I93509b28b61a9bba8f681a7b4df4c6127bca2a09 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:57:02 -08:00
Shawn O. Pearce	0e307a6afd	SimilarityIndex: Don't overflow internal counter fields The counter portion of each pair is only 32 bits wide, but is part of a larger 64 bit integer. If the file size was larger than 4 GB the counter could overflow and impact the key, changing the hash, and later resulting in an incorrect similarity score. Guard against this overflow condition by capping the count for each record at 2^32-1. If any record contains more than that many bytes the table aborts hashing and throws TableFullException. This permits the index to scan and work on files that exceed 4 GB in size, but only if the file contains more than one unique block. The index throws TableFullException on a 4 GB file containing all zeros, but should succeed on a 6 GB file containing unique lines. The index now uses a 64 bit accumulator during the common scoring algorithm, possibly resulting in slower summations. However this index is already heavily dependent upon 64 bit integer operations being efficient, so increasing from 32 bits to 64 bits allows us to correctly handle 6 GB files. Change-Id: I14e6dbc88d54ead19336a4c0c25eae18e73e6ec2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:57:02 -08:00
Shawn O. Pearce	d63887127e	SimilarityIndex: Accept files larger than 8 MB Files bigger than 8 MB (2^23 bytes) tended to overflow the internal hashtable, as the table was capped in size to 2^17 records. If a file contained 2^17 unique data blocks/lines, the table insertion got stuck in an infinite loop as the able couldn't grow, and there was no open slot for the new item. Remove the artifical 2^17 table limit and instead allow the table to grow to be as big as 2^30. With a 64 byte block size, this permits hashing inputs as large as 64 GB. If the table reaches 2^30 (or cannot be allocated) hashing is aborted. RenameDetector no longer tries to break a modify file pair, and it does not try to match the file for rename or copy detection. Change-Id: Ibb4d756844f4667e181e24a34a468dc3655863ac Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:56:59 -08:00
Shawn O. Pearce	f3b511568b	SimilarityIndex: Correct comment explaining the logic This comment was wrong, due to a copy-and-paste error. Here the code is looking at records of dst that do not exist in src, and are skipping past them to find another match. Change-Id: I07c1fba7dee093a1eeffcf7e0c7ec85446777ffb Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-12 11:56:57 -08:00
Shawn Pearce	e8315ce19d	Merge "Fix null ref exception in DirCacheCheckout"	2010-11-12 11:29:32 -05:00
Shawn O. Pearce	5a2cbd4aa7	Remember non-note tree entries when reading In order to safely edit a notes tree, NoteMap needs to retain any non-note tree entries it read from the source tree and put them back out into the modified tree when it commits a new version of the note branch. Remember any tree entries that didn't look like a note during the parsing of the tree, so they can be put into a TreeFormatter later when the tree writes to the repository. Change-Id: Ia284af7e7866da35db35374c6c5869f00c857944 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-11 10:57:16 -08:00
Shawn O. Pearce	b81b97fbdd	Lazy load note subtrees from fanout levels Instead of reading a note tree recursively up front when the NoteMap is loaded, read only the root tree and load subtrees on demand when they are accessed by the application. This gives a lower latency to read a note for the recent commits on a branch, as only the paths that are needed get read. Given a 2/38 style fanout, the tree will fully load when 256 objects have been accessed by the application. But unlike the prior version of NoteMap, the NoteMap will load faster and answer lookups sooner, as the loading time for all 256 levels is spread out across each of the get() requests. Given a 2/2/36 style fanout, the tree won't need to fully load until about 65,536 objects are accessed. To simplify the implementation we only support the flat layout (all notes in the top level tree), or a 2/38, 2/2/36, 2/2/2/34, through 2/.../2 style fanout. Unlike C Git we don't support reading the old experimental 4/36 fanout. This is sufficient because C Git won't create the 4/36 style fanout when creating or updating a notes tree, and there really aren't any in the wild today. Change-Id: I6099b35916a8404762f31e9c11f632e43e0c1bfd Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-11 10:23:38 -06:00
Shawn O. Pearce	936820988f	Define NoteMap, a simple note tree reader The NoteMap makes it easy to read a small notes tree as created by the `git notes` command in C Git. To make the initial implementation simple a notes tree is read recursively into a map in memory. This is reasonable if the application will need to access all notes, or if there are less than 256 notes in the tree, but doesn't behave well when the number of notes exceeds 256 and the application doesn't need to access all of them. We can later add support for lazily loading different subpaths, thus fixing the large note tree problem described above. Currently the implementation only supports reading. Writing notes is more complex because trees need to be expanded or collapsed at the exact 256 entry cut-off in order to retain the same tree SHA-1 that C Git would use for the same content. It also needs to retain non-note tree entries such as ".gitignore" or ".gitattribute" files that might randomly appear within a notes tree. We can also add writing support later. Change-Id: I93704bd84ebf650d51de34da3f1577ef0f7a9144 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-11 10:06:43 -06:00
Chris Aniszczyk	6043d4638c	Merge "Add MutableObjectId setByte to modify a mutable id"	2010-11-11 10:52:37 -05:00
Chris Aniszczyk	573666403d	Merge "Support CredentialsProvider for SSH connections"	2010-11-11 10:27:52 -05:00
Stefan Lay	33c419fdfe	Merge "Define a default CredentialsProvider"	2010-11-11 09:36:34 -05:00
Stefan Lay	dcac1fe4bf	Merge "Enable providing credentials for HTTP authentication"	2010-11-11 09:35:43 -05:00
Chris Aniszczyk	9e28cf2fa3	Merge "Add ObjectId getByte for random access"	2010-11-10 18:00:36 -05:00
Shawn O. Pearce	d279bc83b0	Support CredentialsProvider for SSH connections When setting up an SSH connection, use the caller supplied CredentialsProvider, if one has been given to the Transport or was defined as the default. The CredentialsProvider is re-wrapped as a JSch UserInfo, allowing the connection to use this for user interactive prompts. This give a unified API for authentication on any transport type. Change-Id: Id3b4cf5bfd27a23207cdfb188bae3b78e71e02c0 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-10 15:00:13 -08:00
Shawn O. Pearce	ce99b48384	Define a default CredentialsProvider This permits applications to set their preferred credentials UI implementation once, rather than needing to define it on every single Transport instance they open. Change-Id: I010550de1a6becab27f7aa5a9901df5a1c7e74bd Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-10 14:58:45 -08:00
Shawn O. Pearce	308e074f65	Enable providing credentials for HTTP authentication This change is based on http://egit.eclipse.org/r/#change,1652 by David Green. The change adds the concept of a CredentialsProvider which can be registered for git transports and which is responsible to return credential-related data like passwords and usernames. Whenenver the transports detects that an authentication with certain credentials has to be done it will ask the CredentialsProvider for this data. Foreseen implementations for such a Provider may be a EGitCredentialsProvider (caching credential data entered e.g. in the Clone-Wizzard) or a NetRcProvider (gathering data out of ~/.netrc file). Bug: 296201 Change-Id: Ibe13e546b45eed3e193c09ecb414bbec2971d362 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com> Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Stefan Lay <stefan.lay@sap.com> Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: David Green <dgreen99@gmail.com>	2010-11-10 14:58:44 -08:00
Chris Aniszczyk	453b620e62	Merge "Refactor tree entry formatting into a common class"	2010-11-10 17:53:35 -05:00
Lluis Sanchez	3b4dcb3c02	Fix null ref exception in DirCacheCheckout Added missing null check for getDirCacheEntry(). This method may return null for example if the curernt entry is a subtree.	2010-11-10 10:56:46 +01:00
Stefan Lay	20a5a34444	Fix WWW-Authenticate auth-scheme comparison The auth-scheme token (like "Basic" or "Digest") is not specified in a case sensitive way. RFC2617 (http://tools.ietf.org/html/rfc2617) specifies in section 1.2 the use of a "case-insensitive token to identify the authentication scheme". Jetty, for example, uses "basic" as token. Change-Id: I635a94eb0a741abcb3e68195da6913753bdbd889 Signed-off-by: Stefan Lay <stefan.lay@sap.com>	2010-11-10 09:42:51 +01:00
Shawn O. Pearce	cfa3f365d6	Simplify LockFile write(ObjectId) case The ObjectId (for a ref) can be easily reformatted into a temporary byte[] and then passed off to write(byte[]), removing the duplicated code that existed in both write methods. Change-Id: I09740658e070d5f22682333a2e0d325fd1c4a6cb Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-09 19:13:13 -08:00
Matthias Sohn	ab7d08ec96	Merge "Revert "[findBugs] Silence DM_STRING_CTOR on PacketLineIn""	2010-11-09 18:18:41 -05:00
Shawn O. Pearce	6af7e4d91a	Fix URIish parsing of absolute scp-style URIs We stopped handling URIs such as "example.com:/some/p ath", because this was confused with the Windows absolute path syntax of "c:/path". Support absolute style scp URIs again, but only when the host name is more than 2 characters long. Change-Id: I9ab049bc9aad2d8d42a78c7ab34fa317a28efc1a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-09 14:36:01 -08:00
Shawn Pearce	b087bba3bd	Merge "Format merge commit messages like C Git"	2010-11-09 17:14:11 -05:00
Shawn O. Pearce	08a9682e32	Revert "[findBugs] Silence DM_STRING_CTOR on PacketLineIn" This reverts commit `1e510ec20e`. Instead work around the warning by defining our constant by constructing it through a StringBuilder. Change-Id: If139509e769d649609c62eff359ebaea5dd286b2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> CC: Matthias Sohn <matthias.sohn@sap.com> CC: Chris Aniszczyk <caniszczyk@gmail.com>	2010-11-08 15:34:47 -08:00
Shawn Pearce	6ed0501346	Merge "IndexDiff: support state [removed, untracked]"	2010-11-08 18:32:45 -05:00
Jens Baumgart	2dc2dd8b1b	IndexDiff: support state [removed, untracked] IndexDiff was extended to detect files which are both removed from the index and untracked. Before this change these files were only added to the removed collection. Change-Id: I971d8261d2e8932039fce462b59c12e143f79f90 Signed-off-by: Jens Baumgart <jens.baumgart@sap.com> Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-08 15:32:03 -08:00
Shawn Pearce	09555c9853	Merge "Make Repository.shortenRefName static"	2010-11-08 17:42:21 -05:00
Matthias Sohn	220cd43482	[findBugs] Fix NP_LOAD_OF_KNOWN_NULL_VALUE The code analyzer can't know that passing a value known to be null is not a problem. Hence better pass null explicitly instead of the parameters being null. Change-Id: I8db6f8014de6c00dd95974d60f61ecc66191e6d4 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-11-07 20:16:15 +01:00
Christian Halstrick	0e815fe8c5	Fixed ResolveMerger regarding handling of deletions There was a bug in ResolveMerger which is one reason for bug 328841. If a merge was failing because of conflicts deletions where not handled correctly. Files which have to be deleted (because there was a non-conflicting deletion coming in from THEIRS) are not deleted. In the non-conflicting case we also forgot to delete the file but in this case we explicitly checkout in the end these files get deleted during that checkout. This is fixed by handling incoming deletions explicitly. Bug: 328841 Change-Id: I7f4c94ab54138e1b2f3fcdf34fb803d68e209ad0 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-11-07 14:59:01 +01:00
Robin Stocker	6290ca3a63	Format merge commit messages like C Git The automatically generated commit message of a merge should have the same structure as in C Git for consistency (as per git fmt-merge-msg). Before this change: merging refs/heads/a into refs/heads/master After: Merge branch 'a' Plurals, "into" and joining by "," and "and" also work. Change-Id: I9658ce2817adc90d2df1060e8ac508d7bd0571cb	2010-11-06 13:48:11 +01:00
Robin Stocker	2fb0f5cfc0	Make Repository.shortenRefName static The method has no reason to be non-static. Change-Id: I1c09e074395d49cee0e6e53679b499d1f0c351ea	2010-11-06 13:41:06 +01:00
Shawn O. Pearce	e488f1cacd	Add MutableObjectId setByte to modify a mutable id This mirrors the getByte() API in ObjectId and allows the caller to modify a single byte, which is useful when updating it as part of a loop walking through 0x00..0xff inside of a range of objects. Change-Id: I57fa8420011fe5ed5fc6bfeb26f87a02b3197dab Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-04 19:12:13 -07:00
Shawn O. Pearce	b22a4e8488	Add ObjectId getByte for random access Processing git notes requires random access to part of the raw data of each ObjectId... which isn't easy because ObjectIds are stored with an internal representation of 5 ints. Expose random access to the individual data bytes through new methods, avoiding the need to convert first to a byte[20]. Change-Id: I99e64700b27fc0c95aa14ef8ad46a0e8832d4441 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-04 19:12:13 -07:00
Shawn O. Pearce	c27e1daa55	Refactor tree entry formatting into a common class Instead of hiding this logic inside of DirCacheTree and the legacy Tree type, pull it into a common place where we can reuse it by creating tree records in a buffer that can be passed directly into the ObjectInserter. This allows us to avoid some copying, as the inserter can be given the internal buffer of the formatter. Because we trust these two callers to feed us records in the proper order, without '/' in the names, and without duplicate names in the same tree, we don't do any validation inside of the formatter itself. To protect themselves from making ordering errors, developers should continue to use DirCache to process edits to source code trees. Change-Id: Idf7f10e736d4a44ccdf8afe060535d7b0554a92f Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-04 19:12:13 -07:00
Christian Halstrick	99771f04bc	Fixed merge algorithm regarding adjacent modifications JGit merge algorithm behaved differently from C Git when we had adjacent modifications. If line 9 was modified by OURS and line 10 by theirs then C Git will return a conflict while JGit was seeing this as independent modifications. This change is not only there to achieve compatibility, but there where also some really wrong merge results produced by JGit in the area of adjacent modifications. Change-Id: I8d77cb59e82638214e45b3cf9ce3a1f1e9b35c70 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-11-02 18:29:35 +01:00
Shawn O. Pearce	aa09599a3d	Fix ugly diff showing insertion of new method When adding a new method near the end of the sequence we want to show the full method inserted, and not tear the prior method due to the common trailing curly brace being consumed as part of the common end region of the sequences. Bug: 328895 Change-Id: I233bc40445fb5452863f5fb082bc3097433a8da6 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-01 14:10:00 -07:00
Shawn O. Pearce	b88b693a3d	Fix broken HistogramDiff HistogramDiff failed on cases where the initial element for the LCS was actually very common (e.g. has 20 occurrences), and the first element of the inserted region after the LCS was also common but had fewer occurrences (e.g. 10), while the LCS also contained a unique element (1 occurrence). This happens often in Java source code. The initial element for the LCS might be the empty line ("\n"), and the inserted but common element might be "\t/\n", with the LCS being a large span of lines that contains unique method declarations. Even though "/" occurs less often than the empty line its not a better LCS if the LCS we already have contains a unique element. The logic in HistogramDiff would normally have worked fine, except I tried to optimize scanning of B by making tryLongestCommonSequence return the end of the region when there are matching elements found in A. This allows us to skip over the current LCS region, as it has already been examined, but caused us to fail to identify an element that had a lower occurrence count within the region. The solution used here is to trade space-for-time by keeping a table of A positions to their occurrence counts. This allows the matching logic to always use the smallest count for this region, even if the smallest count doesn't appear on the initial element. The new unit test testEdit_LcsContainsUnique() verifies this new behavior works as expected. Bug: 328895 Change-Id: Id170783b891f645b6a8cf6f133c6682b8de40aaf Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-11-01 14:08:45 -07:00
Shawn O. Pearce	33ae28b482	Correct typo in HistogramDiffIndex Javadoc Change-Id: I8bd2e81fcc14aa86919c504f1d0001944dea50b2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-29 17:02:26 -07:00
Shawn Pearce	a434172079	Merge "Remove two "Dead store to local variable" warnings"	2010-10-29 10:49:45 -04:00
Shawn Pearce	1572c9f4a7	Merge "Use entrySet() instead of keySet()"	2010-10-29 10:48:18 -04:00
Shawn Pearce	6bddae5775	Merge "Use readFully() instead of read()"	2010-10-29 10:47:26 -04:00
Shawn Pearce	a8140e3c16	Merge "Use Character.valueOf instead of new Character"	2010-10-29 10:43:55 -04:00
Shawn Pearce	68f6c49ee2	Merge "Remove unnecessary null check"	2010-10-29 10:43:32 -04:00
Robin Stocker	8cbed3462e	Make private final field static It's used as a constant. Change-Id: Ic267e8cb5b62228de15e134cd80725df592a0171	2010-10-29 15:27:10 +02:00
Robin Stocker	d36c80fd04	Remove unnecessary null check The field monitor is never null, it's a NullProgressMonitor when not explicitly set. Change-Id: I8ce703a32c28ce5c3455efeb7ed5f5c9a443cbef	2010-10-29 15:12:48 +02:00
Robin Stocker	b52df1839a	Use Character.valueOf instead of new Character Otherwise a new Character is allocated each time instead of using the cache. Change-Id: I648d0b012f66ba9dc46a37a390986f9c61e5a19c	2010-10-29 15:04:27 +02:00
Robin Stocker	96bea14c7b	Use readFully() instead of read() Fixes the "Method ignores results of InputStream.read()" warning. This is the only place where read() was used instead of readFully() and the return value was not checked. So it was either an oversight or should be documented. This change assumes it was an oversight. Change-Id: I859404a7d80449c538a552427787f3e57d7c92b4	2010-10-29 14:52:52 +02:00
Robin Stocker	3b44b22609	Use entrySet() instead of keySet() The value was accessed every time in the loop body with get(), so use the more efficient entrySet(). Change-Id: I91d90cbd0b0d03ca4a3db986c58b8d80d80f40a4	2010-10-29 14:41:39 +02:00
Robin Stocker	3f78650c9a	Remove two "Dead store to local variable" warnings Change-Id: I950de82db15c4610dc5a94f304279971daef971e	2010-10-29 14:37:42 +02:00
Shawn Pearce	7f939ba86e	Merge "Fix Severe Bug in Merge Algorithm"	2010-10-28 15:54:36 -04:00
Christian Halstrick	beeb1f6d08	Fix Severe Bug in Merge Algorithm As described in Bug 328551 there was a bug that the merge algorithm was not always reporting conflicts when the same line was deleted and modified. This problem was introduced during commit `0c017188b4` when reported conflicts have been checked for common pre- and suffixes. This was fixed here by better determining whether after stripping off common prefixes and suffixes from a conflicting region there is still some conflicting part left. I also added a unit test to test this situation. Bug: 328551 Change-Id: Iec6c9055d00e5049938484a27ab98dda2577afc4 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-28 21:41:42 +02:00
Mathias Kinzler	7668a46282	PullCommand: support upstream configuration for local branches When creating a local branch based on another local branch, the upstream configuration contains "." as origin and the source branch as "merge". The PullCommand should support this by skipping the fetch step altogether and use the base branch to merge with. Change-Id: I260a1771aeeffca5b0161d1494fd63c672ecc2a6 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-10-28 09:18:02 -07:00
Shawn Pearce	70b8a470e7	Merge "Make AbbreviatedObjectId serializable"	2010-10-28 11:47:06 -04:00
Robin Stocker	80a4ea95e4	Make AbbreviatedObjectId serializable AmbiguousObjectException contains an AbbreviatedObjectId and is supposed to be serializable, so it should be serializable as well. Change-Id: I8056e78aee20fdd3cb9600b52cd8ed988544293d	2010-10-28 17:40:15 +02:00
Robin Stocker	db35d91fa6	Fix oddness check in MyersDiff for negative numbers It's probably not possible that these numbers are negative in the algorithm, but it's cleaner this way and gets rid of three more FindBugs warnings. Change-Id: Ifbce4e2c787fb9a7cd309c605e8d86211ef8a352	2010-10-28 17:37:21 +02:00
Shawn O. Pearce	79ca8a2d19	Merge "Call ProgressMonitor.update() from main thread"	2010-10-27 11:37:55 -04:00
Shawn O. Pearce	bdf535de4f	Call ProgressMonitor.update() from main thread Don't permit transient worker threads to access the underlying output stream of a ProgressMonitor, as they might get marked as the stream's writer thread. Instead proxy update events from the workers back onto the application's real work thread. This ensures that the stream only sees a single thread, and its the thread that will remain alive for the entire life cycle of the operation. This fixes IOException("Write end dead") during local repository fetch when threaded delta search is enabled. One of the transient delta search threads became the designated writer for the pipe, and when it terminated the reader end thought the writer was dead, even though the main writer thread was still executing in PackWriter. Bug: 326557 Change-Id: I01d1b20a3d7be1c0b480c7fb5c9773c161fe5c15 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-27 08:37:31 -07:00
Christian Halstrick	2c38e5d461	Prevent endless loop of events fired by RefsDirectory RefsDirectory fires a RefsChangedEvent when it detect that one ref changed (new, modified, deleted). But there was a potential of wrong events beeing fired leading to a endless loop in EGit. Problem is that when calling getRefs(ALL) we don't want to report additional refs and by that we remove the additional refs from the list of "refs reported upwards last time". We fire an RefsChangedEvent because we think that the special refs are not there anymore. I fixed this by removing eventing for the additional refs. Another alternative would be to always scan also for additional refs and put them in the list of refs. But getRefs(ALL) would then remove the additional refs again. I didn't do that for performance reasons and also because I am not sure whether we want evnting for additional refs. Change-Id: Icb9398b55a8c6bbf03e38f6670feb67754ce91e0 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-27 10:52:42 +02:00
Lluis Sanchez	07cae6e6c1	Optimize DirCacheCheckout When checking out a tree, files that are identical to the file in the current index and working directory don't need to be updated. Change-Id: I9e025a53facd42410796eae821baaeff684a25c5 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-26 13:27:40 -05:00
Christian Halstrick	c234860765	Merge "Allow setting a filter in IndexDiff"	2010-10-25 08:37:59 -04:00
Jens Baumgart	6f3b089188	Allow setting a filter in IndexDiff IndexDiff now allows to set an additional filter. This can be used e.g. for restricting the tree walk to a given set of files. Change-Id: I642de17e74b997fa0c5878c90631f6640ed70bdd Signed-off-by: Jens Baumgart <jens.baumgart@sap.com>	2010-10-25 13:00:13 +02:00
Christian Halstrick	a4f7992dfb	Add support for special symref FETCH_HEAD and MERGE_HEAD The RefDirectory class was not returning FETCH_HEAD and MERGE_HEAD when trying to get all refs via getRefs(RefDatabase.ALL). This fix adds constants for FETCH_HEAD and ORIG_HEAD and adds a new getter getAdditionalRefs() to get these additional refs. To be compatible with c git the getRefs(ALL) method will not return FETCH_HEAD, MERGE_HEAD and ORIG_HEAD. Change-Id: Ie114ca92e9d5e7d61d892f4413ade65acdc08c32 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-25 00:36:16 +02:00
Matthias Sohn	8067197049	[findbugs] Fix illegal format specifier For integral arguments the precision is not applicable, would cause a runtime exception when executed, see http://download.oracle.com/javase/1.5.0/docs/api/java/util/Formatter.html#syntax Change-Id: I4738c64c1153a8d4ef5430e15d0fe54f0a37949f Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-23 23:56:06 +02:00
Matthias Sohn	ffc010fda4	[findbugs] Static comparator made final Fixing FindBugs warning MS_SHOULD_BE_FINAL. Change-Id: Ic69e6f6425e0a8950ce809eb3894f48a33e860aa Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-23 22:25:29 +02:00
Shawn O. Pearce	d00420ae6e	Make ObjectDirectory getPacks() work the first time If an object hasn't been accessed yet the pack list for a repository may not have been scanned from disk. If an application (e.g. the dumb transport servlet support code) asks for the pack list for an ObjectDirectory, we should load it immediately. Change-Id: I93d7b1bca422d905948e8e83b2afa83c8894a68b Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-20 00:51:44 +02:00
Shawn O. Pearce	e51e06946f	Update CachedObjectDirectory when inserting objects If an ObjectInserter is created from a CachedObjectDirectory, we need to ensure the cache is updated whenever a new loose object is actually added to the loose objects directory, otherwise a future read from an ObjectReader on the CachedObjectDirectory might not be able to open the newly created object. We mostly had the infrastructure in place to implement this due to the injection of unpacked large deltas, but we didn't have a way to pass the ObjectId from ObjectDirectoryInserter to CachedObjectDirectory, because the inserter was using the underlying ObjectDirectory and not the CachedObjectDirectory. Redirecting to CachedObjectDirectory ensures the cache is updated. Change-Id: I1f7bdfacc7ad77ebdb885f655e549cc570652225 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-17 23:10:47 -07:00
Shawn O. Pearce	f5e5b98c3a	IndexPack: Make translated progress messages non-static These messages may need to change depending on the current thread's configured locale, and thus cannot be static. Change-Id: I96751a63852ec9c4bf6c47edadcf8752700543df Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-17 22:21:38 -07:00
Christian Halstrick	609c6dc358	Fix possible NPE in DirCacheCheckout There was a chance that we hit a NPE which doing a checkout with DirCacheCheckout when there is no HEAD (e.g. initial checkout). This is fixed here. Change-Id: Ie3b8cae21dcd90ba8352823ea94a700f8ee9221a Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-18 00:19:13 +02:00
Christian Halstrick	9b4876cedf	Add Cherry-Pick command Implemented the initial version of a cherry-pick command. A correct error handling is missing (what happens if the checkout fails, the cherry-pick leads to conflicts etc). But straightforward cherry-picks works. Change-Id: I235c0eb3a7a2d5bdfe40400f1deed06f29d746e1 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-15 16:38:34 +02:00
Shawn O. Pearce	4c7e100910	Add getString utility functions to RawText These routines can be useful when debugging, because we can add an expression to the Eclipse "Expressions" panel to show the text that appears on a line. Gerrit Code Review also uses these in its own subclass of RawText in order to format patch files, so pulling it up to be part of core JGit may help other applications too. Change-Id: I20a6b112e3403ecfc1c2715ae75dcecc1a85b167 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-13 20:50:44 -07:00
Shawn O. Pearce	3f3b6bfdb3	Remove dead RawText(RawTextComparator) constructor Since the introduction of HashedSequence we no longer need to supply the RawTextComparator at the time of constructing a RawText. Drop the definition from the constructor, because it doesn't make sense as part of our public API. Change-Id: Iaab34611d60eee4a2036830142b089b2dae81842 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-13 20:48:51 -07:00
Shawn O. Pearce	8ea558bd82	Fix RawTextComparator reduceCommonStartEnd at empty lines When an empty line was inserted at the beginning of the common end part of a RawText the comparator incorrectly considered it to be common, which meant the DiffAlgorithm would later not even have it be part of the region it examines. This would cause JGit to skip a line of insertion, which later confused Gerrit Code Review when it tried to match up the pre and post RawText files for a difference that had this type of insertion. Define two new unit tests to check for this insertion of a blank line condition and correct for it by removing the LF from the common region when the condition is detected. Change-Id: I2108570eb2929803b9a56f9fb9c400c758e7156b Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-13 20:48:51 -07:00
Christian Halstrick	fb1e500adc	Rename method to ResolveMerger.setWorkingTreeIterator() renamed an ugly methodname Change-Id: I26bda06ef64b8644fd3a555dc55dff43cdb56a71 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-13 11:30:41 +02:00
Mathias Kinzler	5c135a5856	DeleteBranchCommand does not clean up upstream configuration It wrongly uses the full name of the branch to remove the configuration entries but must use the shortened one. Change-Id: Ie386a128a6c6beccc20bafd15c2e36254c5f560d Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com>	2010-10-12 12:22:40 -07:00
Christian Halstrick	285d08d8b7	Fix NPE when calling CreateBranch without explict startpoint When creating a branch with CreateBranchCommand.call() without specifying an explicit startPoint HEAD should be used as startPoint. There was a bug leading to an NPE in such a case. Change-Id: Ic0a5dc1f33a0987d66c09996c8012c45785500ff Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-12 19:18:53 +02:00
Christian Halstrick	be93452842	Remove wrong comment in MergeCommand There was a wrong javadoc comment telling that MergeCommand only supports fast-forward merges. This has been fixed. Change-Id: I7edea779a83528beee34a1753026288c384881ce Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-12 19:18:51 +02:00
Christian Halstrick	0a8d54c286	Remove AmbiguousObjectException from BranchCreateCommand.call() We wanted to wrap all LowLevel JGit excpetions into a JGitInternalException so that users of this high-level interface don't have to explicitly catch all of them. This was forgotten on BranchCreateCommand.call() and I added it. Change-Id: Ie140e99574fb004137c66e80fb92eb6c6d0fa5e1 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-12 19:18:50 +02:00
Shawn O. Pearce	e82cadc0dc	Delete PatienceDiff HistogramDiff outperforms it for any case where PatienceDiff needs to fallback to another algorithm. Consequently it's not worth keeping around, because we would always want a fallback enabled. Change-Id: I39b99cb1db4b3be74a764dd3d68cd4c9ecd91481 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-11 17:05:18 -05:00
Shawn O. Pearce	6048f34c58	Use HistogramDiff by default in DiffFormatter Its behavior is similar to PatienceDiff, and runs nearly as fast, often beating the performance of MyersDiff. Change-Id: I43c3faefa8109f1a68ef57522bec9cf27b5df252 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-11 14:37:00 -07:00
Chris Aniszczyk	7429a9a5aa	Merge "Define LowLevelDiffAlgorithm to bypass re-hashing"	2010-10-11 17:18:05 -04:00
Chris Aniszczyk	033ab7f6f0	Merge changes I50dcec81,Ieab28bb3 * changes: Fix empty block corner case in PatienceDiff Fix infinite loop in PatienceDiff	2010-10-11 15:00:51 -04:00
Shawn O. Pearce	4522b07d0f	Fix corrupted large deltas Large objects stored as deltas get unpacked by JGit into a loose object, so they are cheaper to access later on. This unpacking was broken because TeeInputStream copied the wrong length into the loose object, sometimes copying too many bytes into the result. This created a loose object that did not have the correct content, and whose length did not match the length denoted in the object header. Change-Id: I3ce1fd9f3dc5bd195249c7872b3bec49570424a2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-10 18:50:46 -07:00
Shawn O. Pearce	1bd24a23f9	Define LowLevelDiffAlgorithm to bypass re-hashing When passing to a fallback algorithm, we can avoid creating a new copy of the hash codes for each sequence by passing in the hashed sequences directly. This makes it cheaper to switch from HistogramDiff down to MyersDiff in a single pass. Change-Id: Ibf2e81be57c083862eeb134279aed676653bf9b5 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-10 14:39:29 -07:00
Shawn O. Pearce	4fc50df97d	Fix empty block corner case in PatienceDiff There is a corner case where we get an EMPTY region during recursion, but we didn't expect to receive that. Its harmless to ignore the region since the region is empty and has no content, so do so rather than throwing an exception Change-Id: I50dcec81ecba763072bb739adfab5879fb48b23a Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-10 14:39:29 -07:00
Shawn O. Pearce	7a0c126d5f	Fix infinite loop in PatienceDiff Certain inputs caused an infinite loop because the prior match data couldn't be used as expected. Rather than incrementing the match pointer before looking at an element, do it after, so the loop breaks when we wrap around to the starting point. Change-Id: Ieab28bb3485a914eeddc68aa38c256f255dd778c Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-10-10 14:39:29 -07:00
Mathias Kinzler	7bdef4583b	Add "Branch" command The need for branching becomes more pressing with pull support: we need to make sure the upstream configuration entries are written correctly when creating and renaming branches (and of course are cleaned up when deleting them). This adds support for listing, adding, deleting and renaming branches including the more common options. Bug: 326938 Change-Id: I00bcc19476e835d6fd78fd188acde64946c1505c Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-10 15:38:49 -05:00
Christian Halstrick	be2ddff6a7	Add support for single-slash URI In bug 323571 it is mentioned that if you call 'toURI().toURL().toString()' on a java.io.File you cannot pass that string to jgit as an URIish. Problem is that the passed URI looks like 'file:/C:/a/b.txt' and that we where expecting double slashes after scheme':'. This fix adds support for this single-slash file URLs. Bug: 323571 Change-Id: I866a76a4fcd0c3b58e0d26a104fc4564e7ba5999 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-08 23:57:52 +02:00
Mathias Kinzler	db55d13f5f	Add "Pull" command This is the minimal implementation of a "Pull" command. It does not have any parameters besides the generic progress monitor and timeout. It works on the currently checked-out branch and assumes that the configuration contains the keys "branch.<branch name>.remote" and "branch.<branch name>.merge" to determine the remote configuration for the fetch and the remote branch name for the merge. Bug: 303404 Change-Id: I7fe09029996d0cfc09a7d8f097b5d6af1488fa93 Signed-off-by: Mathias Kinzler <mathias.kinzler@sap.com> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-08 08:57:28 -05:00
Christian Halstrick	2160c09dd4	Refactored URI parsing to detect wrong URIs There where quite some bugs regarding wrong URI parsing. In order to solve them the parsing has to be refactored. We now have specialized regexps for 'scheme://host/...', scp URIs and local file names. Now we can detect problems while parsing 'git://host:/abc' which was previously not possible. Bug: 315571 Bug: 292897 Bug: 307017 Bug: 323571 Bug: 317388 Change-Id: If72576576ebb6b9d9dc8b7e51ddd87c9909e8b62 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-08 11:12:09 +02:00
Christian Halstrick	2136095203	Fixed URI regexp regarding user/password part The regular expression which should handle the user/password part in an URI was potentially processing too many chars. This led to problems when user/pwd and port was specified Change-Id: I87db02494c4b367283e1d00437b1c06d2c8fdd28 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-08 11:08:12 +02:00
Christian Halstrick	a1b0ca1807	Introduce commented constants for the segments of an URI regex The regular expressions used to parse URI's are constructed by concatenating different segments to a big String. Introduce String constants for these segements and document them. Change-Id: If8b9dbaaf57ca333ac0b6c9610c3d3a515c540f9 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-08 06:44:16 +02:00
Matthias Sohn	784d388c49	Externalize strings in TransportHttp Some strings were not externalized. Also use them in HTTP tests to ensure that they will also succeed when message bundles are translated. Change-Id: Id02717176557e7d57e676e1339cd89f2be88d330 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-10-08 01:03:17 +03:00
Chris Aniszczyk	7a6efe1dfc	Merge "Support HTTP basic and digest authentication"	2010-10-07 12:25:15 -04:00
Christian Halstrick	0a2b4c1455	Split URI regex strings differently The strings used to construct the regex to parse URIs are split differently. This makes it easier to introduce meaningful String constants later on. Change-Id: I9355fd42e57e0983204465c5d6fe5b6b93655074 Signed-off-by: Christian Halstrick <christian.halstrick@sap.com>	2010-10-06 14:10:24 -05:00
Chris Aniszczyk	47e9e165b8	Add pull operation related constants Change-Id: Idb7526800e80e17624ec05fb10bbc19e7f744f49 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-06 11:46:13 -05:00
Chris Aniszczyk	98a41bd4d0	Add PushCommand API Change-Id: Iff144a51fdc9a1112a21492c390a873a2b293bc9 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-05 13:15:24 -05:00
Shawn O. Pearce	7ba31474a3	Increase core.streamFileThreshold default to 50 MiB Projects like org.eclipse.mdt contain large XML files about 6 MiB in size. So does the Android project platform/frameworks/base. Doing a clone of either project with JGit takes forever to checkout the files into the working directory, because delta decompression tends to be very expensive as we need to constantly reposition the base stream for each copy instruction. This can be made worse by a very bad ordering of offsets, possibly due to an XML editor that doesn't preserve the order of elements in the file very well. Increasing the threshold to the same limit PackWriter uses when doing delta compression (50 MiB) permits a default configured JGit to decompress these XML file objects using the faster random-access arrays, rather than re-seeking through an inflate stream, significantly reducing checkout time after a clone. Since this new limit may be dangerously close to the JVM maximum heap size, every allocation attempt is now wrapped in a try/catch so that JGit can degrade by switching to the large object stream mode when the allocation is refused. It will run slower, but the operation will still complete. The large stream mode will run very well for big objects that aren't delta compressed, and is acceptable for delta compressed objects that are using only forward referencing copy instructions. Copies using prior offsets are still going to be horrible, and there is nothing we can do about it except increase core.streamFileThreshold. We might in the future want to consider changing the way the delta generators work in JGit and native C Git to avoid prior offsets once an object reaches a certain size, even if that causes the delta instruction stream to be slightly larger. Unfortunately native C Git won't want to do that until its also able to stream objects rather than malloc them as contiguous blocks. Change-Id: Ief7a3896afce15073e80d3691bed90c6a3897307 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-10-04 14:04:47 -05:00
Chris Aniszczyk	44b4f458a8	Merge "Add reflog message to TagCommand"	2010-09-29 10:09:43 -04:00
Shawn O. Pearce	858b2c92e8	Support HTTP basic and digest authentication Natively support the HTTP basic and digest authentication methods by setting the Authorization header without going through the JREs java.net.Authenticator API. The Authenticator API is difficult to work with in a multi-threaded server environment, where its using a singleton for the entire JVM. Instead compute the Authorization header from the URIish user and pass, if available. Change-Id: Ibf83fea57cfb17964020d6aeb3363982be944f87 Signed-off-by: Shawn O. Pearce <spearce@spearce.org> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-09-28 08:42:47 +02:00
Chris Aniszczyk	e5c217bcf3	Merge "Use only a single instance for NLS translation bundles"	2010-09-27 17:59:36 -04:00
Chris Aniszczyk	153c796bce	Merge "Update FetchCommand with dry run and thin options"	2010-09-27 10:07:49 -04:00
Robin Rosenberg	65ed25b34e	Return the documented value from DirCacheCheckout.checkout Change-Id: I34d773b18e6a1ee05774d7b9471f9915c48aa63e Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com>	2010-09-27 10:51:07 +02:00
Christian Halstrick	82d75f31d4	Merge "Extend merge support for bare repositories"	2010-09-27 04:46:06 -04:00
Robin Rosenberg	be9d096986	Use only a single instance for NLS translation bundles As findbugs pointed out, there was a small risk for creating multiple instances of translation bundles. If that happens, drop the second instance. Change-Id: I3aacda86251d511f6bbc2ed7481d561449ce3b6c Signed-off-by: Robin Rosenberg <robin.rosenberg@dewire.com>	2010-09-26 09:46:35 +02:00
Shawn O. Pearce	b533a72934	Implement HistogramDiff HistogramDiff is an alternative implementation of patience diff, performing a search over all matching locations and picking the longest common subsequence that has the lowest occurrence count. If there are unique common elements, its behavior is identical to that of patience diff. Actual performance on real-world source files usually beats MyersDiff, sometimes by a factor of 3, especially for complex comparators that ignore whitespace. Change-Id: I1806cd708087e36d144fb824a0e5ab7cdd579d73 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-24 19:10:09 -07:00
Shawn O. Pearce	9bcf391355	Micro-optimize EditList.addAll Pass through the addAll request to our underlying ArrayList. This way the underlying ArrayList grows no more than once during the call, which may be important if the list was originally allocated at the default size of 16, but 64 Edits are being added. Change-Id: I31c3261e895766f82c3c832b251a09f6e37e8860 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-24 13:58:12 -07:00
Chris Aniszczyk	39734f2908	Update FetchCommand with dry run and thin options FetchCommand was missing the ability to set dry run and thin preferences on the transport operation. Change-Id: I0bef388a9b8f2e3a01ecc9e7782aaed7f9ac82ce Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-09-24 15:33:38 -05:00
Dmitry Fink	906887a735	Extend merge support for bare repositories Optional inCore parameter to Resolver/Strategy will instruct it to perform all the operations in memory and avoid modifying working folder even if there is one. Change-Id: I5b873dead3682f79110f58d7806e43f50bcc5045	2010-09-24 11:01:20 -07:00
Shawn O. Pearce	11f99fecfd	Reduce content hash function collisions The hash code returned by RawTextComparator (or that is used by the SimilarityIndex) play an important role in the speed of any algorithm that is based upon them. The lower the number of collisions produced by the hash function, the shorter the hash chains within hash tables will be, and the less likely we are to fall into O(N^2) runtime behaviors for algorithms like PatienceDiff. Our prior hash function was absolutely horrid, so replace it with the proper definition of the DJB hash that was originally published by Professor Daniel J. Bernstein. To support this assertion, below is a table listing the maximum number of collisions that result when hashing the unique lines in each source code file of 3 randomly chosen projects: test_jgit: 931 files; 122 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 418 djb 5 sha1 6 string_hash31 11 test_linux26: 30198 files; 258 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 8675 djb 32 sha1 8 string_hash31 32 test_frameworks_base: 8381 files; 184 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 4615 djb 10 sha1 6 string_hash31 13 We can clearly see that prior_hash performed very poorly, resulting in 8,675 collisions (elements in the same hash bucket) for at least one file in the Linux kernel repository. This leads to some very bad O(N) style insertion and lookup performance, even though the hash table was sized to be the next power-of-2 larger than the total number of unique lines in the file. The djb hash we are replacing prior_hash with performs closer to SHA-1 in terms of having very few collisions. This indicates it provides a reasonably distributed output for this type of input, despite being a much simpler algorithm (and therefore will be much faster to execute). The string_hash31 function is provided just to compare results with, it is the algorithm commonly used by java.lang.String hashCode(). However, life isn't quite this simple. djb produces a 32 bit hash code, but our hash tables are always smaller than 2^32 buckets. Mashing the 32 bit code into an array index used to be done by simply taking the lower bits of the hash code by a bitwise and operator. This unfortuntely still produces many collisions, e.g. 32 on the linux-2.6 repository files. From [1] we can apply a final "cleanup" step to the hash code to mix the bits together a little better, and give priority to the higher order bits as they include data from more bytes of input: test_jgit: 931 files; 122 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 418 djb 5 djb + cleanup 6 test_linux26: 30198 files; 258 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 8675 djb 32 djb + cleanup 7 test_frameworks_base: 8381 files; 184 avg. unique lines/file Algorithm \| Collisions -------------+----------- prior_hash 4615 djb 10 djb + cleanup 7 This is a massive improvement, as the number of collisions for common inputs drops to acceptable levels, and we haven't really made the hash functions any more complex than they were before. [1] http://lkml.org/lkml/2009/10/27/404 Change-Id: Ia753b695de9526a157ddba265824240bd05dead1 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-23 17:44:53 -07:00
Chris Aniszczyk	fcc3349cfc	Add reflog message to TagCommand Ensure we update the reflog when tagging. Change-Id: I3f4a4d68cbfc62d2276e3a47e3e3720f02cb2522 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-09-22 10:47:44 -05:00
Shawn O. Pearce	857d68d173	Perform common start/end elimination by default for DiffAlgorithm As it turns out, every single diff algorithm we might try to implement can benfit from using the SequenceComparator's native concept of the simple reduceCommonStartEnd() step. For most inputs, there can be a significant number of elements that can be removed from the space the DiffAlgorithm needs to consider, which will reduce the overall running time for the final solution. Pool this logic inside of DiffAlgorithm itself as a default, but permit a specific algorithm to override it when necessary. Convert MyersDiff to use this reduction to reduce the space it needs to search, making it perform slightly better on common inputs. Change-Id: I14004d771117e4a4ab2a02cace8deaeda9814bc1 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-21 18:00:59 -07:00
Shawn O. Pearce	e84d826eb6	Remove unnecessary hash cache from PatienceDiffIndex PatienceDiff always uses a HashedSequence, which promises to provide constant time access for hash codes during the equals method and aborts fast if the hash codes don't match. Therefore we don't need to cache the hash codes inside of the index, saving us memory. Change-Id: I80bf1e95094b7670e6c0acc26546364a1012d60e Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-21 18:00:31 -07:00
Shawn O. Pearce	a67afbfee1	Implement Bram Cohen's Patience Diff Change-Id: Ic7a76df2861ea6c569ab9756a62018987912bd13 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-20 18:15:22 -07:00
Shawn O. Pearce	baaddd51f1	Move cached element hash codes to HashedSequence Most diff implementations really want to use cached hash codes for elements, rather than element equality, as they need to perform many compares and unique hash codes for elements can really speed that process up. To make it easier to define element hash functions, move the caching of hash codes into a wrapper sequence type, so that individual sequence types like RawText don't need to do this themselves. This has a nice property of also allowing the sequence to no longer care about the specific SequenceComparator that is going to be used, and permits the caching to only examine the middle region that isn't common to the two inputs. Change-Id: If8623556da9419117b07c5073e8bce39de02570e Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-20 18:15:21 -07:00
Shawn O. Pearce	e0970cd1b4	Micro-optimize reduceCommonStartEnd for RawText This is a faster exact match based form that tries to improve performance for the common case of the header and trailer of a text file not changing at all. After this fast path we use the slower path based on the super class' using equals() to allow for whitespace ignore modes to still work. Some simple performance testing showed a major improvement over the older implementation for a common edit we see in JGit. The test compared blob `29a89bc` and `372a978`, which is the ObjectDirectory.java file difference in commit `41dd9ed1c0`. The two text files are approximately 22 KiB in size. DEFAULT old 203900 ns DEFAULT new 100400 ns This new version is 2x faster for the DEFAULT comparator, which does not treat space specially. This is because we can now examine a larger swath of text with fewer instructions per byte compared. The older algorithm had to stop at each line break and recompute how to examine the next line, while the new algorithm only stops when the first difference is found. WS_IGNORE_ALL old 298500 ns WS_IGNORE_ALL new 63300 ns Its 4.7x faster for the whitespace ignore comparator, as the common header and footer do not have a whitespace difference. Avoiding the special case handling for whitespace on each byte considered saves a lot of time. Since most edits to source code (and other text like files) appears in the interior of the file, fast elimination of common header/footer means faster diff throughput. In the less common case of an actual header or footer edit, the common header/footer elimination is stopped rather quickly either way, so there is very little downside to the optimiation applied here. Change-Id: I1d501b4c3ff80ed086b20bf12faf51ae62167db7 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-20 18:05:41 -07:00
Shawn O. Pearce	590a9f94a1	Add Subsequence utility methods DiffAlgorithm implementations may find it useful to construct an Edit and use that to later subsequence the two base sequences, so define two new utility methods a() and b() to construct the A and B ranges. Once a subsequence has had Edits created for it the indexes are within the space of the subsequence. These must be shifted back to the original base sequence's indexes. Define toBase() as a utility method to perform that shifting work in-place, so DiffAlgorithm implementations have an efficient way to convert back to the caller's original space. Change-Id: I8d788e4d158b9f466fa9cb4a40865fb806376aee Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-20 18:05:41 -07:00
Chris Aniszczyk	6cc5a58915	Merge "Define a subsequence utility type"	2010-09-17 15:10:15 -04:00
Chris Aniszczyk	207ab8b8f5	Merge "Define DiffAlgorithm as an abstract function"	2010-09-17 15:08:27 -04:00
Chris Aniszczyk	bbabc19e2f	Add FetchCommand Adds API for performing git fetch operations. Change-Id: Idd95664fd4e3bca03211e4ffda3e354849f92a35 Signed-off-by: Chris Aniszczyk <caniszczyk@gmail.com>	2010-09-17 13:32:59 -05:00
Shawn O. Pearce	2ee6d95e5b	Fix UnsupportedOperationException while fixing thin pack If a thin pack has a large delta we need to be able to open its cached copy from the loose object directory through the CachedObjectDatabase handle. Unfortunately that did not support the openObject2 method, which the LargePackedDeltaObject used directly to bypass looking at the pack files. Bug: 324868 Change-Id: I1d5886a6c3254c6dea2852d50b8614c31a93e615 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-17 09:13:34 -07:00
Stefan Lay	feeb3df047	Merge "Probe filesystem and set core.filemode correctly"	2010-09-17 09:32:28 -04:00
Shawn O. Pearce	5deb5b9a4a	Merge branch 'stable-0.9' * stable-0.9: Qualify post-0.9.3 builds JGit 0.9.3 clone: Correct formatting of init message Fix cloning of repositories with big objects Qualify post-0.9.1 builds JGit 0.9.1 Fix PlotCommitList to set lanes on child-less commits	2010-09-16 17:22:37 -07:00
Shawn O. Pearce	5fce8d81d8	Fix cloning of repositories with big objects When running IndexPack we use a CachedObjectDirectory, which knows what objects are loose and tries to avoid stat(2) calls for objects that do not exist in the repository, as stat(2) on Win32 is very slow. However large delta objects found in a pack file are expanded into a loose object, in order to avoid costly delta chain processing when that object is used as a base for another delta. If this expand occurs while working with the CachedObjectDirectory, we need to update the cached directory data to include this new object, otherwise it won't be available when we try to open it during the object verify phase. Bug: 324868 Change-Id: Idf0c76d4849d69aa415ead32e46a435622395d68 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-15 08:42:14 -07:00
Shawn O. Pearce	3fe527624d	Probe filesystem and set core.filemode correctly When creating a new FileRepository, probe the capability of the local filesystem and set core.filemode based on how it reacts. We can't just rely on FS.supportsExecute() because a POSIX system (which usually does support execute) might be storing the repository on a partition that doesn't have execute support (e.g. plain FAT-32). Creating a temporary file, setting both states, checking we get the desired results will let us set the variable correctly on all systems. Change-Id: I551488ea8d352d2179c7b244f474d2e3d02567a2 Signed-off-by: Shawn O. Pearce <spearce@spearce.org>	2010-09-15 07:59:38 -07:00
Christian Halstrick	2dc031ad9b	Fix PlotCommitList to set lanes on child-less commits In PlotCommitList.enter() commits are positioned on lanes for visual presentation. This implementation was buggy: commits without children (often the starting points for the RevWalk) are not positioned on separate lanes. The problem was that when handling commits with multiple children (that's where branches fork out) it was not handled that some of the children may not have been positioned on a lane yet. I fixed that and added a number of tests which specifically test the layout of commits on lanes. Bug: 300282 Bug: 320263 Change-Id: I267b97ecccb5251cec54cec90207e075ab50503e Signed-off-by: Christian Halstrick <christian.halstrick@sap.com> Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	2010-09-14 18:19:44 +02:00

... 3 4 5 6 7 ...

934 Commits