Commits · c41871e542dce9cfb1b5cf1814e17c65c7e04ce4 · lvzhengyang / git2

12 May, 2020 1 commit

tests: merge: fix printf formatter on 32 bit arches · 0cf9b666

We currently use `PRIuMAX` to print an integer of type `size_t` in
merge::trees::rename::cache_recomputation. While this works just fine on
64 bit arches, it doesn't on 32 bit ones. As a result, our nightly
builds on x86 and arm32 fail.

Fix the issue by using `PRIuZ` instead.

committed 4 years ago

0cf9b666 Browse Directory

01 Apr, 2020 1 commit

merge: cache negative cache results for similarity metrics · 4dfcc50f

When computing renames, we cache the hash signatures for each of the
potentially conflicting entries so that we do not need to repeatedly
read the file and can at least halfway efficiently determine whether two
files are similar enough to be deemed a rename. In order to make the
hash signatures meaningful, we require at least four lines of data to be
present, resulting in at least four different hashes that can be
compared. Files that are deemed too small are not cached at all and
will thus be repeatedly re-hashed, which is usually not a huge issue.

The issue with above heuristic is in case a file does _not_ have at
least four lines, where a line is anything separated by a consecutive
run of "\n" or "\0" characters. For example "a\nb" is two lines, but
"a\0\0b" is also just two lines. Taken to the extreme, a file that has
megabytes of consecutive space- or NUL-only may also be deemed as too
small and thus not get cached. As a result, we will repeatedly load its
blob, calculate its hash signature just to finally throw it away as we
notice it's not of any value. When you've got a comparitively big file
that you compare against a big set of potentially renamed files, then
the cost simply expodes.

The issue can be trivially fixed by introducing negative cache entries.
Whenever we determine that a given blob does not have a meaningful
representation via a hash signature, we store this negative cache marker
and will from then on not hash it again, but also ignore it as a
potential rename target. This should help the "normal" case already
where you have a lot of small files as rename candidates, but in the
above scenario it's savings are extraordinarily high.

To verify we do not hit the issue anymore with described solution, this
commit adds a test that uses the exact same setup described above with
one 50 megabyte blob of '\0' characters and 1000 other files that get
renamed. Without the negative cache:

$ time ./libgit2_clar -smerge::trees::renames::cache_recomputation >/dev/null
real    11m48.377s
user    11m11.576s
sys     0m35.187s

And with the negative cache:

$ time ./libgit2_clar -smerge::trees::renames::cache_recomputation >/dev/null
real    0m1.972s
user    0m1.851s
sys     0m0.118s

So this represents a ~350-fold performance improvement, but it obviously
depends on how many files you have and how big the blob is. The test
number were chosen in a way that one will immediately notice as soon as
the bug resurfaces.

committed 4 years ago

4dfcc50f Browse Directory

18 Jan, 2020 1 commit

merge: update enum type name for consistency · 94beb3a3

libgit2 does not use `type_t` suffixes as it's redundant; thus, rename
`git_merge_diff_type_t` to `git_merge_diff_t` for consistency.

committed 5 years ago

94beb3a3 Browse Directory

20 Jul, 2019 1 commit

fileops: rename to "futils.h" to match function signatures · e54343a4

Our file utils functions all have a "futils" prefix, e.g.
`git_futils_touch`. One would thus naturally guess that their
definitions and implementation would live in files "futils.h" and
"futils.c", respectively, but in fact they live in "fileops.h".

Rename the files to match expectations.

committed 5 years ago

e54343a4 Browse Directory

15 Jun, 2019 1 commit

blob: add underscore to `from` functions · 08f39208

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the blob functions for
consistency with the rest of the library.

committed 5 years ago

08f39208 Browse Directory

13 Jun, 2019 1 commit

tests: merge::analysis: use variants to deduplicate test suites · 70fae43c

Since commit 394951ad (tests: allow for simple data-driven
tests, 2019-06-07), we have the ability to run a given test suite
with multiple variants. Use this new feature to deduplicate the
test suites for merge::{trees,workdir}::analysis into a single
test suite.

committed 5 years ago

70fae43c Browse Directory

10 Jun, 2019 5 commits
- Fix memleaks in analysis tests. · 438c9958
```
Wrap some missed setup api calls in asserts.
```
  Robert Coup committed 5 years ago
  438c9958 Browse Directory
- Review fixes: · 21ddeabe
```
- whitespace -> tabs
- comment style
- improve repo naming in merge/trees/analysis tests.
```
  Robert Coup committed 5 years ago
  21ddeabe Browse Directory
- Refactor testing: · 7b27b6cf
```
- move duplication between merge/trees/ and merge/workdir/ into merge/analysis{.c,.h}
- remove merge-resolve.git resource, open the existing merge-resolve as a bare repo instead.
```
  Robert Coup committed 5 years ago
  7b27b6cf Browse Directory
- merge: add doc header to analysis tests · 5427461f
  Robert Coup committed 5 years ago
  
  5427461f Browse Directory
- merge: tests for bare repo merge analysis · 1d04f477
```
dupe of workdir/analysis.c against a bare repo.
```
  Robert Coup committed 5 years ago
  1d04f477 Browse Directory
14 Dec, 2018 1 commit
- annotated_commit: add failing test for looking up from annotated tag · 0f299365
  Carlos Martín Nieto committed 6 years ago
  
  0f299365 Browse Directory
01 Dec, 2018 1 commit
- object_type: use new enumeration names · 168fe39b
```
Use the new object_type enumeration names within the codebase.
```
  Edward Thomson committed 6 years ago
  168fe39b Browse Directory
19 Oct, 2018 1 commit

merge: make analysis possible against a non-HEAD reference · 6e9fb040

This moves the current merge analysis code into a more generic version
that can work against any reference.

Also change the tests to check returned analysis values exactly.

committed 6 years ago

6e9fb040 Browse Directory

13 Jul, 2018 1 commit

treewide: remove use of C++ style comments · 9994cd3f

C++ style comment ("//") are not specified by the ISO C90 standard and
thus do not conform to it. While libgit2 aims to conform to C90, we did
not enforce it until now, which is why quite a lot of these
non-conforming comments have snuck into our codebase. Do a tree-wide
conversion of all C++ style comments to the supported C style comments
to allow us enforcing strict C90 compliance in a later commit.

committed 6 years ago

9994cd3f Browse Directory

06 Jul, 2018 1 commit
- tests: add missing cl_git_pass to tests · 8455a270
```
Reported by Coverity, CID 1393678-1393697.
```
  Etienne Samson committed 6 years ago
  8455a270 Browse Directory
10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
  ecf4f33a Browse Directory
04 Feb, 2018 3 commits

Add failing test case for virtual commit merge base issue · b8823c2b
Edward Thomson committed 7 years ago

b8823c2b Browse Directory
merge::trees::recursive: test for virtual base building · afcaf35e
```
Virtual base building: ensure that the virtual base is created and
revwalked in the same way as git.
```
Edward Thomson committed 7 years ago
afcaf35e Browse Directory

merge: reverse merge bases for recursive merge · b924df1e

When the commits being merged have multiple merge bases, reverse the
order when creating the virtual merge base.  This is for compatibility
with git's merge-recursive algorithm, and ensures that we build
identical trees.

Git does this to try to use older merge bases first.  Per 8918b0c:

> It seems to be the only sane way to do it: when a two-head merge is
> done, and the merge-base and one of the two branches agree, the
> merge assumes that the other branch has something new.
>
> If we start creating virtual commits from newer merge-bases, and go
> back to older merge-bases, and then merge with newer commits again,
> chances are that a patch is lost, _because_ the merge-base and the
> head agree on it. Unlikely, yes, but it happened to me.

committed 7 years ago

b924df1e Browse Directory

21 Jan, 2018 2 commits

merge: test CR/LF conflicts for CR/LF files · 2a8841ae

Ensure that when the files being merged have CR/LF line endings that the
conflict markers produced in the conflict file also have CR/LF line
endings.

committed 7 years ago

2a8841ae Browse Directory

merge: recursive uses larger conflict markers · 185b0d08

Git uses longer conflict markers in the recursive merge base - two more
than the default (thus, 9 character long conflict markers). This allows
users to tell the difference between the recursive merge conflicts and
conflicts between the ours and theirs branches.

This was introduced in git d694a17986a28bbc19e2a6c32404ca24572e400f.

Update our tests to expect this as well.

committed 7 years ago

185b0d08 Browse Directory

04 Dec, 2017 1 commit
- Do not attempt to check out submodule as blob when merging a submodule modify/deltete conflict · 2a3e0635
  David Turner committed 7 years ago
  
  2a3e0635 Browse Directory
11 Nov, 2017 1 commit

tests: add test case for index reloads on merge · 5248a1a5

Adds a test case for the issue #4203, when diverging indexes
on memory and disk cause git merge to abort with GIT_ECONFLICT

committed 7 years ago

5248a1a5 Browse Directory

09 Feb, 2017 1 commit

merge_trees: introduce test for submodule renames · 49806e9b

Test that shows that submodules are incorrectly considered in renames,
and `git_merge_trees` will fail to lookup the submodule as a blob.

committed 8 years ago

49806e9b Browse Directory

01 Jan, 2017 1 commit
- merge: set default rename threshold · 19ed4d0c
```
When `GIT_MERGE_FIND_RENAMES` is set, provide a default for
`rename_threshold` when it is unset.
```
  Edward Thomson committed 8 years ago
  19ed4d0c Browse Directory
26 May, 2016 1 commit
- git_diff_generated: abstract generated diffs · 9be638ec
  Edward Thomson committed 8 years ago
  
  9be638ec Browse Directory
17 Mar, 2016 8 commits
- merge drivers: handle configured but not found driver · d953c450
  Edward Thomson committed 8 years ago
  
  d953c450 Browse Directory
- merge driver: remove `check` callback · 6d8b2cdb
```
Since the `apply` callback can defer, the `check` callback is not
necessary.  Removing the `check` callback further makes the `payload`
unnecessary along with the `cleanup` callback.
```
  Edward Thomson committed 8 years ago
  6d8b2cdb Browse Directory
- merge driver: tests for set and unset merge attribute · 58d33126
```
Ensure that setting the merge attribute forces the built-in default
`text` driver and does *not* honor the `merge.default` configuration
option.  Further ensure that unsetting the merge attribute forces
a conflict (the `binary` driver).
```
  Edward Thomson committed 8 years ago
  58d33126 Browse Directory
- merge driver: tests for custom default merge drivers · d3f0875a
  Edward Thomson committed 8 years ago
  
  d3f0875a Browse Directory
- merge driver: test GIT_EMERGECONFLICT · 7d307c1e
```
When a `check` or `apply` callback function returns `GIT_EMERGECONFLICT`
stop and product a conflict.
```
  Edward Thomson committed 8 years ago
  7d307c1e Browse Directory
- merge driver: test GIT_PASSTHROUGH · 59f29314
```
When a `check` or `apply` callback function returns `GIT_PASSTHROUGH`,
move on to the default merge driver.
```
  Edward Thomson committed 8 years ago
  59f29314 Browse Directory
- merge driver: introduce custom merge drivers · 3f04219f
```
Consumers can now register custom merged drivers with
`git_merge_driver_register`.  This allows consumers to support the
merge drivers, as configured in `.gitattributes`.  Consumers will be
asked to perform the file-level merge when a custom driver is
configured.
```
  Edward Thomson committed 8 years ago
  3f04219f Browse Directory
- Fix rebase bug and include test for merge=union · 7a74590d
  Stan Hu committed 8 years ago
  
  7a74590d Browse Directory
07 Mar, 2016 1 commit
- merge::workdir::dirty: update to use `st_ctime_nsec` · 6abdf52d
```
Update unit test to use newfangled `st_ctime_nsec`, which provides
indirection to the platform-correct name.
```
  Edward Thomson committed 8 years ago
  6abdf52d Browse Directory
12 Feb, 2016 1 commit
- win32: introduce p_timeval that isn't stupid · 35439f59
```
Windows defines `timeval` with `long`, which we cannot
sanely cope with.  Instead, use a custom timeval struct.
```
  Edward Thomson committed 9 years ago
  35439f59 Browse Directory
11 Feb, 2016 1 commit
- merge tests: correct casts · 263e674e
  Edward Thomson committed 9 years ago
  
  263e674e Browse Directory
25 Nov, 2015 2 commits

recursive merge: add a recursion limit · 5b9c63c3
Edward Thomson committed 9 years ago

5b9c63c3 Browse Directory

merge: handle conflicts in recursive base building · 78859c63

When building a recursive merge base, allow conflicts to occur.
Use the file (with conflict markers) as the common ancestor.

The user has already seen and dealt with this conflict by virtue
of having a criss-cross merge. If they resolved this conflict
identically in both branches, then there will be no conflict in the
result. This is the best case scenario.

If they did not resolve the conflict identically in the two branches,
then we will generate a new conflict. If the user is simply using
standard conflict output then the results will be fairly sensible.
But if the user is using a mergetool or using diff3 output, then the
common ancestor will be a conflict file (itself with diff3 output,
haha!). This is quite terrible, but it matches git's behavior.

committed 9 years ago

78859c63 Browse Directory