Commits · 10e2573550b0753f4aa0728bb2bbacf57d2377d0 · lvzhengyang / git2

28 Apr, 2021 1 commit

diff: use git_repository_workdir_path · 91156a0f

The new git_repository_workdir_path function does error checking on
working directory inputs on Windows; use it to construct paths within
working directories.

committed 3 years ago

11 Dec, 2020 2 commits

Small refactor to make things tidier · 6cd0c853
```
Also repurposed an unused function and deleted another one.
```
lhchavez committed 4 years ago

Cache the parsed submodule config when diffing · 41da4e16

This change ensures that anything that calls `git_diff__from_iterators`
(any of the `git_diff_xxx` functions) only needs to parse the
`.gitmodules` file once. The repeated parsing can already be avoided by
calling `git_repository_submodule_cache_all(...)`, but we can do that
safely for the user with no change in semantics.

Fixes: #5725
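
A minimal sketch of the idea, assuming a hypothetical `struct repo` and hypothetical helper names (none of these are libgit2's real internals): the cache is enabled for the duration of one diff and switched off again afterwards, so the caller observes no lasting change.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical repository state; not libgit2's real structure. */
struct repo {
	int cache_enabled;         /* nonzero while the parsed config is cached */
	unsigned int parse_count;  /* how often ".gitmodules" was parsed */
};

/* Stand-in for the expensive parse of ".gitmodules". */
static void parse_gitmodules(struct repo *r)
{
	r->parse_count++;
}

/* Each submodule lookup re-parses unless the cache is enabled. */
static void submodule_lookup(struct repo *r)
{
	if (!r->cache_enabled)
		parse_gitmodules(r);
}

/* Enable the cache for one diff, then restore the previous state. */
void diff_entries(struct repo *r, size_t n_entries)
{
	int enabled_here = 0;
	size_t i;

	assert(r);

	if (!r->cache_enabled) {
		parse_gitmodules(r);   /* parse once up front */
		r->cache_enabled = 1;
		enabled_here = 1;
	}

	for (i = 0; i < n_entries; i++)
		submodule_lookup(r);

	if (enabled_here)
		r->cache_enabled = 0;  /* caller sees no lasting change */
}
```

With the cache left off, the same loop would call the parser once per entry.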

committed 4 years ago

27 Nov, 2020 2 commits
- iterator: use GIT_ASSERT · 79b0c8c8
  Edward Thomson committed 4 years ago
  
- diff: use GIT_ASSERT · 5d6c2f26
  Edward Thomson committed 4 years ago
  
09 Jun, 2020 1 commit

tree-wide: mark local functions as static · a6c9e0b3

We've accumulated quite a few functions which are never used outside of
their respective code unit, but which are lacking the `static` keyword.
Add it to reduce their linkage scope and allow the compiler to optimize
better.
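
As a small illustration (the function names are hypothetical, not taken from libgit2), a helper used only inside one translation unit can be given internal linkage while the file's public entry point keeps external linkage:

```c
#include <assert.h>
#include <stddef.h>

/* Internal linkage: invisible to other translation units, so the compiler
 * may inline it or drop it without keeping an exported symbol. */
static size_t count_slashes(const char *path)
{
	size_t n = 0;

	for (; *path; path++)
		if (*path == '/')
			n++;
	return n;
}

/* External linkage: the only symbol this file exports. */
size_t path_depth(const char *path)
{
	assert(path);
	return count_slashes(path) + 1;
}
```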

committed 4 years ago

01 Jun, 2020 1 commit
- git_pool_init: handle failure cases · 0f35efeb
```
Propagate failures caused by pool initialization errors.
```
  Edward Thomson committed 4 years ago
18 Jan, 2020 1 commit

iterator: update enum type name for consistency · b59c71d8

libgit2 does not use `type_t` suffixes as they are redundant; thus, rename
`git_iterator_type_t` to `git_iterator_t` for consistency.

committed 5 years ago

22 Nov, 2019 1 commit
- blob: use `git_object_size_t` for object size · 4334b177
```
Instead of using a signed type (`off_t`) use a new `git_object_size_t`
for the sizes of objects.
```
  Edward Thomson committed 5 years ago
05 Nov, 2019 1 commit
- fix a bug introduced in 8a23597b · 1886478d
  romkatv committed 5 years ago
  
27 Aug, 2019 2 commits

diff_generate: detect memory allocation errors when preparing opts · fe241071

When preparing options for the two iterators that are about to be
diffed, we allocate a common prefix for both iterators depending on
the options passed by the user. We do not check whether the allocation
was successful, though. In fact, this isn't much of a problem, as using
a `NULL` prefix is perfectly fine. But in the end, we probably want to
detect that the system doesn't have any memory left, as we're unlikely
to be able to continue afterwards anyway.

While the issue is being fixed in the newly created function
`diff_prepare_iterator_opts`, it already existed in the previous macro
`DIFF_FROM_ITERATORS`.

committed 5 years ago

diff_generate: refactor `DIFF_FROM_ITERATORS` macro of doom · 8a23597b

While the `DIFF_FROM_ITERATORS` macro does make it shorter to implement
the various `git_diff_foo_to_bar` functions, it is a complex and
unreadable beast that implicitly assumes certain local variable names.
This is not desirable at all and obstructs understanding and, more
importantly, debugging the code by quite a bit.

The `DIFF_FROM_ITERATORS` macro basically removed the burden of having
to derive the options for both iterators from a pair of iterator flags
and the diff options. This patch introduces a new function that does
exactly that and refactors all callers to manage the iterators by
themselves.

As we potentially need to allocate a shared prefix for the
iterator, we need to tell the caller to free that prefix as soon as
the options aren't required anymore. Thus, the function has a `char
**prefix` out pointer that will get set to the allocated string and
subsequently be free'd by the caller.

While this patch increases the line count, I personally deem this to be
an acceptable tradeoff for increased readability.
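
The ownership pattern described above might look roughly like the following sketch. The struct, the function name `prepare_iter_opts` and its signature are assumptions made for illustration; they are not the actual libgit2 code. The function also reports an allocation failure instead of silently continuing with a `NULL` prefix.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for per-iterator options. */
struct iter_opts {
	const char *start;
	const char *end;
	unsigned int flags;
};

/*
 * Fill in the options for both iterators. If `common` is non-NULL, a copy
 * is allocated, shared by both option structs and handed back through
 * `prefix`; the caller owns it and must free() it once the options are no
 * longer needed. Returns 0 on success, -1 if the allocation failed.
 */
int prepare_iter_opts(char **prefix,
	struct iter_opts *a, unsigned int aflags,
	struct iter_opts *b, unsigned int bflags,
	const char *common)
{
	assert(prefix && a && b);

	*prefix = NULL;
	if (common) {
		size_t len = strlen(common) + 1;

		if ((*prefix = malloc(len)) == NULL)
			return -1;
		memcpy(*prefix, common, len);
	}

	a->start = a->end = b->start = b->end = *prefix;
	a->flags = aflags;
	b->flags = bflags;
	return 0;
}
```

A caller would create both iterators from `a` and `b`, run the diff, and only then call `free(prefix)`.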

committed 5 years ago

20 Jul, 2019 1 commit

fileops: rename to "futils.h" to match function signatures · e54343a4

Our file utils functions all have a "futils" prefix, e.g.
`git_futils_touch`. One would thus naturally guess that their
definitions and implementation would live in files "futils.h" and
"futils.c", respectively, but in fact they live in "fileops.h".

Rename the files to match expectations.

committed 5 years ago

18 Jul, 2019 1 commit
- configuration: cvar -> configmap · 658022c4
```
`cvar` is an unhelpful name.  Refactor its usage to `configmap` for more
clarity.
```
  Patrick Steinhardt committed 5 years ago
15 Jun, 2019 1 commit

oid: `is_zero` instead of `iszero` · 5d92e547

The only function that is named `issomething` (without underscore) was
`git_oid_iszero`.  Rename it to `git_oid_is_zero` for consistency with
the rest of the library.

committed 5 years ago

25 Jan, 2019 2 commits

diff_generate: validate oid file size · 89bd4ddb
```
Index entries are 32 bit unsigned ints, not `size_t`s.
```
Edward Thomson committed 6 years ago

blob: validate that blob sizes fit in a size_t · c6cac733

Our blob size is a `git_off_t`, which is a signed 64 bit int. This may
be erroneously negative or larger than `SIZE_MAX`. Ensure that the blob
size fits into a `size_t` before casting.
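
The two size checks from this commit and the preceding one can be sketched as follows. The function names are hypothetical; only the range logic is taken from the commit messages.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Store `size` as a size_t, or return -1 if it is negative or does not
 * fit into size_t on this platform. */
int blob_size_to_size_t(size_t *out, int64_t size)
{
	assert(out);

	if (size < 0 || (uint64_t)size > (uint64_t)SIZE_MAX)
		return -1;

	*out = (size_t)size;
	return 0;
}

/* Index entries store the file size in 32 unsigned bits, so anything
 * larger cannot be represented there. */
int size_fits_index_entry(size_t size)
{
	return size <= UINT32_MAX;
}
```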

committed 6 years ago

22 Jan, 2019 1 commit
- git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
  Edward Thomson committed 6 years ago
01 Dec, 2018 2 commits
- object_type: use new enumeration names · 168fe39b
```
Use the new object_type enumeration names within the codebase.
```
  Edward Thomson committed 6 years ago
- index: use new enum and structure names · 18e71e6d
```
Use the new-style index names throughout our own codebase.
```
  Edward Thomson committed 6 years ago
10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
06 Jun, 2018 1 commit

Fix stash save bug with fast path index check · 5a7d454b

If the index contains stat data for a modified file, and the file is
not racily dirty, and there exists an untracked working tree directory
alphabetically after that file, and there are no other changes to the
repo, then git_stash_save would fail. It would confuse the untracked
working tree directory for the modified file, because they have the
same sha: zero. The wt directory has a sha of zero because it's a
directory, and the file would have a zero sha because we wouldn't read
the file -- we would just know that it doesn't match the index. To
fix this confusion, we simply check mode as well as SHA.
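
A cut-down sketch of that comparison, using a hypothetical `struct entry` rather than libgit2's real index entry type: an unhashed file and a directory both carry an all-zero id, so only the mode tells them apart.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define ID_RAWSZ 20

/* Hypothetical entry holding just the fields the comparison needs. */
struct entry {
	uint32_t mode;               /* e.g. 0100644 for a file, 040000 for a tree */
	unsigned char id[ID_RAWSZ];  /* all zeros when the content was never hashed */
};

/* Two entries only match when both the mode and the id agree. */
int entry_matches(const struct entry *a, const struct entry *b)
{
	assert(a && b);
	return a->mode == b->mode && memcmp(a->id, b->id, ID_RAWSZ) == 0;
}
```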

committed 6 years ago

03 Jan, 2018 1 commit

diff_generate: avoid excessive stats of .gitattributes files · d8896bda

When generating a diff between two trees, for each file that is to be
diffed we have to determine whether it shall be treated as a text or as
a binary file. While git has heuristics to determine which kind of diff
to generate, users can also override that default behaviour by setting
or unsetting the 'diff' attribute for specific files.

Because of that, we have to query gitattributes in order to determine
how to diff the current files. Instead of hitting the '.gitattributes'
file every time we need to query an attribute, which can get expensive
especially on networked file systems, we try to cache them instead. This
works perfectly fine for every '.gitattributes' file that is found, but
we hit cache invalidation problems when we determine that an attributes
file does _not_ exist. We do create an entry in the cache for missing
'.gitattributes' files, but as soon as we hit that file again we
invalidate it and stat it again to see if it has now appeared.

In the case of diffing large trees with each other, this behaviour is
very suboptimal. For each pair of files that is to be diffed, we will
repeatedly query every directory component leading towards their
respective location for an attributes file. This leads to thousands or
even hundreds of thousands of wasted syscalls.

The attributes cache already has a mechanism to help in that scenario in
the form of the `git_attr_session`. As long as the same attributes
session is still active, we will not try to re-query the gitattributes
files at all but simply retain our currently cached results. To fix our problem, we
can create a session at the top-most level, which is the initialization
of the `git_diff` structure, and use it in order to look up the correct
diff driver. As the `git_diff` structure is used to generate patches for
multiple files at once, this neatly solves our problem by retaining the
session until patches for all files have been generated.

The fix has been tested with linux.git by calling
`git_diff_tree_to_tree` and `git_diff_to_buf` with v4.10^{tree} and
v4.14^{tree}.

                | time    | .gitattributes stats
    without fix | 33.201s | 844614
    with fix    | 30.327s | 4441

While execution time only improved by roughly 10%, the stat(2) syscalls
for .gitattributes files decreased by 99.5%. The benchmarks were quite
simple with best-of-three timings on Linux ext4 systems. One can assume
that for network based file systems the performance gain will be a lot
larger due to a much higher latency.
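
The effect of a session on lookups for a missing file can be sketched as below. Every name here (`attr_session`, `attr_cache_entry`, `fake_stat_exists`) is a hypothetical stand-in and the stat call is simulated; this is not the libgit2 attribute cache itself.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical session: a nonzero key identifies one logical operation. */
struct attr_session {
	unsigned int key;
};

/* Hypothetical cache slot for one '.gitattributes' path. */
struct attr_cache_entry {
	int exists;               /* result of the last check */
	unsigned int checked_in;  /* key of the session that last checked; 0 = none */
};

static unsigned int stat_calls;  /* counts simulated stat calls */

/* Simulated stat: the file is always missing in this sketch. */
static int fake_stat_exists(void)
{
	stat_calls++;
	return 0;
}

/* Re-check a missing file at most once per session. */
int attr_file_exists(struct attr_cache_entry *e, const struct attr_session *s)
{
	assert(e);

	if (e->exists)
		return 1;
	if (s && e->checked_in == s->key)
		return 0;  /* already known to be missing in this session */

	e->exists = fake_stat_exists();
	e->checked_in = s ? s->key : 0;
	return e->exists;
}
```

Without a session every lookup of the missing file triggers a new check; with one session spanning the whole diff, the check happens once.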

committed 7 years ago

30 Nov, 2017 1 commit

diff_generate: fix unsetting diff flags · 5ca3f115

The macro `DIFF_FLAG_SET` can be used to set or unset a flag by
modifying the diff's bitmask. While the case of setting the flag is
handled correctly, the case of unsetting the flag was not. Instead of
inverting the flags, we are inverting the value which is used to decide
whether we want to set or unset the bits.

The value being used here is a simple `bool` which is `false`. As that
is being promoted to `int` when getting the bitwise-complement, we will
end up retaining all bits inside of the bitmask. As that's only ever
used to set `GIT_DIFF_IGNORE_CASE`, we were actually always ignoring
case for generated diffs.

Fix that by instead getting the bitwise-complement of `FLAG`, not `VAL`.
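
The difference between the two forms can be reproduced in isolation. The macro and flag names below are hypothetical and the macros operate on a plain bitmask rather than on a diff structure; `VAL` is passed as an `int` here, which is what a `bool` would be promoted to anyway.

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_IGNORE_CASE (1u << 10)  /* hypothetical flag bit */

/* Buggy form: complements VAL. With VAL == 0 the mask is ~0, i.e. all
 * ones, so "unsetting" clears nothing. */
#define FLAG_SET_BUGGY(FLAGS, FLAG, VAL) \
	((FLAGS) = (VAL) ? ((FLAGS) | (FLAG)) : ((FLAGS) & ~(VAL)))

/* Fixed form: complements FLAG, so unsetting clears exactly that bit. */
#define FLAG_SET_FIXED(FLAGS, FLAG, VAL) \
	((FLAGS) = (VAL) ? ((FLAGS) | (FLAG)) : ((FLAGS) & ~(FLAG)))

uint32_t set_flag_buggy(uint32_t flags, uint32_t flag, int val)
{
	assert(flag != 0);
	FLAG_SET_BUGGY(flags, flag, val);
	return flags;
}

uint32_t set_flag_fixed(uint32_t flags, uint32_t flag, int val)
{
	assert(flag != 0);
	FLAG_SET_FIXED(flags, flag, val);
	return flags;
}
```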

committed 7 years ago

18 Nov, 2017 1 commit

refcount: make refcounting conform to aliasing rules · 585b5dac

Strict aliasing rules dictate that for most data types, you are not
allowed to cast them to another data type and then access the cast
pointers. While this works just fine for most compilers, technically we
end up in undefined behaviour when we violate that rule.

Our current refcounting code makes heavy use of casting and thus
violates that rule. While we didn't have any problems with that code,
Travis started spitting out a lot of warnings due to a change in their
toolchain. In the refcounting case, the code is also easy to fix:
as all refcounting-statements are actually macros, we can just access
the `rc` field directly instead of casting.

There are two outliers in our code where that doesn't work. Both the
`git_diff` and `git_patch` structures have specializations for generated
and parsed diffs/patches, which directly inherit from them. Because of
that, the refcounting code is only part of the base structure and not of
the children themselves. We can work around that by instead passing
their base into `GIT_REFCOUNT_INC`, though.
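
A simplified, non-atomic sketch of the approach follows. The type and macro names are hypothetical and shortened; the point is only that the macro reaches the counter through the named `rc` member instead of through a pointer cast, and that a derived structure passes its base.

```c
#include <assert.h>

/* Hypothetical refcount header embedded in every counted structure. */
typedef struct {
	int refcount;
	void *owner;
} refcount_t;

/* Access the embedded `rc` member by name; no pointer cast involved. */
#define REFCOUNT_INC(p) ((void)++(p)->rc.refcount)
#define REFCOUNT_VAL(p) ((p)->rc.refcount)

/* Base structure carrying the counter. */
struct diff {
	refcount_t rc;
	int num_deltas;
};

/* "Derived" structure: embeds the base, has no `rc` member of its own. */
struct diff_generated {
	struct diff base;
	int extra;
};

void diff_generated_retain(struct diff_generated *d)
{
	assert(d);
	/* Pass the base, because that is where `rc` lives. */
	REFCOUNT_INC(&d->base);
}
```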

committed 7 years ago

03 Jul, 2017 1 commit

Make sure to always include "common.h" first · 0c7f49dd

Besides including several files, our "common.h" header also declares
various macros which are then used throughout the project. As such, we
have to make sure to always include this file first in all
implementation files. Otherwise, we might encounter problems or even
silent behavioural differences due to macros or defines not being
defined as they should be. So in fact, our header and implementation
files should make sure to always include "common.h" first.

This commit does so by establishing a common include pattern. Header
files inside of "src" will now always include "common.h" as their first
include, separated by a newline from all the other includes to make
it stand out as special. There are two cases for the implementation
files. If they do have a matching header file, they will always include
this one first, leading to "common.h" being transitively included as
first file. If they do not have a matching header file, they instead
include "common.h" as first file themselves.

This fixes the outlined problems and will become our standard practice
for header and source files inside of "src/" from now on.

committed 7 years ago

29 Dec, 2016 1 commit

giterr_set: consistent error messages · 909d5494

Error messages should be sentence fragments, and therefore:

1. Should not begin with a capital letter,
2. Should not conclude with punctuation, and
3. Should not end a sentence and begin a new one
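
The three rules can be approximated with a rough heuristic check. The function below is a hypothetical helper written for illustration, not something libgit2 provides, and the capital-letter test would wrongly reject a message that legitimately starts with an upper-case identifier.

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/* Returns 1 if `msg` looks like a sentence fragment, 0 otherwise. */
int errmsg_is_fragment(const char *msg)
{
	size_t len;

	assert(msg);

	len = strlen(msg);
	if (len == 0)
		return 0;
	if (isupper((unsigned char)msg[0]))
		return 0;  /* rule 1: no leading capital letter */
	if (strchr(".!?", msg[len - 1]))
		return 0;  /* rule 2: no trailing punctuation */
	if (strstr(msg, ". "))
		return 0;  /* rule 3: no second sentence */
	return 1;
}
```

For example, a made-up message such as "could not open index file" passes, while "Could not open index file." fails rules 1 and 2.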

committed 8 years ago

24 Aug, 2016 1 commit

Teach `git_patch_from_diff` about parsed diffs · b859faa6

Ensure that `git_patch_from_diff` can return the patch for parsed diffs,
not just generate a patch for a generated diff.

committed 8 years ago

26 May, 2016 3 commits
- introduce `git_diff_from_buffer` to parse diffs · 7166bb16
```
Parse diff files into a `git_diff` structure.
```
  Edward Thomson committed 8 years ago
- git_diff_generated: abstract generated diffs · 9be638ec
  Edward Thomson committed 8 years ago
  
- diff: include oid length in deltas · d68cb736
```
Now that `git_diff_delta` data can be produced by reading patch
file data, which may have an abbreviated oid, allow consumers to
know that the id is abbreviated.
```
  Edward Thomson committed 8 years ago
03 May, 2016 1 commit

diff: simplify code for handling empty dirs · fe3057b4

When determining diffs between two iterators we may need to
recurse into an unmatched directory for the "new" iterator when
it is either a prefix to the current item of the "old" iterator
or when untracked/ignored changes are requested by the user and
the directory is untracked/ignored.

When advancing into the directory and no files are found, we will
get back `GIT_ENOTFOUND`. If so, we simply skip the directory,
handling resulting unmatched old items in the next iteration. The
other case of `iterator_advance_into` returning either
`GIT_OK` or any other error but `GIT_ENOTFOUND` will be
handled by the caller, which will now either compare the first
directory entry of the "new" iterator in case of `GIT_OK`
or abort on other cases.

Improve readability of the code to make the above logic
clearer.
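
The control flow described above reduces to a three-way result. In this sketch the iterator is a hypothetical stub whose advance call simply returns a preset value, and the `SKETCH_*` codes are stand-ins for the library's error codes.

```c
#include <assert.h>

enum {
	SKETCH_OK = 0,
	SKETCH_ENOTFOUND = -3
};

/* Hypothetical iterator stub: `advance_result` is what advancing returns. */
struct iter {
	int advance_result;
};

static int iter_advance_into(struct iter *it)
{
	return it->advance_result;
}

/*
 * Returns 1 if the directory was empty and should be skipped, 0 if the
 * iterator now points at the directory's first entry, and a negative
 * value on any other error, which the caller treats as fatal.
 */
int enter_unmatched_dir(struct iter *new_iter)
{
	int error;

	assert(new_iter);

	error = iter_advance_into(new_iter);
	if (error == SKETCH_ENOTFOUND)
		return 1;  /* empty directory: skip it */
	return error;
}
```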

committed 8 years ago

24 Mar, 2016 1 commit
- iterator: cleanups · 9eb9e5fa
```
Remove some unused functions, refactor some ugliness.
```
  Edward Thomson committed 8 years ago
23 Mar, 2016 2 commits

diff: stop processing nitem when it's removed · 67885532
```
When a directory is removed out from underneath us, stop trying to
manipulate it.
```
Edward Thomson committed 8 years ago

iterator: combine fs+workdir iterators more completely · 0e0589fc

Drop some of the layers of indirection between the workdir and the
filesystem iterators.  This makes the code a little bit easier to
follow, and reduces the number of unnecessary allocations a bit as
well.  (Prior to this, when we filter entries, we would allocate them,
filter them and then free them; now we do the filtering before
allocation.)
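
Filtering before allocating can be illustrated with a small sketch. The filter rule (skip names starting with a dot) and all names are assumptions chosen for the example; the allocation counter exists only to make the saving visible.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

static size_t alloc_count;  /* counts allocations made by collect_entries */

/* Hypothetical filter: keep everything that does not start with '.'. */
static int wanted(const char *name)
{
	return name[0] != '.';
}

/*
 * Copy the wanted names into `out`, which must have room for `n` pointers.
 * Returns the number of entries kept, or -1 if an allocation failed.
 */
int collect_entries(char **out, const char *const *names, size_t n)
{
	size_t i, kept = 0;

	assert(out && names);

	for (i = 0; i < n; i++) {
		size_t len;

		if (!wanted(names[i]))
			continue;  /* rejected before any allocation happens */

		len = strlen(names[i]) + 1;
		if ((out[kept] = malloc(len)) == NULL) {
			while (kept > 0)
				free(out[--kept]);
			return -1;
		}
		alloc_count++;
		memcpy(out[kept++], names[i], len);
	}
	return (int)kept;
}
```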

Also, rename `git_iterator_advance_over_with_status` to just
`git_iterator_advance_over`.  Mostly because it's a fucking long-ass
function name otherwise.

committed 8 years ago

11 Feb, 2016 1 commit
- Horrible fix for #3173. · 3679ebae
  Arthur Schreiber committed 9 years ago
  
01 Dec, 2015 1 commit

diff: include commit message when formatting patch · 254e0a33

When formatting a patch as email we do not include the commit's
message in the formatted patch output. Implement this and add a
test that verifies behavior.

committed 9 years ago

23 Nov, 2015 1 commit

checkout: only consider nsecs when built that way · 25e84f95

When examining the working directory and determining whether it's
up-to-date, only consider the nanoseconds in the index entry when
built with `GIT_USE_NSEC`. This prevents us from believing that
the working directory is always dirty when the index was originally
written with a git client that understands nsecs (like git 2.x).
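
A minimal sketch of such a build-dependent comparison, using a hypothetical `SKETCH_USE_NSEC` macro and a hypothetical timestamp struct in place of the real build flag and index types:

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical timestamp as stored in an index entry. */
struct timestamp {
	int32_t seconds;
	uint32_t nanoseconds;
};

/* Returns 1 if the two timestamps are considered equal for this build. */
int timestamp_matches(const struct timestamp *index,
	const struct timestamp *workdir)
{
	assert(index && workdir);

	if (index->seconds != workdir->seconds)
		return 0;

#ifdef SKETCH_USE_NSEC
	/* Only builds with nanosecond support compare the nsec field. */
	if (index->nanoseconds != workdir->nanoseconds)
		return 0;
#endif

	return 1;
}
```

When the macro is not defined, an index entry written with nanoseconds still matches a working directory timestamp that only has second resolution.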

committed 9 years ago

02 Nov, 2015 1 commit
- Add diff progress callback. · 3138ad93
  Jason Haslam committed 9 years ago
  
28 Oct, 2015 1 commit
- pool: Simplify implementation · 1e5e02b4
  Vicent Marti committed 9 years ago
  