Commits · e0973bc0fc2b04b6bb27e4ce4db2b37e9577a75d · lvzhengyang / git2

28 Apr, 2017 8 commits

odb: verify hashes in read_prefix_1 · e0973bc0

While the function reading an object from the complete OID already
verifies OIDs, we do not yet do so for reading objects from a partial
OID. Do so when strict OID verification is enabled.

committed Apr 28, 2017

e0973bc0 Browse Files

odb: improve error handling in read_prefix_1 · 14109620

The read_prefix_1 function has several return statements springled
throughout the code. As we have to free memory upon getting an error,
the free code has to be repeated at every single retrun -- which it is
not, so we have a memory leak here.

Refactor the code to use the typical `goto out` pattern, which will free
data when an error has occurred. While we're at it, we can also improve
the error message thrown when multiple ambiguous prefixes are found. It
will now include the colliding prefixes.

committed Apr 28, 2017

14109620 Browse Files

odb: add option to turn off hash verification · 35079f50

Verifying hashsums of objects we are reading from the ODB may be costly
as we have to perform an additional hashsum calculation on the object.
Especially when reading large objects, the penalty can be as high as
35%, as can be seen when executing the equivalent of `git cat-file` with
and without verification enabled. To mitigate for this, we add a global
option for libgit2 which enables the developer to turn off the
verification, e.g. when he can be reasonably sure that the objects on
disk won't be corrupted.

committed Apr 28, 2017

35079f50 Browse Files

odb: verify object hashes · 28a0741f

The upstream git.git project verifies objects when looking them up from
disk. This avoids scenarios where objects have somehow become corrupt on
disk, e.g. due to hardware failures or bit flips. While our mantra is
usually to follow upstream behavior, we do not do so in this case, as we
never check hashes of objects we have just read from disk.

To fix this, we create a new error class `GIT_EMISMATCH` which denotes
that we have looked up an object with a hashsum mismatch. `odb_read_1`
will then, after having read the object from its backend, hash the
object and compare the resulting hash to the expected hash. If hashes do
not match, it will return an error.

This obviously introduces another computation of checksums and could
potentially impact performance. Note though that we usually perform I/O
operations directly before doing this computation, and as such the
actual overhead should be drowned out by I/O. Running our test suite
seems to confirm this guess. On a Linux system with best-of-five
timings, we had 21.592s with the check enabled and 21.590s with the
ckeck disabled. Note though that our test suite mostly contains very
small blobs only. It is expected that repositories with bigger blobs may
notice an increased hit by this check.

In addition to a new test, we also had to change the
odb::backend::nonrefreshing test suite, which now triggers a hashsum
mismatch when looking up the commit "deadbeef...". This is expected, as
the fake backend allocated inside of the test will return an empty
object for the OID "deadbeef...", which will obviously not hash back to
"deadbeef..." again. We can simply adjust the hash to equal the hash of
the empty object here to fix this test.

committed Apr 28, 2017

28a0741f Browse Files

tests: object: test looking up corrupted objects · d59dabe5

We currently have no tests which check whether we fail reading corrupted
objects. Add one which modifies contents of an object stored on disk and
then tries to read the object.

committed Apr 28, 2017

d59dabe5 Browse Files

tests: object: create sandbox · 86c03552

The object::lookup tests do use the "testrepo.git" repository in a
read-only way, so we do not set up the repository as a sandbox but
simply open it. But in a future commit, we will want to test looking up
objects which are corrupted in some way, which requires us to modify the
on-disk data. Doing this in a repository without creating the sandbox
will modify contents of our libgit2 repository, though.

Create the repository in a sandbox to avoid this.

committed Apr 28, 2017

86c03552 Browse Files

tests: odb: make hash of fake backend configurable · e29e8029

In the odb::backend::nonrefreshing test suite, we set up a fake backend
so that we are able to determine if backend functions are called
correctly. During the setup, we also parse an OID which is later on used
to read out the pseudo-object. While this procedure works right now, it
will create problems later when we implement hash verification for
looked up objects. The current OID ("deadbeef") will not match the hash
of contents we give back to the ODB layer and thus cannot be verified.

Make the hash configurable so that we can simply switch the returned for
single tests.

committed Apr 28, 2017

e29e8029 Browse Files

Merge pull request #4191 from pks-t/pks/wt-ref-renames · 7df580fa
```
Branch renames with worktrees
```
Edward Thomson committed Apr 28, 2017
7df580fa Browse Files

26 Apr, 2017 5 commits

Merge pull request #4219 from pks-t/pks/socket-stream-addrinfo-loop · 6cf25a39
```
socket_stream: continue to next addrinfo on socket creation failure
```
Edward Thomson committed Apr 26, 2017
6cf25a39 Browse Files
Merge pull request #4217 from pks-t/pks/readonly-cfg-backend · cecd41fb
```
Honor read-only flag when writing to config backends
```
Edward Thomson committed Apr 26, 2017
cecd41fb Browse Files

socket_stream: continue to next addrinfo on socket creation failure · 954e06a8

When connecting to a remote via socket stream, we first use getaddrinfo
to obtain the possible connection methods followed by creating and
connecting the socket. But when creating the socket, we error out as
soon as we get an invalid socket instead of trying out other address
hints returned by addrinfo.

Fix this by continuing on invalid socket instead of returning an error.
This fixes connection establishment with musl libc.

committed Apr 26, 2017

954e06a8 Browse Files

tests: config: verify functionality with read-only backends · 2a7086fa
Patrick Steinhardt committed Apr 26, 2017

2a7086fa Browse Files

config: skip r/o backends when writing · 95f29fb3

Configuration backends have a readonly-flag which is currently used to
distinguish configuration snapshots. But somewhat unexpectedly, we do
not use the flag to prevent writing to a readonly backend but happily
proceed to do so.

This commit modifies logic to also honor the readonly flag for
configuration setters. We will now traverse through all backends and
pick the first one which is not marked as read-only whenever we want to
write new configuration.

committed Apr 26, 2017

95f29fb3 Browse Files

25 Apr, 2017 3 commits

Merge pull request #4215 from pks-t/pks/diff-stack-free · 7f75eeaf
```
diff_parse: free object instead of its pointer
```
Edward Thomson committed Apr 25, 2017
7f75eeaf Browse Files

config_file: add missing include for `git_config_backend` · 64244131

The config_file.h header provides some inline declarations accessing the
`git_config_backend`, but misses its declaration. Add the missing
include for "git2/sys/config.h" to add it.

committed Apr 25, 2017

64244131 Browse Files

diff_parse: free object instead of its pointer · 1cb30b1b

In e7330016 (diff_parse: check return value of `git_diff_init_options`,
2017-03-20), we've introduced an error check whether we're able to
correctly initialize the diff options. This simple commit actually
introduced a segfault in that we now try to free the pointer to the
allocated diff in an error case, instead of the allocated diff itself.
This commit fixes the issue.

committed Apr 25, 2017

1cb30b1b Browse Files

21 Apr, 2017 3 commits
- Merge pull request #4210 from pks-t/pks/misc-fixes · 0d2f6824
```
Misc fixes
```
  Patrick Steinhardt committed Apr 21, 2017
  0d2f6824 Browse Files
- tests: threads::diff: fix warning for unused variable · 13c275ab
```
The threads::diff test suite has a static variable `_retries`, which is
used on Windows platforms only. As it is unused on other systems, the
compiler throws a warning there. Fix the warning by wrapping the
declaration in an ifdef.
```
  Patrick Steinhardt committed Apr 21, 2017
  13c275ab Browse Files
- global: fix typo in `git_libgit2_init` description · f4d1592c
  Patrick Steinhardt committed Apr 21, 2017
  
  f4d1592c Browse Files
17 Apr, 2017 2 commits
- Merge pull request #4192 from libgit2/ethomson/win32_posix · 8d89e409
```
Refactor some of the win32 POSIX emulation
```
  Carlos Martín Nieto committed Apr 17, 2017
  8d89e409 Browse Files
- win32: `remediation` not `cleanup` · 86536c7e
```
The `remediation` function is run in the retry loop in order to attempt
to fix any problems that the prior run encountered.  There is nothing
"cleaned up".  Clarify the name.
```
  Edward Thomson committed Apr 17, 2017
  86536c7e Browse Files
12 Apr, 2017 2 commits

Merge pull request #4201 from pks-t/pks/fileops-fd-leak · f9d3b0d0
```
fileops: fix leaking fd in `mmap_ro_file`
```
Edward Thomson committed Apr 12, 2017
f9d3b0d0 Browse Files

fileops: fix leaking fd in `mmap_ro_file` · 38b6e700

When the `git_futils_mmap_ro_file` function encounters an error after
the file has been opened, it will do a simple returns. Instead, we
should close the opened file descriptor to avoid a leak. This commit
fixes the issue.

committed Apr 12, 2017

38b6e700 Browse Files

11 Apr, 2017 4 commits
- Merge pull request #4196 from pks-t/pks/filter-segfault · d476d024
```
filter: only close filter if it's been initialized correctly
```
  Edward Thomson committed Apr 11, 2017
  d476d024 Browse Files
- Merge pull request #4195 from pks-t/pks/openssl-1.1 · a5781e2a
```
Fix building against OpenSSL v1.1
```
  Edward Thomson committed Apr 11, 2017
  a5781e2a Browse Files
- Merge pull request #4198 from pks-t/pks/git-compat · 1262963a
```
README: document our relation to changes in upstream
```
  Edward Thomson committed Apr 11, 2017
  1262963a Browse Files
- README: document our relation to changes in upstream · 19a04f67
```
libgit2 is a mere consumer of changes which are trickling down from the
upstream git.git project. This commit documents the ramifications caused
by this relation.
```
  Patrick Steinhardt committed Apr 11, 2017
  19a04f67 Browse Files
10 Apr, 2017 2 commits

openssl_stream: use new initialization function on OpenSSL version >=1.1 · 88520151

Previous to OpenSSL version 1.1, the user had to initialize at least the error
strings as well as the SSL algorithms by himself. OpenSSL version 1.1 instead
provides a new function `OPENSSL_init_ssl`, which handles initialization of all
subsystems. As the new API call will by default load error strings and
initialize the SSL algorithms, we can safely replace these calls when compiling
against version 1.1 or later.

This fixes a compiler error when compiling against OpenSSL version 1.1 which has
been built without stubs for deprecated syntax.

committed Apr 10, 2017

88520151 Browse Files

openssl_stream: remove locking initialization on OpenSSL version >=1.1 · 29081c2f

Up to version 1.0, OpenSSL required us to provide a callback which implements
a locking mechanism. Due to problems in the API design though this mechanism was
inherently broken, especially regarding that the locking callback cannot report
errors in an obvious way. Due to this shortcoming, the locking initialization
has been completely removed in OpenSSL version 1.1. As the library has also been
refactored to not make any use of these callback functions, we can safely remove
all initialization of the locking subsystem if compiling against OpenSSL version
1.1 or higher.

This fixes a compilation error when compiling against OpenSSL version 1.1 which
has been built without stubs for deprecated syntax.

committed Apr 10, 2017

29081c2f Browse Files

07 Apr, 2017 3 commits

filter: only close filter if it's been initialized correctly · cf07db2f

In the function `git_filter_list_stream_data`, we initialize, write and
subesquently close the stream which should receive content processed by
the filter. While we skip writing to the stream if its initialization
failed, we still try to close it unconditionally -- even if the
initialization failed, where the stream might not be set at all, leading
us to segfault.

Semantics in this code is not really clear. The function handling the
same logic for files instead of data seems to do the right thing here in
only closing the stream when initialization succeeded. When stepping
back a bit, this is only reasonable: if a stream cannot be initialized,
the caller would not expect it to be closed again. So actually, both
callers of `stream_list_init` fail to do so. The data streaming function
will always close the stream and the file streaming function will not
close the stream if writing to it has failed.

The fix is thus two-fold:

- callers of `stream_list_init` now close the stream iff it has been
  initialized
- `stream_list_init` now closes the lastly initialized stream if
  the current stream in the chain failed to initialize

Add a test which segfaulted previous to these changes.

committed Apr 07, 2017

cf07db2f Browse Files

Merge pull request #4183 from pks-t/pks/coverity · e572b631
```
Coverity
```
Edward Thomson committed Apr 07, 2017
e572b631 Browse Files
Merge pull request #4193 from pks-t/pks/libdir · 44998cdb
```
pkgconfig: fix handling of prefixes containing whitespaces
```
Edward Thomson committed Apr 07, 2017
44998cdb Browse Files

05 Apr, 2017 8 commits

pkgconfig: fix handling of prefixes containing whitespaces · 22436f29

Our libgit2.pc.in file is quoting the `libdir` variable in our declared
"Libs:" line. The intention is to handle whitespaces here, but pkgconfig
already does so by automatically escaping whitespace with backslashes.
The correct thing to do is to instead quote the prefix, as this is the
one which is being substituted by CMake upon installation. As both
libdir and includedir will be expanded to "${prefix}/lib" and
"${prefix}/include", respectively, pkgconfig will also correctly escape
whitespaces.

Note that this will actually break when a user manually wants to
override libdir and includedir with a path containing whitespace. But
actually, this cannot be helped, as always quoting these variables will
actuall break the common case of being prefixed with "${prefix}". So we
just bail out here and declare this as unsupported out of the box.

committed Apr 05, 2017

22436f29 Browse Files

refs: update worktree HEADs when renaming branches · 2a485dab

Whenever we rename a branch, we update the repository's symbolic HEAD
reference if it currently points to the branch that is to be renamed.
But with the introduction of worktrees, we also have to iterate over all
HEADs of linked worktrees to adjust them. Do so.

committed Apr 05, 2017

2a485dab Browse Files

branch: use `foreach_head` to see if a branch is checked out · 38fc5ab0

Previously, we have extracted the logic to find and iterate over all
HEADs of a repository. Use this function in `git_branch_is_checked_out`.

committed Apr 05, 2017

38fc5ab0 Browse Files

repository: add function to iterate over all HEADs · 74511aa2

While we already provide functions to get the current repository's HEAD,
it is quite involved to iterate over HEADs of both the repository and
all linked work trees. This commit implements a function
`git_repository_foreach_head`, which accepts a callback which is then
called for all HEAD files.

committed Apr 05, 2017

74511aa2 Browse Files

repository: get worktree HEAD via `git_reference__read_head` · 3e84aa50

The functions `git_repository_head_for_worktree` and
`git_repository_detached_head_for_worktree` both implement their
own logic to read the HEAD reference file. Use the new function
`git_reference__read_head` instead to unify the code paths.

committed Apr 05, 2017

3e84aa50 Browse Files

repository: extract function to get path to a file in a work tree · 987f5659

The function `read_worktree_head` has the logic embedded to construct
the path to `HEAD` in the work tree's git directory, which is quite
useful for other callers. Extract the logic into its own function to
make it reusable by others.

committed Apr 05, 2017

987f5659 Browse Files

repository: set error message if trying to set HEAD to a checked out one · 8242cc1a

If trying to set the HEAD of a repository to another reference, we have
to check whether this reference is already checked out in another linked
work tree. If it is, we will refuse setting the HEAD and return an
error, but do not set a meaningful error message. Add one.

committed Apr 05, 2017

8242cc1a Browse Files

refs: implement function to read references from file · 5b65ac25

Currently, we only provide functions to read references directly from a
repository's reference store via e.g. `git_reference_lookup`. But in
some cases, we may want to read files not connected to the current
repository, e.g. when looking up HEAD of connected work trees. This
commit implements `git_reference__read_head`, which will read out and
allocate a reference at an arbitrary path.

committed Apr 05, 2017

5b65ac25 Browse Files