Commits · b7dcea0492a424e04d5441a91919701b89b5a572 · lvzhengyang / git2

05 Nov, 2019 6 commits

config_entries: micro-optimize storage of multivars · b7dcea04

Multivars are configuration entries that have many values for the same
name; we can thus micro-optimize this case by just retaining the name of
the first configuration entry and freeing all the others, letting them
point to the string of the first entry.

The attached test case is an extreme example that demonstrates this. It
contains a section name that is approximately 500kB in size with 20,000
entries "a=b". Without the optimization, this would require at least
20000*500kB, which is around 10GB. With this patch, it only
requires 500kB+20000*1B=520kB.
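As a rough check of these figures, the following sketch evaluates both estimates. The function names are hypothetical, and the per-entry cost of one byte mirrors the simplification used in the message rather than any real allocator overhead.

```c
#include <assert.h>
#include <limits.h>

/* Bytes spent on names when every entry owns a private copy of the name. */
static unsigned long long name_bytes_duplicated(unsigned long long entries,
	unsigned long long name_len)
{
	/* Guard against wrap-around in the multiplication. */
	assert(name_len == 0 || entries <= ULLONG_MAX / name_len);
	return entries * name_len;
}

/*
 * Bytes spent when only the first entry stores the name and every entry
 * adds a small, fixed per-entry cost (1 byte in the message's estimate).
 */
static unsigned long long name_bytes_shared(unsigned long long entries,
	unsigned long long name_len, unsigned long long per_entry)
{
	assert(per_entry == 0 || entries <= (ULLONG_MAX - name_len) / per_entry);
	return name_len + entries * per_entry;
}
```

With 20000 entries and a 500000-byte name, the first function yields 10^10 bytes, while the second stays in the hundreds of kilobytes.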

The obvious culprit here is the section header, which we repeatedly
include in each of the configuration entry's names. This makes it very
easy for an adversary to provide a small configuration file that
disproportionately blows up in memory during processing and is thus a
feasible vector for a denial-of-service attack. Unfortunately, we cannot
fix the root cause by e.g. having a separate "section" field that may
easily be deduplicated, because the `git_config_entry` structure is
part of our public API. So this micro-optimization is the best we can do
for now.

committed Nov 05, 2019


config_entries: only keep track of a single entry list · 62320860

Whenever adding a configuration entry to the config entries structure,
we allocate two list heads:

    - The first list head is added to the global list of config entries
      in order to be able to iterate over configuration entries in the
      order they were originally added.

    - The second list head is added to the map of entries in order to
      efficiently look up an entry by its name. If no entry with the
      same name exists in the map, then we add the new entry to the map
      directly. Otherwise, we append the new entry's list head to the
      pre-existing entry's list in order to keep track of multivars.

While the former use case is perfectly sound, the second use case can be
optimized. The only reason why we keep track of multivar entries in
another separate list is to be able to determine whether an entry is
unique or not by seeing whether its `next` pointer is set. So we keep
track of a complete list of multivar entries just to have a single bit
of information of whether it has other multivar entries with the same
entry name.

We can completely get rid of this secondary list by just adding a
`first` field to the list structure itself. When executing
`git_config_entries_append`, we will then simply check whether the
configuration map already has an entry with the same name -- if so, we
will set `first` to zero to indicate that it is not the initial
entry anymore. Instead of a second list head in the map, we can thus now
directly store the list head of the first global list inside of the map
and just refer to that bit.

Note that the more obvious solution would be to store a `unique` field
instead of a `first` field. But as we will only ever inspect the `first`
field of the _last_ entry that has been moved into the map, these are
semantically equivalent in that case.

Having a `first` field also allows for a minor optimization: for
multivar values, we can free the `name` field of all entries that are
_not_ first and have them point to the name of the first entry instead.
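The scheme described above can be sketched roughly as follows. This is a simplified illustration, not libgit2's actual code: the type and function names are hypothetical, and a linear scan stands in for the name-to-entry map.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, simplified stand-in for the single global entry list. */
typedef struct entry_node {
	struct entry_node *next;
	char *name;             /* owned only when `first` is set */
	char *value;
	unsigned int first : 1; /* 1 if no earlier entry has the same name */
} entry_node;

typedef struct {
	entry_node *head, *tail;
} entry_list;

static char *dup_string(const char *s)
{
	size_t len = strlen(s) + 1;
	char *copy = malloc(len);
	if (copy)
		memcpy(copy, s, len);
	return copy;
}

/* Stand-in for the map lookup: the most recently added entry with `name`. */
static entry_node *lookup_last(const entry_list *list, const char *name)
{
	entry_node *n, *found = NULL;
	for (n = list->head; n; n = n->next)
		if (strcmp(n->name, name) == 0)
			found = n;
	return found;
}

static int entry_list_append(entry_list *list, const char *name, const char *value)
{
	entry_node *prev, *node;

	assert(list && name && value);
	prev = lookup_last(list, name);
	if ((node = calloc(1, sizeof(*node))) == NULL)
		return -1;

	node->value = dup_string(value);
	/* Later multivar entries borrow the name string of the first entry. */
	node->name = prev ? prev->name : dup_string(name);
	node->first = (prev == NULL);

	if (!node->name || !node->value) {
		free(node->value);
		if (!prev)
			free(node->name);
		free(node);
		return -1;
	}

	if (list->tail)
		list->tail->next = node;
	else
		list->head = node;
	list->tail = node;
	return 0;
}

/* A name is unique iff the last entry added under it is still `first`. */
static int entry_is_unique(const entry_list *list, const char *name)
{
	entry_node *last = lookup_last(list, name);
	return last != NULL && last->first;
}

static void entry_list_clear(entry_list *list)
{
	entry_node *n = list->head, *next;
	while (n) {
		next = n->next;
		if (n->first)
			free(n->name); /* borrowed names must not be freed twice */
		free(n->value);
		free(n);
		n = next;
	}
	list->head = list->tail = NULL;
}
```

Only entries with `first` set own their `name`, which is why the cleanup function frees the string once per distinct name.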

committed Nov 05, 2019


config_entries: mark local functions as static · 8a88701e

Some functions which are only used in "config_entries.c" are not marked
as static, which is being fixed by this very commit.

committed Nov 05, 2019


Merge pull request #5293 from csware/config_snapshot-snapshot · 82d7a114
```
Fix crash if snapshotting a config_snapshot
```
Patrick Steinhardt committed Nov 05, 2019
Merge pull request #5295 from romkatv/fix-diff-res · 45c8d3f4
```
fix a bug introduced in 8a23597b
```
Patrick Steinhardt committed Nov 05, 2019
fix a bug introduced in 8a23597b · 1886478d
romkatv committed Nov 05, 2019


02 Nov, 2019 1 commit
- Merge pull request #5275 from pks-t/pks/reflogs-with-newlines · bf2911d7
```
reflogs: fix behaviour around reflogs with newlines
```
  Edward Thomson committed Nov 02, 2019
01 Nov, 2019 2 commits
- Fix crash if snapshotting a config_snapshot · dadbb33b
```
Signed-off-by: Sven Strickroth <email@cs-ware.de>
```
  Sven Strickroth committed Nov 01, 2019
- Merge pull request #5289 from libgit2/cmn/create-with-signature-verification · d5017a14
```
commit: verify objects exist in git_commit_with_signature
```
  Edward Thomson committed Nov 01, 2019
30 Oct, 2019 2 commits

commit: verify objects exist in git_commit_with_signature · 718f24ad

There can be a significant difference between the system where we created the
buffer (if at all) and the one where the caller provides us with the contents
of a commit.

Verify that the commit we are being asked to create references objects which do
exist in the target repository.

committed Oct 30, 2019


commit: add failing tests for object checking for git_commit_with_signature · 0974e02f

There can be a significant difference between the system where we created the
buffer (if at all) and the one where the caller provides us with the contents
of a commit.

Provide some test cases (we have to adapt the existing ones because they refer
to trees and commits which do not exist).

committed Oct 30, 2019


29 Oct, 2019 1 commit
- Merge pull request #5276 from pks-t/pks/patch-fuzzing-fixes · 2a7d6de3
```
patch_parse: fixes for fuzzing errors
```
  Patrick Steinhardt committed Oct 29, 2019
24 Oct, 2019 2 commits
- Merge pull request #5227 from ddevault/check · a31f4c4b
```
apply: add GIT_APPLY_CHECK
```
  Patrick Steinhardt committed Oct 24, 2019
- Merge pull request #5264 from henkesn/refs-unlock-on-commit · c405f231
```
refs: unlock unmodified refs on transaction commit
```
  Patrick Steinhardt committed Oct 24, 2019
22 Oct, 2019 1 commit

apply: add GIT_APPLY_CHECK · 02af1fcb

This adds an option which will check if a diff is applicable without
actually applying it; equivalent to git apply --check.

committed Oct 22, 2019


21 Oct, 2019 1 commit

patch_parse: detect overflow when calculating old/new line position · 37141ff7

When the patch contains lines close to INT_MAX, then it may happen that
we end up with an integer overflow when calculating the line of the
current diff hunk. Reject such patches as unreasonable to avoid the
integer overflow.

As the calculation is performed on integers, we introduce two new
helpers `git__add_int_overflow` and `git__sub_int_overflow` that perform
the integer overflow check in a generic way.

committed Oct 21, 2019


19 Oct, 2019 3 commits

patch_parse: fix out-of-bounds read with No-NL lines · 468e3ddc

We've got two locations where we copy lines into the patch. The first
one is when copying normal " ", "-" or "+" lines, while the second
location gets executed when we copy "\ No newline at end of file" lines.
While the first one correctly uses `git__strndup` to copy only until the
newline, the other one doesn't. Thus, if the line occurs at the end of
the patch and if there is no terminating NUL character, then it may
result in an out-of-bounds read.

Fix the issue by using `git__strndup`, as was already done in the other
location. Furthermore, add allocation checks to both locations to detect
out-of-memory situations.
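The difference between a bounded and an unbounded copy can be illustrated with standard C functions. The helper below is a hypothetical stand-in, not the library's `git__strndup`; it copies at most `len` bytes, stops at the first newline, and reports allocation failure to the caller.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/*
 * Copy one line, without its trailing newline, out of a buffer that is
 * `len` bytes long and not necessarily NUL-terminated.
 * Returns NULL on allocation failure; the caller must check for that.
 */
static char *copy_line(const char *buf, size_t len)
{
	const char *nl;
	size_t n;
	char *out;

	assert(buf != NULL || len == 0);

	/* Never look past `len` bytes, even if no newline or NUL is present. */
	nl = len ? memchr(buf, '\n', len) : NULL;
	n = nl ? (size_t)(nl - buf) : len;

	if ((out = malloc(n + 1)) == NULL)
		return NULL;
	if (n)
		memcpy(out, buf, n);
	out[n] = '\0';
	return out;
}
```

An unbounded `strdup`-style copy would keep reading until it found a NUL byte, which is exactly what goes wrong when the line is the last thing in an unterminated buffer.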

committed Oct 19, 2019


patch_parse: reject empty path names · 6c6c15e9

When parsing patch headers, we currently accept empty path names just
fine, e.g. a line "--- \n" would be parsed as the empty filename. This
is not a valid patch format and may cause `NULL` pointer accesses at a
later place as `git_buf_detach` will return `NULL` in that case.

Reject such patches as malformed with a nice error message.

committed Oct 19, 2019


patch_parse: reject patches with multiple old/new paths · 223e7e43

It's currently possible to have patches with multiple old path name
headers. As we didn't check for this case, this resulted in a memory
leak when overwriting the old old path with the new old path because we
simply discarded the old pointer.

Instead of fixing this by free'ing the old pointer, we should reject
such patches altogether. It doesn't make any sense for the "---" or
"+++" markers to occur multiple times within a patch in the first place.
This also implicitly fixes the memory leak.
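One way to picture this kind of header validation is a setter that refuses to overwrite a path that is already present and also refuses an empty one. The struct and function below are hypothetical and only sketch the idea; they are not the parser's real data structures.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical holder for the paths taken from "---" and "+++" lines. */
typedef struct {
	char *old_path;
	char *new_path;
} patch_paths;

/*
 * Store a copy of `path` (of length `len`) in *slot.
 * Returns -1 without touching *slot if the header was already seen,
 * if the path is empty, or if the allocation fails.
 */
static int set_path_once(char **slot, const char *path, size_t len)
{
	char *copy;

	assert(slot != NULL && path != NULL);

	if (*slot != NULL)
		return -1; /* repeated header: reject instead of leaking */
	if (len == 0)
		return -1; /* empty path name: malformed patch */
	if ((copy = malloc(len + 1)) == NULL)
		return -1;

	memcpy(copy, path, len);
	copy[len] = '\0';
	*slot = copy;
	return 0;
}
```

Because the old pointer is never overwritten, rejecting the second header removes the leak without any extra `free` call.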

committed Oct 19, 2019


18 Oct, 2019 6 commits

Merge pull request #5269 from durin42/fuzzpatch · b246bed5
```
fuzzers: add a new fuzzer for patch parsing
```
Patrick Steinhardt committed Oct 18, 2019

refdb_fs: properly parse corrupted reflogs · 7968e90f

In previous versions, libgit2 could be coerced into writing reflog
messages with embedded newlines into the reflog by using
`git_stash_save` with a message containing newlines. While the root
cause is fixed now, it was noticed that upstream git is in fact able to
read such corrupted reflog messages just fine.

Make the reflog parser more lenient in order to just skip over
malformatted reflog lines to bring us in line with git. This requires us
to change an existing test that verified that we do indeed _fail_ to
parse such logs.
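The lenient behaviour can be sketched as a loop that walks the log line by line and skips anything that does not look like an entry. This is a heavily simplified illustration with hypothetical names: it assumes 40-character hexadecimal object IDs and only checks that a line starts with two of them, each followed by a space, whereas a real parser validates much more.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

#define HEX_LEN 40 /* assumed length of a hexadecimal object ID */

static int is_hex_run(const char *s, size_t n)
{
	size_t i;
	for (i = 0; i < n; i++)
		if (!isxdigit((unsigned char)s[i]))
			return 0;
	return 1;
}

/* Simplified check: "<40 hex> <40 hex> " at the start of the line. */
static int looks_like_entry(const char *line, size_t len)
{
	return len >= 2 * HEX_LEN + 2 &&
	       is_hex_run(line, HEX_LEN) && line[HEX_LEN] == ' ' &&
	       is_hex_run(line + HEX_LEN + 1, HEX_LEN) &&
	       line[2 * HEX_LEN + 1] == ' ';
}

/* Count well-formed entries, silently skipping malformed lines. */
static size_t count_entries_lenient(const char *buf, size_t len)
{
	size_t count = 0;

	assert(buf != NULL || len == 0);

	while (len > 0) {
		const char *nl = memchr(buf, '\n', len);
		size_t line_len = nl ? (size_t)(nl - buf) : len;

		if (looks_like_entry(buf, line_len))
			count++;
		/* A malformed line is skipped rather than treated as an error. */

		if (!nl)
			break;
		len -= line_len + 1;
		buf = nl + 1;
	}
	return count;
}
```

A message with an embedded newline shows up as one valid line followed by a stray fragment; the loop keeps the former and ignores the latter.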

committed Oct 18, 2019


refdb_fs: convert reflog parsing to use parser · 8532ed11

The refdb_fs code to parse the reflog currently uses a hand-rolled
parser. Convert it to use our `git_parse_ctx` structure instead.

committed Oct 18, 2019


reflog: allow adding entries with newlines in their message · d8233feb

Currently, the reflog disallows any entries that have a message with
newlines, as that would effectively break the reflog format, which may
contain a single line per entry, only. Upstream git behaves a bit
differently, though, especially when considering stashes: instead of
rejecting any reflog entry with newlines, git will simply replace
newlines with spaces. E.g. executing 'git stash push -m "foo\nbar"' will
create a reflog entry with "foo bar" as entry message.

This commit adjusts our own logic to stop rejecting commit messages with
newlines. Previously, this logic was part of `git_reflog_append`, only.
There is a second place though where we add reflog entries, which is the
serialization code in the filesystem refdb. As it didn't contain any
sanity checks whatsoever, the refdb would have been perfectly happy to
write malformatted reflog entries to the disk. This is being fixed with
the same logic as for the reflog itself.
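The replacement rule described above, where each newline becomes a space, can be sketched in a few lines. The function name is hypothetical, and the sketch does not attempt to reproduce any further normalization that git or the library might apply.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/*
 * Return a newly allocated copy of `msg` in which every newline has been
 * replaced by a space, so that the message fits on a single reflog line.
 * Returns NULL on allocation failure.
 */
static char *sanitize_reflog_message(const char *msg)
{
	size_t len, i;
	char *out;

	assert(msg != NULL);

	len = strlen(msg);
	if ((out = malloc(len + 1)) == NULL)
		return NULL;

	for (i = 0; i < len; i++)
		out[i] = (msg[i] == '\n') ? ' ' : msg[i];
	out[len] = '\0';
	return out;
}
```

Applying the same function in both places that write reflog entries keeps the on-disk format at one line per entry.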

committed Oct 18, 2019


stash: refactor code that prepares commit messages · 28481609
Patrick Steinhardt committed Oct 18, 2019


stash: modernize code style of `git_stash_save` · ca2d34a8

The code style of `git_stash_save` doesn't really match our current
coding style. Update it to match our current policies more closely.

committed Oct 18, 2019


17 Oct, 2019 9 commits

fuzzers: add a new fuzzer for patch parsing · 92e011a7

I was looking at this code anyway because the sr.ht people nerdsniped
me, and it gave me that "I should fuzz this" feeling. So have a fuzzer!

committed Oct 17, 2019


Merge pull request #5273 from dlax/parse-diff-without-extended-headers · c9464bf7
```
patch_parse: handle patches without extended headers
```
Patrick Steinhardt committed Oct 17, 2019
Merge pull request #4637 from tiennou/feature/submodule-easy-clone · 4c0dea5a
```
Provide a wrapper for simple submodule clone steps
```
Patrick Steinhardt committed Oct 17, 2019

tests: submodule: test cloning edge cases · 73e9535d

Add two more tests that verify our behaviour in some edge cases, notably
when cloning into a non-empty directory and when cloning the same
submodule twice.

committed Oct 17, 2019


tests: submodule: make use of sandboxes to clean repos · de412fc2

The test submodule::add::submodule_clone doesn't use a sandbox, and thus
the created repo will not get deleted after the test has finished.
Convert the test to use the empty standard repo sandbox instead to fix
this.

committed Oct 17, 2019


tests: submodule: fix tests for cloning submodules · 09b1ac11

The test submodule::add::homemade_clone unfortunately doesn't test
what's expected, but does instead clone the submodule to a directory
that is outside of the parent repository. Fixing this by cloning to the
correct location isn't possible, though, as `git_submodule_add_setup`
will have pre-created a ".git" file already, which will cause
`git_clone` to error out.

As it's not possible to perform the clone without fiddling around with
the repo's layout, let's just remove this test as that is in fact what
the new `git_submodule_clone` function is for.

committed Oct 17, 2019


refs: unlock unmodified refs on transaction commit · 47531f47

Refs which are locked in a transaction without an altered target
should still be unlocked on `git_transaction_commit`.
`git_transaction_free` also unlocks refs, but the moment at which
`git_transaction_free` is called cannot be controlled in all situations.
Some binding libraries call `git_transaction_free` on garbage collection, or
not at all if the application exits first, and don't provide public access to
`git_transaction_free`.
It is better to release locks as soon as possible.

committed Oct 17, 2019


submodule: provide a wrapper for simple submodule clone steps · 3c5d78bd
Etienne Samson committed Oct 17, 2019

Merge pull request #5238 from tiennou/feature/macos-gss · 0298e0a6
```
macOS GSS Support
```
Patrick Steinhardt committed Oct 17, 2019

16 Oct, 2019 2 commits

patch_parse: handle patches without extended headers · 11de594f

Extended header lines (especially the "index <hash>..<hash> <mode>") are
not required by "git apply" in order to import patches. So we allow the
from-file/to-file lines (--- a/file\n+++ b/file) to directly follow the
git diff header.

This fixes #5267.

committed Oct 16, 2019


Merge pull request #5265 from tiennou/fix/xcode-11 · 9333fc2a
```
cmake: correct the link stanza for CoreFoundation
```
Edward Thomson committed Oct 16, 2019

13 Oct, 2019 4 commits
- cmake: correct the link stanza for CoreFoundation · a088a1ff
```
LIBRARIES is the (absolute?) path to the library.
LDFLAGS is the full linker stanza to correctly link with this lib.

By passing LIBRARIES as LIBGIT_LIBS, the linker ends up with the
absolute path for the SDK'ed version of CoreFoundation (which doesn't
exist), instead of the familiar `-framework CoreFoundation`.
```
  Etienne Samson committed Oct 13, 2019
- azure: enable GSS on macOS builds · 3e862514
  Etienne Samson committed Oct 13, 2019
  
- negotiate: use GSS.framework on macOS · dbc17a7e
  Etienne Samson committed Oct 13, 2019
  
- cmake: remove extra GIT_NTLM define · 0eecb660
  Etienne Samson committed Oct 13, 2019
  