Commits · 0e0472686680102ebfc81e85089273990fb86757 · lvzhengyang / git2

11 May, 2021 2 commits
- buf: bom enum is in the buf namespace · 14f6950b
```
Instead of a `git_bom_t` that a `git_buf` function returns, let's keep
it `git_buf_bom_t`.
```
  Edward Thomson committed 3 years ago
  14f6950b Browse Directory
- buf: remove internal `git_buf_text` namespace · d525e063
```
The `git_buf_text` namespace is unnecessary and strange.  Remove it,
just keep the functions prefixed with `git_buf`.
```
  Edward Thomson committed 3 years ago
  d525e063 Browse Directory
11 Apr, 2021 1 commit
- tests: fix variable name in list.c · 52447c5c
  Tobias Nießen committed 3 years ago
  
  52447c5c Browse Directory
05 Jun, 2020 1 commit
- clar: include the function name · cad7a1ba
  Edward Thomson committed 4 years ago
  
  cad7a1ba Browse Directory
01 Jun, 2020 1 commit

strarray: we should `dispose` instead of `free` · 51eff5a5

We _dispose_ the contents of objects; we _free_ objects (and their
contents).  Update `git_strarray_free` to be `git_strarray_dispose`.
`git_strarray_free` remains as a deprecated proxy function.

committed 4 years ago

51eff5a5 Browse Directory

18 Feb, 2020 1 commit

tests: object: decrease number of concurrent cache accesses · eaa70c6c

In our test case object::cache::fast_thread_rush, we're creating 100
concurrent threads opening a repository and reading objects from it.
This test actually fails on ARM32 with an out-of-memory error, which
isn't entirely unexpected.

Work around the issue by halving the number of threads.

committed 4 years ago

eaa70c6c Browse Directory

10 Dec, 2019 2 commits
- tree: ensure we protect NTFS paths everywhere · ba4c769b
  Edward Thomson committed 5 years ago
  
  ba4c769b Browse Directory
- test: ensure treebuilder validate new protection rules · f3b28604
```
Ensure that the new protection around .git::$INDEX_ALLOCATION rules are
enabled for using the treebuilder when core.protectNTFS is set.
```
  Edward Thomson committed 5 years ago
  f3b28604 Browse Directory
25 Nov, 2019 1 commit
- internal: use off64_t instead of git_off_t · 6460e8ab
```
Prefer `off64_t` internally.
```
  Edward Thomson committed 5 years ago
  6460e8ab Browse Directory
20 Jul, 2019 1 commit

fileops: rename to "futils.h" to match function signatures · e54343a4

Our file utils functions all have a "futils" prefix, e.g.
`git_futils_touch`. One would thus naturally guess that their
definitions and implementation would live in files "futils.h" and
"futils.c", respectively, but in fact they live in "fileops.h".

Rename the files to match expectations.

committed 5 years ago

e54343a4 Browse Directory

24 Jun, 2019 1 commit
- object: use literal constant in bigfile test · 3edbc441
```
Don't calculate 4 GiB as that will produce a compiler warning on MSVC.
Just hardcode it.
```
  Edward Thomson committed 5 years ago
  3edbc441 Browse Directory
23 Jun, 2019 1 commit
- largefile tests: only write 2GB on 32-bit platforms · 8eb910b0
```
Don't try to feed 4 GB of data to APIs that only take a `size_t` on
32-bit platforms.
```
  Edward Thomson committed 5 years ago
  8eb910b0 Browse Directory
15 Jun, 2019 2 commits

index: rename `frombuffer` to `from_buffer` · 6574cd00

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the index functions for
consistency with the rest of the library.

committed 5 years ago

6574cd00 Browse Directory

blob: add underscore to `from` functions · 08f39208

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the blob functions for
consistency with the rest of the library.

committed 5 years ago

08f39208 Browse Directory

14 Jun, 2019 1 commit

tests: object: refactor largefile test to not use `p_fallocate` · 0c2d0d4b

The `p_fallocate` platform is currently in use in our tests,
only, but it proved to be quite burdensome to get it implemented
in a cross-platform way. The only "real" user is the test
object::tree::read::largefile, where it's used to allocate a
large file in the filesystem only to commit it to the repo and
read its object back again. We can simplify this quite a bit by
just using an in-memory buffer of 4GB. Sure, this cannot be used
on platforms with low resources. But creating 4GB files is not
any better, and we already skip the test if the environment
variable "GITTEST_INVASIVE_FS_SIZE" is not set. So we're arguably
not worse off than before.

committed 5 years ago

0c2d0d4b Browse Directory

07 Jun, 2019 1 commit

tests: object: consolidate cache tests · 1f47efc4

The object::cache test module has two tests that do nearly the
same thing: given a cache limit, load a certain set of objects
and verify if those objects have been cached or not.

Convert those tests to the new data-driven initializers to
demonstrate how these are to be used. Furthermore, add some
additional test data. This conversion is mainly done to show this
new facility.

committed 5 years ago

1f47efc4 Browse Directory

04 Apr, 2019 1 commit
- tests: test largefiles on win32 · fb7614c0
  Edward Thomson committed 5 years ago
  
  fb7614c0 Browse Directory
30 Jan, 2019 1 commit
- tests: test that largefiles can be read through the tree API · 4e3949b7
  Etienne Samson committed 5 years ago
  
  4e3949b7 Browse Directory
22 Jan, 2019 1 commit
- git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
  Edward Thomson committed 5 years ago
  f673e232 Browse Directory
17 Jan, 2019 1 commit

object_type: GIT_OBJECT_BAD is now GIT_OBJECT_INVALID · cd350852

We use the term "invalid" to refer to bad or malformed data, eg
`GIT_REF_INVALID` and `GIT_EINVALIDSPEC`.  Since we're changing the
names of the `git_object_t`s in this release, update it to be
`GIT_OBJECT_INVALID` instead of `BAD`.

committed 6 years ago

cd350852 Browse Directory

01 Dec, 2018 2 commits
- object_type: use new enumeration names · 168fe39b
```
Use the new object_type enumeration names within the codebase.
```
  Edward Thomson committed 6 years ago
  168fe39b Browse Directory
- index: use new enum and structure names · 18e71e6d
```
Use the new-style index names throughout our own codebase.
```
  Edward Thomson committed 6 years ago
  18e71e6d Browse Directory
02 Nov, 2018 3 commits

tree: fix integer overflow when reading unreasonably large filemodes · 7fafec0e

The `parse_mode` option uses an open-coded octal number parser. The
parser is quite naive in that it simply parses until hitting a character
that is not in the accepted range of '0' - '7', completely ignoring the
fact that we can at most accept a 16 bit unsigned integer as filemode.
If the filemode is bigger than UINT16_MAX, it will thus overflow and
provide an invalid filemode for the object entry.

Fix the issue by using `git__strntol32` instead and doing a bounds
check. As this function already handles overflows, it neatly solves the
problem.

Note that previously, `parse_mode` was also skipping the character
immediately after the filemode. In proper trees, this should be a simple
space, but in fact the parser accepted any character and simply skipped
over it. As a consequence of using `git__strntol32`, we now need to an
explicit check for a trailing whitespace after having parsed the
filemode. Because of the newly introduced error message, the test
object::tree::parse::mode_doesnt_cause_oob_read needs adjustment to its
error message check, which in fact is a good thing as it demonstrates
that we now fail looking for the whitespace immediately following the
filemode.

Add a test that shows that we will fail to parse such invalid filemodes
now.

committed 6 years ago

7fafec0e Browse Directory

tree: fix mode parsing reading out-of-bounds · f647bbc8

When parsing a tree entry's mode, we will eagerly parse until we hit a
character that is not in the accepted set of octal digits '0' - '7'. If
the provided buffer is not a NUL terminated one, we may thus read
out-of-bounds.

Fix the issue by passing the buffer length to `parse_mode` and paying
attention to it. Note that this is not a vulnerability in our usual code
paths, as all object data read from the ODB is NUL terminated.

committed 6 years ago

f647bbc8 Browse Directory

tree: add various tests exercising the tree parser · d4ad658a

We currently don't have any tests that directly exercise the tree
parser. This is due to the fact that the parsers for raw object data has
only been recently introduce with commit ca4db5f4 (object: implement
function to parse raw data, 2017-10-13), and previous to that the setup
simply was too cumbersome as it always required going through the ODB.

Now that we have the infrastructure, add a suite of tests that directly
exercise the tree parser and various edge cases.

committed 6 years ago

d4ad658a Browse Directory

25 Oct, 2018 4 commits

commit: fix reading out of bounds when parsing encoding · 7655b2d8

The commit message encoding is currently being parsed by the
`git__prefixcmp` function. As this function does not accept a buffer
length, it will happily skip over a buffer's end if it is not `NUL`
terminated.

Fix the issue by using `git__prefixncmp` instead. Add a test that
verifies that we are unable to parse the encoding field if it's cut off
by the supplied buffer length.

committed 6 years ago

7655b2d8 Browse Directory

tests: add tests that exercise commit parsing · c2e3d8ef

We currently do not have any test suites dedicated to parsing commits
from their raw representations. Add one based on `git_object__from_raw`
to be able to test special cases more easily.

committed 6 years ago

c2e3d8ef Browse Directory

tag: fix out of bounds read when searching for tag message · ee11d47e

When parsing tags, we skip all unknown fields that appear before the tag
message. This skipping is done by using a plain `strstr(buffer, "\n\n")`
to search for the two newlines that separate tag fields from tag
message. As it is not possible to supply a buffer length to `strstr`,
this call may skip over the buffer's end and thus result in an out of
bounds read. As `strstr` may return a pointer that is out of bounds, the
following computation of `buffer_end - buffer` will overflow and result
in an allocation of an invalid length.

Fix the issue by using `git__memmem` instead. Add a test that verifies
parsing the tag fails not due to the allocation failure but due to the
tag having no message.

committed 6 years ago

ee11d47e Browse Directory

tests: add tests that exercise tag parsing · 4c738e56

While the tests in object::tag::read exercises reading and parsing valid
tags from the ODB, they barely try to verify that the parser fails in a
sane way when parsing invalid tags. Create a new test suite
object::tag::parse that directly exercise the parser by using
`git_object__from_raw` and add various tests for valid and invalid tags.

committed 6 years ago

4c738e56 Browse Directory

27 Jul, 2018 1 commit
- tree: rename from_tree to validate and clarify the tree in the test · f00db9ed
  Carlos Martín Nieto committed 6 years ago
  
  f00db9ed Browse Directory
18 Jul, 2018 1 commit

tree: accept null ids in existing trees when updating · 2dff7e28

When we add entries to a treebuilder we validate them. But we validate even
those that we're adding because they exist in the base tree. This disables
using the normal mechanisms on these trees, even to fix them.

Keep track of whether the entry we're appending comes from an existing tree and
bypass the name and id validation if it's from existing data.

committed 6 years ago

2dff7e28 Browse Directory

13 Jul, 2018 1 commit

treewide: remove use of C++ style comments · 9994cd3f

C++ style comment ("//") are not specified by the ISO C90 standard and
thus do not conform to it. While libgit2 aims to conform to C90, we did
not enforce it until now, which is why quite a lot of these
non-conforming comments have snuck into our codebase. Do a tree-wide
conversion of all C++ style comments to the supported C style comments
to allow us enforcing strict C90 compliance in a later commit.

committed 6 years ago

9994cd3f Browse Directory

10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
  ecf4f33a Browse Directory
28 Feb, 2018 1 commit

tree: initialize the id we use for testing submodule insertions · a554d588

Instead of laving it uninitialized and relying on luck for it to be non-zero,
let's give it a dummy hash so we make valgrind happy (in this case the hash
comes from `sha1sum </dev/null`.

committed 6 years ago

a554d588 Browse Directory

26 Jan, 2018 1 commit

tree: reject writing null-OID entries to a tree · c0487bde

In commit a96d3cc3f (cache-tree: reject entries with null sha1,
2017-04-21), the git.git project has changed its stance on null OIDs in
tree objects. Previously, null OIDs were accepted in tree entries to
help tools repair broken history. This resulted in some problems though
in that many code paths mistakenly passed null OIDs to be added to a
tree, which was not properly detected.

Align our own code base according to the upstream change and reject
writing tree entries early when the OID is all-zero.

committed 6 years ago

c0487bde Browse Directory

01 May, 2017 1 commit
- object validation: free some memleaks · 1dc89aab
  Edward Thomson committed 7 years ago
  
  1dc89aab Browse Directory
28 Apr, 2017 4 commits

odb: add option to turn off hash verification · 35079f50

Verifying hashsums of objects we are reading from the ODB may be costly
as we have to perform an additional hashsum calculation on the object.
Especially when reading large objects, the penalty can be as high as
35%, as can be seen when executing the equivalent of `git cat-file` with
and without verification enabled. To mitigate for this, we add a global
option for libgit2 which enables the developer to turn off the
verification, e.g. when he can be reasonably sure that the objects on
disk won't be corrupted.

committed 7 years ago

35079f50 Browse Directory

odb: verify object hashes · 28a0741f

The upstream git.git project verifies objects when looking them up from
disk. This avoids scenarios where objects have somehow become corrupt on
disk, e.g. due to hardware failures or bit flips. While our mantra is
usually to follow upstream behavior, we do not do so in this case, as we
never check hashes of objects we have just read from disk.

To fix this, we create a new error class `GIT_EMISMATCH` which denotes
that we have looked up an object with a hashsum mismatch. `odb_read_1`
will then, after having read the object from its backend, hash the
object and compare the resulting hash to the expected hash. If hashes do
not match, it will return an error.

This obviously introduces another computation of checksums and could
potentially impact performance. Note though that we usually perform I/O
operations directly before doing this computation, and as such the
actual overhead should be drowned out by I/O. Running our test suite
seems to confirm this guess. On a Linux system with best-of-five
timings, we had 21.592s with the check enabled and 21.590s with the
ckeck disabled. Note though that our test suite mostly contains very
small blobs only. It is expected that repositories with bigger blobs may
notice an increased hit by this check.

In addition to a new test, we also had to change the
odb::backend::nonrefreshing test suite, which now triggers a hashsum
mismatch when looking up the commit "deadbeef...". This is expected, as
the fake backend allocated inside of the test will return an empty
object for the OID "deadbeef...", which will obviously not hash back to
"deadbeef..." again. We can simply adjust the hash to equal the hash of
the empty object here to fix this test.

committed 7 years ago

28a0741f Browse Directory

tests: object: test looking up corrupted objects · d59dabe5

We currently have no tests which check whether we fail reading corrupted
objects. Add one which modifies contents of an object stored on disk and
then tries to read the object.

committed 7 years ago

d59dabe5 Browse Directory

tests: object: create sandbox · 86c03552

The object::lookup tests do use the "testrepo.git" repository in a
read-only way, so we do not set up the repository as a sandbox but
simply open it. But in a future commit, we will want to test looking up
objects which are corrupted in some way, which requires us to modify the
on-disk data. Doing this in a repository without creating the sandbox
will modify contents of our libgit2 repository, though.

Create the repository in a sandbox to avoid this.

committed 7 years ago

86c03552 Browse Directory