Commits · 1cc6cc990f514430068ccca747b990c27915caec · lvzhengyang / git2

28 Feb, 2018 1 commit

tree: initialize the id we use for testing submodule insertions · a554d588

Instead of laving it uninitialized and relying on luck for it to be non-zero,
let's give it a dummy hash so we make valgrind happy (in this case the hash
comes from `sha1sum </dev/null`.

committed 6 years ago

a554d588 Browse Directory

26 Jan, 2018 1 commit

tree: reject writing null-OID entries to a tree · c0487bde

In commit a96d3cc3f (cache-tree: reject entries with null sha1,
2017-04-21), the git.git project has changed its stance on null OIDs in
tree objects. Previously, null OIDs were accepted in tree entries to
help tools repair broken history. This resulted in some problems though
in that many code paths mistakenly passed null OIDs to be added to a
tree, which was not properly detected.

Align our own code base according to the upstream change and reject
writing tree entries early when the OID is all-zero.

committed 6 years ago

c0487bde Browse Directory

01 May, 2017 1 commit
- object validation: free some memleaks · 1dc89aab
  Edward Thomson committed 7 years ago
  
  1dc89aab Browse Directory
28 Apr, 2017 4 commits

odb: add option to turn off hash verification · 35079f50

Verifying hashsums of objects we are reading from the ODB may be costly
as we have to perform an additional hashsum calculation on the object.
Especially when reading large objects, the penalty can be as high as
35%, as can be seen when executing the equivalent of `git cat-file` with
and without verification enabled. To mitigate for this, we add a global
option for libgit2 which enables the developer to turn off the
verification, e.g. when he can be reasonably sure that the objects on
disk won't be corrupted.

committed 7 years ago

35079f50 Browse Directory

odb: verify object hashes · 28a0741f

The upstream git.git project verifies objects when looking them up from
disk. This avoids scenarios where objects have somehow become corrupt on
disk, e.g. due to hardware failures or bit flips. While our mantra is
usually to follow upstream behavior, we do not do so in this case, as we
never check hashes of objects we have just read from disk.

To fix this, we create a new error class `GIT_EMISMATCH` which denotes
that we have looked up an object with a hashsum mismatch. `odb_read_1`
will then, after having read the object from its backend, hash the
object and compare the resulting hash to the expected hash. If hashes do
not match, it will return an error.

This obviously introduces another computation of checksums and could
potentially impact performance. Note though that we usually perform I/O
operations directly before doing this computation, and as such the
actual overhead should be drowned out by I/O. Running our test suite
seems to confirm this guess. On a Linux system with best-of-five
timings, we had 21.592s with the check enabled and 21.590s with the
ckeck disabled. Note though that our test suite mostly contains very
small blobs only. It is expected that repositories with bigger blobs may
notice an increased hit by this check.

In addition to a new test, we also had to change the
odb::backend::nonrefreshing test suite, which now triggers a hashsum
mismatch when looking up the commit "deadbeef...". This is expected, as
the fake backend allocated inside of the test will return an empty
object for the OID "deadbeef...", which will obviously not hash back to
"deadbeef..." again. We can simply adjust the hash to equal the hash of
the empty object here to fix this test.

committed 7 years ago

28a0741f Browse Directory

tests: object: test looking up corrupted objects · d59dabe5

We currently have no tests which check whether we fail reading corrupted
objects. Add one which modifies contents of an object stored on disk and
then tries to read the object.

committed 7 years ago

d59dabe5 Browse Directory

tests: object: create sandbox · 86c03552

The object::lookup tests do use the "testrepo.git" repository in a
read-only way, so we do not set up the repository as a sandbox but
simply open it. But in a future commit, we will want to test looking up
objects which are corrupted in some way, which requires us to modify the
on-disk data. Doing this in a repository without creating the sandbox
will modify contents of our libgit2 repository, though.

Create the repository in a sandbox to avoid this.

committed 7 years ago

86c03552 Browse Directory

14 Nov, 2016 1 commit
- tree: add a failing test for unsorted input · 1d41b86c
```
We do not currently use the sorted version of this input in the
function, which means we produce bad results.
```
  Carlos Martín Nieto committed 8 years ago
  1d41b86c Browse Directory
09 Aug, 2016 1 commit
- tests: blob: remove unused callback function · 4006455f
  Patrick Steinhardt committed 8 years ago
  
  4006455f Browse Directory
20 Jun, 2016 1 commit
- threads: split up OS-dependent thread code · faebc1c6
  Patrick Steinhardt committed 8 years ago
  
  faebc1c6 Browse Directory
24 May, 2016 1 commit
- tree: handle removal of all entries in the updater · a2cb4713
```
When we remove all entries in a tree, we should remove that tree from
its parent rather than include the empty tree.
```
  Carlos Martín Nieto committed 8 years ago
  a2cb4713 Browse Directory
19 May, 2016 2 commits
- tree: plug leaks in the tree updater · 53412305
  Carlos Martín Nieto committed 8 years ago
  
  53412305 Browse Directory
- tree: use testrepo2 for the tree updater tests · 92249656
```
This gives us trees with subdirectories, which the new test needs.
```
  Carlos Martín Nieto committed 8 years ago
  92249656 Browse Directory
17 May, 2016 1 commit

Introduce a function to create a tree based on a different one · 9464f9eb

Instead of going through the usual steps of reading a tree recursively
into an index, modifying it and writing it back out as a tree, introduce
a function to perform simple updates more efficiently.

`git_tree_create_updated` avoids reading trees which are not modified
and supports upsert and delete operations. It is not as versatile as
modifying the index, but it makes some common operations much more
efficient.

committed 8 years ago

9464f9eb Browse Directory

25 Apr, 2016 1 commit

tag: ignore extra header fields · eb39284b

While no extra header fields are defined for tags, git accepts them by
ignoring them and continuing the search for the message. There are a few
tags like this in the wild which git parses just fine, so we should do
the same.

committed 8 years ago

eb39284b Browse Directory

22 Mar, 2016 3 commits

blob: remove _fromchunks() · 6669e3e8

The callback mechanism makes it awkward to write data from an IO
source; move to `_fromstream()` which lets the caller remain in control,
in the same vein as we prefer iterators over foreach callbacks.

committed 8 years ago

6669e3e8 Browse Directory

blob: fix fromchunks iteration counter · 35e68606

By returning when the count goes to zero rather than below it, setting
`howmany` to 7 in fact writes out the string 6 times.

Correct the termination condition to write out the string the amount of
times we specify.

committed 8 years ago

35e68606 Browse Directory

blob: introduce creating a blob by writing into a stream · 0a5c6028

The pair of `git_blob_create_frombuffer()` and
`git_blob_create_frombuffer_commit()` is meant to replace
`git_blob_create_fromchunks()` by providing a way for a user to write a
new blob when they want filtering or they do not know the size.

This approach allows the caller to retain control over when to add data
to this buffer and a more natural fit into higher-level language's own
stream abstractions instead of having to handle IO wait in the callback.

The in-memory buffer size of 2MB is chosen somewhat arbitrarily to be a
round multiple of usual page sizes and a value where most blobs seem
likely to be either going to be way below or way over that size. It's
also a round number of pages.

This implementation re-uses the helper we have from `_fromchunks()` so
we end up writing everything to disk, but hopefully more efficiently
than with a default filebuf. A later optimisation can be to avoid
writing the in-memory contents to disk, with some extra complexity.

committed 8 years ago

0a5c6028 Browse Directory

20 Mar, 2016 1 commit
- tree: re-use the id and filename in the odb object · 60a194aa
```
Instead of copying over the data into the individual entries, point to
the originals, which are already in a format we can use.
```
  Carlos Martín Nieto committed 8 years ago
  60a194aa Browse Directory
04 Mar, 2016 1 commit

treebuilder: don't try to verify submodules exist in the odb · ea5bf6bb

Submodules don't exist in the objectdb and the code is making us try to
look for a blob with its commit id, which is obviously not going to
work.

Skip the test if the user wants to insert a submodule.

committed 8 years ago

ea5bf6bb Browse Directory

28 Feb, 2016 3 commits
- turn on strict object validation by default · f2dddf52
  Edward Thomson committed 8 years ago
  
  f2dddf52 Browse Directory
- tests: use legitimate object ids · 4afe536b
```
Use legitimate (existing) object IDs in tests so that we have the
ability to turn on strict object validation when running tests.
```
  Edward Thomson committed 8 years ago
  4afe536b Browse Directory
- treebuilder: validate tree entries (optionally) · 2bbc7d3e
```
When `GIT_OPT_ENABLE_STRICT_OBJECT_CREATION` is turned on, validate
the tree and parent ids given to treebuilder insertion.
```
  Edward Thomson committed 8 years ago
  2bbc7d3e Browse Directory
28 May, 2015 1 commit
- conflict tests: use GIT_IDXENTRY_STAGE_SET · 2f1080ea
  Edward Thomson committed 9 years ago
  
  2f1080ea Browse Directory
04 Jan, 2015 1 commit
- Plug a couple of leaks · c4a2fd5c
  Carlos Martín Nieto committed 10 years ago
  
  c4a2fd5c Browse Directory
27 Dec, 2014 1 commit

treebuilder: rename _create() to _new() · 208a2c8a

This function is a constructor, so let's name it like one and leave
_create() for the reference functions, which do create/write the
reference.

committed 10 years ago

208a2c8a Browse Directory

17 Dec, 2014 1 commit

treebuilder: take a repository for path validation · dce7b1a4

Path validation may be influenced by `core.protectHFS` and
`core.protectNTFS` configuration settings, thus treebuilders
can take a repository to influence their configuration.

committed 10 years ago

dce7b1a4 Browse Directory

22 Nov, 2014 1 commit

peel: reject bad queries with EINVALIDSPEC · 753e17b0

There are some combination of objects and target types which we know
cannot be fulfilled. Return EINVALIDSPEC for those to signify that there
is a mismatch in the user-provided data and what the object model is
capable of satisfying.

If we start at a tag and in the course of peeling find out that we
cannot reach a particular type, we return EPEEL.

committed 10 years ago

753e17b0 Browse Directory

16 Sep, 2014 1 commit
- Factor 40 and 41 constants from source. · 3b2cb2c9
  Ciro Santilli committed 10 years ago
  
  3b2cb2c9 Browse Directory
18 Aug, 2014 1 commit

oid: Export `git_oid_tostr_s` instead of `_allocfmt` · 4ca0b566

The old `allocfmt` is of no use to callers, as they are not able to free
the returned buffer. Export a new API that returns a static string that
doesn't need to be freed.

committed 10 years ago

4ca0b566 Browse Directory

01 Jul, 2014 1 commit
- Introduce cl_assert_equal_oid · 0cee70eb
  Edward Thomson committed 10 years ago
  
  0cee70eb Browse Directory
10 Jun, 2014 1 commit

treebuilder: use a map instead of vector to store the entries · 4d3f1f97

Finding a filename in a vector means we need to resort it every time we
want to read from it, which includes every time we want to write to it
as well, as we want to find duplicate keys.

A hash-map fits what we want to do much more accurately, as we do not
care about sorting, but just the particular filename.

We still keep removed entries around, as the interface let you assume
they were going to be around until the treebuilder is cleared or freed,
but in this case that involves an append to a vector in the filter case,
which can now fail.

The only time we care about sorting is when we write out the tree, so
let's make that the only time we do any sorting.

committed 10 years ago

4d3f1f97 Browse Directory

07 Jun, 2014 1 commit
- Win32: Fix object::cache::threadmania test on x64 · fb591767
  Philip Kelley committed 10 years ago
  
  fb591767 Browse Directory
18 May, 2014 1 commit

message: don't assume the comment char · 49e369b2

The comment char is configurable and we need to provide a way for the
user to specify which comment char they chose for their message.

committed 10 years ago

49e369b2 Browse Directory

08 May, 2014 1 commit

Be more careful with user-supplied buffers · 1e4976cb

This adds in missing calls to `git_buf_sanitize` and fixes a
number of places where `git_buf` APIs could inadvertently write
NUL terminator bytes into invalid buffers.  This also changes the
behavior of `git_buf_sanitize` to NUL terminate a buffer if it can
and of `git_buf_shorten` to do nothing if it can.

Adds tests of filtering code with zeroed (i.e. unsanitized) buffer
which was previously triggering a segfault.

committed 10 years ago

1e4976cb Browse Directory

06 May, 2014 1 commit

Add filter options and ALLOW_UNSAFE · 5269008c

Diff and status do not want core.safecrlf to actually raise an
error regardless of the setting, so this extends the filter API
with an additional options flags parameter and adds a flag so that
filters can be applied with GIT_FILTER_OPT_ALLOW_UNSAFE, indicating
that unsafe filter application should be downgraded from a failure
to a warning.

committed 10 years ago

5269008c Browse Directory

29 Apr, 2014 1 commit

commit: safer commit creation with reference update · 217c029b

The current version of the commit creation and amend function are unsafe
to use when passing the update_ref parameter, as they do not check that
the reference at the moment of update points to what the user expects.

Make sure that we're moving history forward when we ask the library to
update the reference for us by checking that the first parent of the new
commit is the current value of the reference. We also make sure that the
ref we're updating hasn't moved between the read and the write.

Similarly, when amending a commit, make sure that the current tip of the
branch is the commit we're amending.

committed 10 years ago

217c029b Browse Directory

10 Mar, 2014 1 commit
- Add failing test for git_object_short_id · eb46fb2b
  Jiri Pospisil committed 10 years ago
  
  eb46fb2b Browse Directory
05 Mar, 2014 1 commit

Add git_object_short_id API to get short id string · 13f7ecd7

This finds a short id string that will unambiguously select the
given object, starting with the core.abbrev length (usually 7)
and growing until it is no longer ambiguous.

committed 10 years ago

13f7ecd7 Browse Directory

08 Feb, 2014 1 commit

Add git_commit_amend API · 80c29fe9

This adds an API to amend an existing commit, basically a shorthand
for creating a new commit filling in missing parameters from the
values of an existing commit. As part of this, I also added a new
"sys" API to create a commit using a callback to get the parents.
This allowed me to rewrite all the other commit creation APIs so
that temporary allocations are no longer needed.

committed 10 years ago

80c29fe9 Browse Directory