Commits · 2848923a2f5099b6d105b0b30212134d84377dee · lvzhengyang / git2

01 Dec, 2018 1 commit

index: use new enum and structure names · 18e71e6d

Use the new-style index names throughout our own codebase.

Edward Thomson committed 6 years ago

28 Nov, 2018 1 commit

khash: remove intricate knowledge of khash types · 852bc9f4

Instead of using the `khiter_t`, `git_strmap_iter` and `khint_t` types,
simply use `size_t` instead. This decouples code from the khash stuff
and makes it possible to move the khash includes into the implementation
files.

committed 6 years ago


14 Nov, 2018 1 commit

index: introduce git_index_iterator · c358bbc5

Provide a public git_index_iterator API that is backed by an index
snapshot.  This allows consumers to provide a stable iteration even
while manipulating the index during iteration.

committed 6 years ago


19 Oct, 2018 2 commits

index: fix adding index entries with conflicting files · 8b6e2895

When adding an index entry "a/b/c" while an index entry "a/b" already
exists, git will happily remove "a/b" and only add the new index
entry:

    $ git init test
    Initialized empty Git repository in /tmp/test.repo/test/.git/
    $ touch x
    $ git add x
    $ rm x
    $ mkdir x
    $ touch x/y
    $ git add x/y
    $ git status
    A x/y

The other way round, adding an index entry "a/b" with an entry "a/b/c"
already existing is equivalent, where git will remove "a/b/c" and add
"a/b".

In contrast, libgit2 will currently fail to add these properly and
instead complain about the entry appearing as both a file and a
directory. This is a programming error, though: our current code already
tries to detect and, in the case of `git_index_add`, to automatically
replace such index entries. Funnily enough, we already remove the
conflicting index entries, but instead of adding the new entry we then
bail out afterwards. This leaves callers with the worst of both worlds:
we both remove the old entry but fail to add the new one.

The root cause is the weird semantics of the `has_file_name` and
`has_dir_name` functions. While these functions only sound like they are
responsible for detecting such conflicts, they will also already remove
them when their `ok_to_replace` parameter is set. But even if we tell
them to replace such entries, they will return an error code.

Fix the error by returning success in case where the entries have been
replaced. Fix an already existing test which tested for wrong behaviour.
Note that the test didn't notice that the resulting tree had no entries.
Thus it is fine to change existing behaviour here, as the previous
result could've led to silently losing data. Also add a new test that
verifies behaviour in the reverse conflicting case.
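
The file/directory conflict described above can be illustrated with a small, self-contained sketch. The helper names `is_dir_prefix` and `paths_conflict` are hypothetical and are not taken from libgit2; the sketch only shows when two index paths cannot coexist, not how libgit2 replaces them.

```c
#include <assert.h>
#include <string.h>

/* Returns 1 if `dir` names a directory that contains `path`,
 * e.g. dir = "a/b" and path = "a/b/c". */
static int is_dir_prefix(const char *dir, const char *path)
{
    size_t len = strlen(dir);

    return len > 0 && strncmp(dir, path, len) == 0 && path[len] == '/';
}

/* Two index paths conflict when one would have to be both a file
 * and a directory for the other to exist. Identical paths do not
 * count as a file/directory conflict here. */
int paths_conflict(const char *a, const char *b)
{
    assert(a && b);
    return is_dir_prefix(a, b) || is_dir_prefix(b, a);
}
```

With this check, `"a/b"` conflicts with `"a/b/c"` in either order, while `"a/b"` and `"a/bc"` do not conflict.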

committed 6 years ago


index: modernize error handling of `index_insert` · 923317db

The error handling of the function `index_insert` is currently
very fragile. Instead of erroring out in case an error has happened, it
will instead verify that no error has happened for each statement. This
makes adding new code to that function an adventurous task.

Improve the situation by converting the function to use our typical
`goto out` pattern.
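
As a rough illustration of the `goto out` pattern mentioned above, the following sketch funnels every failure to a single cleanup label. The functions `dup_string` and `join_path` are hypothetical examples and do not reflect the actual body of `index_insert`.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: copies `src` into a fresh allocation. */
static int dup_string(char **out, const char *src)
{
    size_t len;
    char *copy;

    assert(out);
    if (!src)
        return -1;
    len = strlen(src) + 1;
    if ((copy = malloc(len)) == NULL)
        return -1;
    memcpy(copy, src, len);
    *out = copy;
    return 0;
}

/* Joins `dir` and `name` with a slash. Each step bails out to the
 * `out` label on error, so cleanup lives in exactly one place. */
int join_path(char **out, const char *dir, const char *name)
{
    char *a = NULL, *b = NULL, *joined = NULL;
    size_t alen, blen;
    int error = -1;

    assert(out);
    *out = NULL;

    if ((error = dup_string(&a, dir)) < 0 ||
        (error = dup_string(&b, name)) < 0)
        goto out;

    alen = strlen(a);
    blen = strlen(b);
    if ((joined = malloc(alen + blen + 2)) == NULL) {
        error = -1;
        goto out;
    }
    memcpy(joined, a, alen);
    joined[alen] = '/';
    memcpy(joined + alen + 1, b, blen + 1);

    *out = joined;
    joined = NULL; /* ownership moved to the caller */
    error = 0;

out:
    free(a);
    free(b);
    free(joined);
    return error;
}
```

New steps can be added between the existing ones without touching the cleanup code, which is the property the commit message is after.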

committed 6 years ago


18 Oct, 2018 1 commit

index: avoid out-of-bounds read when reading reuc entry stage · 600ceadd

We use `git__strtol64` to parse file modes of the index entries, which
does not limit the parsed buffer length. As the index can be essentially
treated as "untrusted" in that the data stems from the file system, it
may be malformed and may not contain terminating `NUL` bytes. This
may lead to out-of-bounds reads when trying to parse index entries with
such malformed modes.

Fix the issue by using `git__strntol64` instead.
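
The idea of length-bounded parsing can be sketched as follows. The function `parse_octal_n` is a hypothetical stand-in written for this illustration; it is not the `git__strntol64` implementation and has a different signature.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Parses an octal number from at most `len` bytes of `buf`.
 * The loop condition checks the index before touching the byte,
 * so buf[len] is never read and no NUL terminator is required.
 * Returns 0 on success, -1 if no digit was found or on overflow. */
int parse_octal_n(uint32_t *out, const char *buf, size_t len,
                  const char **end)
{
    uint32_t value = 0;
    size_t i = 0;

    assert(out && buf);

    while (i < len && buf[i] >= '0' && buf[i] <= '7') {
        if (value > (UINT32_MAX >> 3))
            return -1; /* the next shift would overflow */
        value = (value << 3) | (uint32_t)(buf[i] - '0');
        i++;
    }
    if (i == 0)
        return -1;

    *out = value;
    if (end)
        *end = buf + i;
    return 0;
}
```

A six-byte buffer holding `100644` without a trailing `NUL` parses to the mode `0100644`, and a zero-length buffer is rejected without any read.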

committed 6 years ago


11 Sep, 2018 1 commit

index: release the snapshot instead of freeing the index · c70713d6

Previously we would assert in index_free because the reader incrementation
would not be balanced. Release the snapshot normally, so the variable gets
decremented before the index is freed.

committed 6 years ago


16 Aug, 2018 1 commit

Fix leak in index.c · 581d5492

abyss7 committed 6 years ago

29 Jun, 2018 4 commits

settings: optional unsaved index safety · bfa1f022

Add the `GIT_OPT_ENABLE_UNSAVED_INDEX_SAFETY` option, which will cause
commands that reload the on-disk index to fail if the current
`git_index` has changes that have not been saved.  This will prevent
users from - for example - adding a file to the index then calling a
function like `git_checkout` and having that file be silently removed
from the index since it was re-read from disk.

Now calls that would re-read the index will fail if the index is
"dirty", meaning changes have been made to it but have not been written.
Users can either `git_index_read` to discard those changes explicitly,
or `git_index_write` to write them.

committed 6 years ago


index: return a unique error code on dirty index · 787768c2

When the index is dirty, return GIT_EINDEXDIRTY so that consumers can
identify the exact problem programmatically.

Edward Thomson committed 6 years ago

index: commit the changes to the index properly · b242cdbf

Now that the index has a "dirty" state, where it has changes that have
not yet been committed or rolled back, our tests need to be adapted to
actually commit or rollback the changes instead of assuming that the
index can be operated on in its indeterminate state.

committed 6 years ago


index: add a dirty bit reflecting unsaved changes · 7c56c49b

Teach the index when it is "dirty", and has unsaved changes. Consider
the index dirty whenever a caller has added or removed an entry from the
main index, REUC or NAME section, including when the index is completely
cleared. Similarly, consider the index _not_ dirty immediately after it
is written, or when it is read from the on-disk index.

This allows us to ensure that unsaved changes are not lost when we
automatically refresh the index.
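
A toy model of the dirty bit may help to picture the behaviour. Everything below (`toy_index`, `TOY_EDIRTY`, the `toy_index_*` functions and the `safety` flag) is hypothetical and only mirrors the rules stated in the commit messages; it is not libgit2's data structure or API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define TOY_EDIRTY (-1) /* hypothetical error code for this sketch */

typedef struct {
    size_t entry_count;
    bool dirty; /* set when in-memory state differs from disk */
} toy_index;

/* Any modification marks the index dirty. */
void toy_index_add(toy_index *idx)
{
    assert(idx);
    idx->entry_count++;
    idx->dirty = true;
}

/* Clearing the index is a modification as well. */
void toy_index_clear(toy_index *idx)
{
    assert(idx);
    idx->entry_count = 0;
    idx->dirty = true;
}

/* Writing makes memory and disk agree again. */
void toy_index_write(toy_index *idx)
{
    assert(idx);
    idx->dirty = false;
}

/* Reloading from disk would discard unsaved changes, so it is
 * refused while the index is dirty and the safety is enabled. */
int toy_index_reload(toy_index *idx, size_t on_disk_count, bool safety)
{
    assert(idx);
    if (safety && idx->dirty)
        return TOY_EDIRTY;
    idx->entry_count = on_disk_count;
    idx->dirty = false;
    return 0;
}
```

In this model a caller that adds an entry and then triggers a reload gets an error instead of silently losing the entry, and has to write or explicitly discard the changes first.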

committed 6 years ago


10 Jun, 2018 1 commit

Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a

Patrick Steinhardt committed 6 years ago

01 Jun, 2018 1 commit

index: Fix alignment issues in write_disk_entry() · 93271f59

In order to avoid alignment issues on certain target architectures,
it is necessary to use memcpy() when modifying elements of a struct
inside a buffer returned by git_filebuf_reserve().
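
The technique can be shown in isolation with two small helpers. `store_u32` and `load_u32` are hypothetical names for this sketch and are not part of `write_disk_entry`.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Writes a 32-bit value at `dst`, which may sit at any byte
 * offset. memcpy has no alignment requirement, unlike a store
 * through a casted uint32_t pointer. */
void store_u32(unsigned char *dst, uint32_t value)
{
    assert(dst);
    memcpy(dst, &value, sizeof(value));
}

/* Reads a 32-bit value back from a possibly unaligned address. */
uint32_t load_u32(const unsigned char *src)
{
    uint32_t value;

    assert(src);
    memcpy(&value, src, sizeof(value));
    return value;
}
```

Storing at `buf + 1` works on strict-alignment targets as well, where dereferencing `(uint32_t *)(buf + 1)` could fault.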

committed 6 years ago


23 May, 2018 2 commits

path: reject .gitmodules as a symlink · a7168b47

Any part of the library which asks the question can pass in the mode to have it
checked against `.gitmodules` being a symlink.

This is particularly relevant for adding entries to the index from the worktree
and for checking out files.

committed 6 years ago


index: stat before creating the entry · 58ff913a

This is so we have it available for the path validity checking. In a later
commit we will start rejecting `.gitmodules` files as symlinks.

committed 6 years ago


10 Mar, 2018 3 commits

index: error out on unreasonable prefix-compressed path lengths · 3db1af1f

When computing the complete path length from the encoded
prefix-compressed path, we end up just allocating the complete path
without ever checking what the encoded path length actually is. This can
easily lead to a denial of service by just encoding an unreasonably long
path name inside of the index. Git already enforces a maximum path
length of 4096 bytes. As we also have that enforcement ready in some
places, just make sure that the resulting path is smaller than
GIT_PATH_MAX.

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago


index: fix out-of-bounds read with invalid index entry prefix length · 3207ddb0

The index format in version 4 has prefix-compressed entries, where every
index entry can compress its path by using a path prefix of the previous
entry. Since implementing support for this index format version in commit
5625d86b (index: support index v4, 2016-05-17), though, we do not
correctly verify that the prefix length that we want to reuse is
actually smaller than or equal to the length of the previous index
entry's path. This can lead to an integer underflow and subsequently to
an out-of-bounds read.

Fix this by verifying that the prefix is actually smaller than the
previous entry's path length.
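
A length-only sketch of the validation might look like this. It assumes that the encoded value (`strip_len` here) counts the bytes dropped from the end of the previous path, and `TOY_PATH_MAX` is a hypothetical stand-in for the library's path limit; neither name comes from libgit2.

```c
#include <assert.h>
#include <stddef.h>

#define TOY_PATH_MAX 4096 /* assumed limit for this sketch */

/* Computes the length of a prefix-compressed path from untrusted
 * lengths. Without the first check, last_len - strip_len would
 * wrap around to a huge size_t value. */
int compressed_path_len(size_t *out, size_t last_len,
                        size_t strip_len, size_t suffix_len)
{
    size_t prefix_len;

    assert(out);

    if (strip_len > last_len)
        return -1; /* would underflow */
    prefix_len = last_len - strip_len;

    /* Written so that the addition itself cannot overflow. */
    if (suffix_len >= TOY_PATH_MAX ||
        prefix_len >= TOY_PATH_MAX - suffix_len)
        return -1; /* resulting path too long */

    *out = prefix_len + suffix_len;
    return 0;
}
```

The second check also covers the unreasonable-length case: a result is only accepted when it stays below the limit.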

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago


index: convert `read_entry` to return entry size via an out-param · 58a6fe94

The function `read_entry` does not conform to our usual coding style of
returning stuff via the out parameter and to use the return value for
reporting errors. Due to most of our code conforming to that pattern, it
has become quite natural for us to actually return `-1` in case there is
any error, which has also slipped in with commit 5625d86b (index:
support index v4, 2016-05-17). As the function returns a `size_t` only,
though, the return value is wrapped around, causing the caller of
`read_tree` to continue with an invalid index entry. Ultimately, this
can lead to a double-free.

Improve code and fix the bug by converting the function to return the
index entry size via an out parameter and only using the return value to
indicate errors.
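
The difference between the two conventions can be sketched with a made-up record format (one length byte followed by that many payload bytes). `read_record` is hypothetical and is not the real `read_entry`. Returning `-1` from a function typed `size_t` yields `SIZE_MAX`, which a caller cannot tell apart from a size; an `int` return plus an out parameter avoids that.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Reports the record size through `out_size` and uses the return
 * value only for errors. `*out_size` is left untouched on failure. */
int read_record(size_t *out_size, const unsigned char *buf,
                size_t buf_len)
{
    size_t payload;

    assert(out_size && buf);

    if (buf_len < 1)
        return -1;
    payload = buf[0];
    if (payload > buf_len - 1)
        return -1; /* record claims more bytes than available */

    *out_size = 1 + payload;
    return 0;
}
```

A caller checks `if (read_record(&size, buf, len) < 0)` and never sees a wrapped-around size.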

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago


18 Feb, 2018 1 commit

git_index_add_frombuffer: only accept files/links · 5f774dbf

Ensure that the buffer given to `git_index_add_frombuffer` represents a
regular blob, an executable blob, or a link. Explicitly reject commit
entries (submodules) - it makes little sense to allow users to add a
submodule from a string; there's no possible path to success.

committed 7 years ago


16 Feb, 2018 1 commit

index: shut up warning on uninitialized variable · 7c6e9175

Even though the `entry` variable will always be initialized when
`read_entry` returns success and even though we never dereference
`entry` in case `read_entry` fails, GCC prints a warning about
uninitialized use. Just initialize the pointer to `NULL` in order to
shut GCC up.

committed 7 years ago


03 Jul, 2017 1 commit

Make sure to always include "common.h" first · 0c7f49dd

Next to including several files, our "common.h" header also declares
various macros which are then used throughout the project. As such, we
have to make sure to always include this file first in all
implementation files. Otherwise, we might encounter problems or even
silent behavioural differences due to macros or defines not being
defined as they should be. So in fact, our header and implementation
files should make sure to always include "common.h" first.

This commit does so by establishing a common include pattern. Header
files inside of "src" will now always include "common.h" as its first
other file, separated by a newline from all the other includes to make
it stand out as special. There are two cases for the implementation
files. If they do have a matching header file, they will always include
this one first, leading to "common.h" being transitively included as
first file. If they do not have a matching header file, they instead
include "common.h" as first file themselves.

This fixes the outlined problems and will become our standard practice
for header and source files inside of the "src/" from now on.

committed 7 years ago


06 Jun, 2017 9 commits

index: verify we have enough space left when writing index entries · 064a60e9

In our code writing index entries, we carry around a `disk_size`
representing how much memory we have in total and pass this value to
`git_encode_varint` to do bounds checks. This does not make much sense,
as at the time when passing on this variable it is already out of date.
Fix this by subtracting used memory from `disk_size` as we go along.
Furthermore, assert we've actually got enough space left to do the final
path memcpy.
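
One way to keep the remaining size up to date is a small write cursor. The `write_cursor` type and `cursor_put` function are hypothetical and only illustrate the bookkeeping; they do not appear in libgit2.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef struct {
    unsigned char *ptr; /* next byte to write */
    size_t remaining;   /* bytes still available after ptr */
} write_cursor;

/* Appends `len` bytes and shrinks `remaining` accordingly, so that
 * every later bounds check sees the space that is really left. */
int cursor_put(write_cursor *c, const void *data, size_t len)
{
    assert(c && (data || len == 0));

    if (len > c->remaining)
        return -1; /* not enough space, nothing is written */
    if (len > 0)
        memcpy(c->ptr, data, len);
    c->ptr += len;
    c->remaining -= len;
    return 0;
}
```

Because the cursor is updated on every write, a check against `remaining` before the final path copy reflects the actual free space rather than the initial total.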

committed 7 years ago


index: fix shared prefix computation when writing index entry · c71dff7e

When using compressed index entries, each entry's path is preceded by a
varint encoding how long the shared prefix with the previous index entry
actually is. We currently encode a length of `(path_len - same_len)`,
which is doubly wrong. First, `path_len` is already set to `path_len -
same_len` previously. Second, we want to encode the shared prefix rather
than the un-shared suffix length.

Fix this by using `same_len` as the varint value instead.

committed 7 years ago


index: also sanity check entry size with compressed entries · 83e0392c

We have a check in place whether the index has enough data left for the
required footer after reading an index entry, but this was only used for
uncompressed entries. Move the check down a bit so that it is executed
for both compressed and uncompressed index entries.

committed 7 years ago


index: remove file-scope entry size macros · 350d2c47

All index entry size computations are now performed in
`index_entry_size`. As such, we do not need the file-scope macros for
computing these sizes anymore. Remove them and move the `entry_size`
macro into the `index_entry_size` function.

committed 7 years ago


index: don't right-pad paths when writing compressed entries · 46b67034

Our code to write index entries to disk does not check whether the
entry that is to be written should use prefix compression for the path.
As such, we were overallocating memory and added bogus right-padding
into the resulting index entries. As there is no padding allowed in the
index version 4 format, this should actually result in an invalid index.

Fix this by re-using the newly extracted `index_entry_size` function.

committed 7 years ago


index: move index entry size computation into its own function · 29f498e0

Create a new function `index_entry_size` which encapsulates the logic to
calculate how much space is needed for an index entry, whether it is
simple/extended or compressed/uncompressed. This can later be re-used by
our code writing index entries.
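
A size computation along these lines could look as follows. The sketch assumes SHA-1 entries with a 62-byte fixed part, two extra bytes for extended flags, NUL padding to a multiple of eight for uncompressed entries, and no padding for compressed ones. The constants and the name `toy_entry_size` are assumptions of this sketch, not the actual `index_entry_size`.

```c
#include <assert.h>
#include <stddef.h>

enum {
    TOY_ENTRY_FIXED = 62, /* assumed: stat fields, oid, flags */
    TOY_ENTRY_EXTRA = 2   /* assumed: extended flags */
};

/* `path_len` is the number of path bytes stored on disk.
 * `varint_len` is the size of the prefix varint, or 0 for an
 * uncompressed entry. */
size_t toy_entry_size(size_t path_len, size_t varint_len, int extended)
{
    size_t fixed = TOY_ENTRY_FIXED + (extended ? TOY_ENTRY_EXTRA : 0);

    assert(path_len > 0);

    if (varint_len > 0)
        /* compressed: varint, path bytes, one NUL, no padding */
        return fixed + varint_len + path_len + 1;

    /* uncompressed: 1 to 8 NUL bytes pad to a multiple of 8 */
    return (fixed + path_len + 8) & ~(size_t)7;
}
```

Having one function for all four combinations means the reader and the writer cannot disagree about how many bytes an entry occupies.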

committed 7 years ago


index: set last written index entry in foreach-entry-loop · 8ceb890b

The last written disk entry is currently being written inside of the
function `write_disk_entry`. Make behavior a bit more obvious by
instead setting it inside of `write_entries` while iterating all
entries.

committed 7 years ago


index: set last entry when reading compressed entries · 11d0be23

To calculate the path of a compressed index entry, we need to know the
preceding entry's path. While we do actually set the first predecessor
correctly to "", we fail to update this while reading the entries.

Fix the issue by updating `last` inside of the loop. Previously, we've
been passing a double-pointer to `read_entry`, which it didn't update.
As it is more obvious to update the pointer inside the loop itself,
though, we can simply convert it to a normal pointer.

committed 7 years ago


index: fix confusion with shared prefix in compressed path names · febe8c14

The index version 4 introduced compressed path names for the entries.
From the git.git index-format documentation:

    At the beginning of an entry, an integer N in the variable width
    encoding [...] is stored, followed by a NUL-terminated string S.
    Removing N bytes from the end of the path name for the previous
    entry, and replacing it with the string S yields the path name for
    this entry.

But instead of stripping N bytes from the previous path's string and
using the remaining prefix, we were instead simply concatenating the
previous path with the current entry path, which is obviously wrong.

Fix the issue by correctly copying the first N bytes of the previous
entry only and concatenating the result with our current entry's path.
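
The reconstruction rule from the quoted index-format documentation can be sketched as below. The sketch follows the wording of that quote (drop N bytes from the end of the previous path, then append S). The function `rebuild_path` and its parameters are hypothetical and are not libgit2 code.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Builds the current path from the previous path, the number of
 * trailing bytes to drop (`strip_len`, the N of the format
 * description) and the stored suffix (the string S).
 * Returns 0 on success, -1 on invalid input or a too-small buffer. */
int rebuild_path(char *out, size_t out_size, const char *prev,
                 size_t strip_len, const char *suffix)
{
    size_t prev_len, keep, suffix_len;

    assert(out && prev && suffix);

    prev_len = strlen(prev);
    suffix_len = strlen(suffix);

    if (strip_len > prev_len)
        return -1; /* cannot drop more than the previous path has */
    keep = prev_len - strip_len;

    /* keep + suffix_len + 1 must fit into out_size */
    if (suffix_len >= out_size || keep >= out_size - suffix_len)
        return -1;

    memcpy(out, prev, keep);
    memcpy(out + keep, suffix, suffix_len + 1); /* includes the NUL */
    return 0;
}
```

For a previous path `src/index.c`, N = 1 and S = `h`, this yields `src/index.h`. The first entry uses an empty previous path and N = 0.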

committed 7 years ago


17 Feb, 2017 3 commits

idxmap: remove GIT__USE_IDXMAP · 8f1ff26b

Patrick Steinhardt committed 8 years ago

khash: avoid using `kh_resize` directly · f14f75d4

Patrick Steinhardt committed 8 years ago

khash: avoid using macro magic to get return address · 73028af8

Patrick Steinhardt committed 8 years ago

29 Dec, 2016 1 commit

giterr_set: consistent error messages · 909d5494

Error messages should be sentence fragments, and therefore:

1. Should not begin with a capital letter,
2. Should not conclude with punctuation, and
3. Should not end a sentence and begin a new one

committed 8 years ago


16 Nov, 2016 1 commit

use `giterr_set_str()` wherever possible · 65b78ea3

`giterr_set()` is used when it is required to format a string, and since
we don't really require it for this case, it is better to stick to
`giterr_set_str()`.

This also suppresses a warning (`-Wformat-security`) raised by the compiler.
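
The distinction can be illustrated with two toy setters. `toy_set_error`, `toy_set_error_str` and `toy_last_error` are hypothetical functions written for this sketch; they only mimic the split between a formatting and a non-formatting error setter and are not the `giterr_*` implementations.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <string.h>

static char last_error[128];

/* Formatting variant: the first argument is a printf-style format
 * string, so it should be a literal under the caller's control. */
void toy_set_error(const char *fmt, ...)
{
    va_list ap;

    assert(fmt);
    va_start(ap, fmt);
    vsnprintf(last_error, sizeof(last_error), fmt, ap);
    va_end(ap);
}

/* Plain variant: the message is copied verbatim, so '%' characters
 * in it are harmless and no format-security warning applies. */
void toy_set_error_str(const char *msg)
{
    assert(msg);
    snprintf(last_error, sizeof(last_error), "%s", msg);
}

const char *toy_last_error(void)
{
    return last_error;
}
```

Passing a non-literal message to the formatting variant is what `-Wformat-security` flags. The plain variant sidesteps the issue when no formatting is needed.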

Signed-off-by: Pranit Bauva <pranit.bauva@gmail.com>

committed 8 years ago


10 Aug, 2016 1 commit

index: support index v4 · 5625d86b

Support reading and writing index v4. Index v4 uses a very simple
compression scheme for pathnames, but is otherwise similar to index v3.

Signed-off-by: David Turner <dturner@twitter.com>

committed 8 years ago


24 Jul, 2016 1 commit

index: cast to avoid warning · 4aaae935

Edward Thomson committed 8 years ago

29 Jun, 2016 2 commits

index: include conflicts in `git_index_read_index` · 6249d960

Ensure that we include conflicts when calling `git_index_read_index`,
which will remove conflicts in the index that do not exist in the new
target, and will add conflicts from the new target.

committed 8 years ago


index: refactor common `read_index` functionality · 6f7ec728

Most of `git_index_read_index` is common to reading any iterator.
Refactor it out in case we want to implement `read_tree` in terms of it
in the future.

committed 8 years ago
