Commits · f627ba6c7f4b40d533cc127f408cbce8353697ed · lvzhengyang / git2

18 Jul, 2019 1 commit
- configuration: cvar -> configmap · 658022c4
```
`cvar` is an unhelpful name.  Refactor its usage to `configmap` for more
clarity.
```
  Patrick Steinhardt committed 5 years ago
  658022c4 Browse File
24 Jun, 2019 1 commit
- index: safely cast file size · 7e49deba
  Edward Thomson committed 5 years ago
  
  7e49deba Browse File
15 Jun, 2019 2 commits

index: rename `frombuffer` to `from_buffer` · 6574cd00

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the index functions for
consistency with the rest of the library.

committed 5 years ago

6574cd00 Browse File

blob: add underscore to `from` functions · 08f39208

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the blob functions for
consistency with the rest of the library.

committed 5 years ago

08f39208 Browse File

15 Feb, 2019 4 commits

idxmap: have `resize` functions return proper error code · 8da93944

The currently existing function `git_idxmap_resize` and
`git_idxmap_icase_resize` do not return any error codes at all due to their
previous implementation making use of a macro. Due to that, it is impossible to
see whether the resize operation might have failed due to an out-of-memory
situation.

Fix this by providing a proper error code. Adjust callers to make use of it.

committed 5 years ago

8da93944 Browse File

idxmap: introduce high-level setter for key/value pairs · 661fc57b

Currently, one would use the function `git_idxmap_insert` to insert key/value
pairs into a map. This function has historically been a macro, which is why its
syntax is kind of weird: instead of returning an error code directly, it instead
has to be passed a pointer to where the return value shall be stored. This does
not match libgit2's common idiom of directly returning error codes.

Introduce a new function `git_idxmap_set`, which takes as parameters the map,
key and value and directly returns an error code. Convert all callers of
`git_idxmap_insert` to make use of it.

committed 5 years ago

661fc57b Browse File

idxmap: introduce high-level getter for values · d00c24a9

The current way of looking up an entry from a map is tightly coupled with the
map implementation, as one first has to look up the index of the key and then
retrieve the associated value by using the index. As a caller, you usually do
not care about any indices at all, though, so this is more complicated than
really necessary. Furthermore, it invites for errors to happen if the correct
error checking sequence is not being followed.

Introduce new high-level functions `git_idxmap_get` and `git_idxmap_icase_get`
that take a map and a key and return a pointer to the associated value if such a
key exists. Otherwise, a `NULL` pointer is returned. Adjust all callers that can
trivially be converted.

committed 5 years ago

d00c24a9 Browse File

maps: use uniform lifecycle management functions · 351eeff3

Currently, the lifecycle functions for maps (allocation, deallocation, resize)
are not named in a uniform way and do not have a uniform function signature.
Rename the functions to fix that, and stick to libgit2's naming scheme of saying
`git_foo_new`. This results in the following new interface for allocation:

- `int git_<t>map_new(git_<t>map **out)` to allocate a new map, returning an
  error code if we ran out of memory

- `void git_<t>map_free(git_<t>map *map)` to free a map

- `void git_<t>map_clear(git<t>map *map)` to remove all entries from a map

This commit also fixes all existing callers.

committed 5 years ago

351eeff3 Browse File

25 Jan, 2019 1 commit

index: explicitly cast down to a size_t · 494448a5

Quiet down a warning from MSVC about how we're potentially losing data.
This cast is safe since we've explicitly tested that `strip_len` <=
`last_len`.

committed 5 years ago

494448a5 Browse File

24 Jan, 2019 1 commit

index: preserve extension parsing errors · 0bf7e043

Previously, we would clobber any extension-specific error message with
an "extension is truncated" message. This makes `read_extension`
correctly preserve those errors, takes responsibility for truncation
errors, and adds a new message with the actual extension signature for
unsupported mandatory extensions.

committed 5 years ago

0bf7e043 Browse File

22 Jan, 2019 1 commit
- git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
  Edward Thomson committed 6 years ago
  f673e232 Browse File
01 Dec, 2018 1 commit
- index: use new enum and structure names · 18e71e6d
```
Use the new-style index names throughout our own codebase.
```
  Edward Thomson committed 6 years ago
  18e71e6d Browse File
28 Nov, 2018 1 commit

khash: remove intricate knowledge of khash types · 852bc9f4

Instead of using the `khiter_t`, `git_strmap_iter` and `khint_t` types,
simply use `size_t` instead. This decouples code from the khash stuff
and makes it possible to move the khash includes into the implementation
files.

committed 6 years ago

852bc9f4 Browse File

14 Nov, 2018 1 commit

index: introduce git_index_iterator · c358bbc5

Provide a public git_index_iterator API that is backed by an index
snapshot.  This allows consumers to provide a stable iteration even
while manipulating the index during iteration.

committed 6 years ago

c358bbc5 Browse File

19 Oct, 2018 2 commits

index: fix adding index entries with conflicting files · 8b6e2895

When adding an index entry "a/b/c" while an index entry "a/b" already
exists, git will happily remove "a/b/c" and only add the new index
entry:

    $ git init test
    Initialized empty Git repository in /tmp/test.repo/test/.git/
    $ touch x
    $ git add x
    $ rm x
    $ mkdir x
    $ touch x/y
    $ git add x/y
    $ git status
    A x/y

The other way round, adding an index entry "a/b" with an entry "a/b/c"
already existing is equivalent, where git will remove "a/b/c" and add
"a/b".

In contrast, libgit2 will currently fail to add these properly and
instead complain about the entry appearing as both a file and a
directory. This is a programming error, though: our current code already
tries to detect and, in the case of `git_index_add`, to automatically
replace such index entries. Funnily enough, we already remove the
conflicting index entries, but instead of adding the new entry we then
bail out afterwards. This leaves callers with the worst of both worlds:
we both remove the old entry but fail to add the new one.

The root cause is weird semantics of the `has_file_name` and
`has_dir_name` functions. While these functions only sound like they are
responsible for detecting such conflicts, they will also already remove
them in case where its `ok_to_replace` parameter is set. But even if we
tell it to replace such entries, it will return an error code.

Fix the error by returning success in case where the entries have been
replaced. Fix an already existing test which tested for wrong behaviour.
Note that the test didn't notice that the resulting tree had no entries.
Thus it is fine to change existing behaviour here, as the previous
result could've let to silently loosing data. Also add a new test that
verifies behaviour in the reverse conflicting case.

committed 6 years ago

8b6e2895 Browse File

index: modernize error handling of `index_insert` · 923317db

The current error hanling of the function `index_insert` is currently
very fragile. Instead of erroring out in case an error has happened, it
will instead verify that no error has happened for each statement. This
makes adding new code to that function an adventurous task.

Improve the situation by converting the function to use our typical
`goto out` pattern.

committed 6 years ago

923317db Browse File

18 Oct, 2018 1 commit

index: avoid out-of-bounds read when reading reuc entry stage · 600ceadd

We use `git__strtol64` to parse file modes of the index entries, which
does not limit the parsed buffer length. As the index can be essentially
treated as "untrusted" in that the data stems from the file system, it
may be misformatted and may not contain terminating `NUL` bytes. This
may lead to out-of-bounds reads when trying to parse index entries with
such malformatted modes.

Fix the issue by using `git__strntol64` instead.

committed 6 years ago

600ceadd Browse File

11 Sep, 2018 1 commit

index: release the snapshot instead of freeing the index · c70713d6

Previously we would assert in index_free because the reader incrementation
would not be balanced. Release the snapshot normally, so the variable gets
decremented before the index is freed.

committed 6 years ago

c70713d6 Browse File

16 Aug, 2018 1 commit
- Fix leak in index.c · 581d5492
  abyss7 committed 6 years ago
  
  581d5492 Browse File
29 Jun, 2018 4 commits

settings: optional unsaved index safety · bfa1f022

Add the `GIT_OPT_ENABLE_UNSAVED_INDEX_SAFETY` option, which will cause
commands that reload the on-disk index to fail if the current
`git_index` has changed that have not been saved.  This will prevent
users from - for example - adding a file to the index then calling a
function like `git_checkout` and having that file be silently removed
from the index since it was re-read from disk.

Now calls that would re-read the index will fail if the index is
"dirty", meaning changes have been made to it but have not been written.
Users can either `git_index_read` to discard those changes explicitly,
or `git_index_write` to write them.

committed 6 years ago

bfa1f022 Browse File

index: return a unique error code on dirty index · 787768c2
```
When the index is dirty, return GIT_EINDEXDIRTY so that consumers can
identify the exact problem programatically.
```
Edward Thomson committed 6 years ago
787768c2 Browse File

index: commit the changes to the index properly · b242cdbf

Now that the index has a "dirty" state, where it has changes that have
not yet been committed or rolled back, our tests need to be adapted to
actually commit or rollback the changes instead of assuming that the
index can be operated on in its indeterminate state.

committed 6 years ago

b242cdbf Browse File

index: add a dirty bit reflecting unsaved changes · 7c56c49b

Teach the index when it is "dirty", and has unsaved changes. Consider
the index dirty whenever a caller has added or removed an entry from the
main index, REUC or NAME section, including when the index is completely
cleared. Similarly, consider the index _not_ dirty immediately after it
is written, or when it is read from the on-disk index.

This allows us to ensure that unsaved changes are not lost when we
automatically refresh the index.

committed 6 years ago

7c56c49b Browse File

10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
  ecf4f33a Browse File
01 Jun, 2018 1 commit

index: Fix alignment issues in write_disk_entry() · 93271f59

In order to avoid alignment issues on certain target architectures,
it is necessary to use memcpy() when modifying elements of a struct
inside a buffer returned by git_filebuf_reserve().

committed 6 years ago

93271f59 Browse File

23 May, 2018 2 commits

path: reject .gitmodules as a symlink · a7168b47

Any part of the library which asks the question can pass in the mode to have it
checked against `.gitmodules` being a symlink.

This is particularly relevant for adding entries to the index from the worktree
and for checking out files.

committed 6 years ago

a7168b47 Browse File

index: stat before creating the entry · 58ff913a

This is so we have it available for the path validity checking. In a later
commit we will start rejecting `.gitmodules` files as symlinks.

committed 6 years ago

58ff913a Browse File

10 Mar, 2018 3 commits

index: error out on unreasonable prefix-compressed path lengths · 3db1af1f

When computing the complete path length from the encoded
prefix-compressed path, we end up just allocating the complete path
without ever checking what the encoded path length actually is. This can
easily lead to a denial of service by just encoding an unreasonable long
path name inside of the index. Git already enforces a maximum path
length of 4096 bytes. As we also have that enforcement ready in some
places, just make sure that the resulting path is smaller than
GIT_PATH_MAX.

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago

3db1af1f Browse File

index: fix out-of-bounds read with invalid index entry prefix length · 3207ddb0

The index format in version 4 has prefix-compressed entries, where every
index entry can compress its path by using a path prefix of the previous
entry. Since implmenting support for this index format version in commit
5625d86b (index: support index v4, 2016-05-17), though, we do not
correctly verify that the prefix length that we want to reuse is
actually smaller or equal to the amount of characters than the length of
the previous index entry's path. This can lead to a an integer underflow
and subsequently to an out-of-bounds read.

Fix this by verifying that the prefix is actually smaller than the
previous entry's path length.

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago

3207ddb0 Browse File

index: convert `read_entry` to return entry size via an out-param · 58a6fe94

The function `read_entry` does not conform to our usual coding style of
returning stuff via the out parameter and to use the return value for
reporting errors. Due to most of our code conforming to that pattern, it
has become quite natural for us to actually return `-1` in case there is
any error, which has also slipped in with commit 5625d86b (index:
support index v4, 2016-05-17). As the function returns an `size_t` only,
though, the return value is wrapped around, causing the caller of
`read_tree` to continue with an invalid index entry. Ultimately, this
can lead to a double-free.

Improve code and fix the bug by converting the function to return the
index entry size via an out parameter and only using the return value to
indicate errors.

Reported-by: Krishna Ram Prakash R <krp@gtux.in>
Reported-by: Vivek Parikh <viv0411.parikh@gmail.com>

committed 6 years ago

58a6fe94 Browse File

18 Feb, 2018 1 commit

git_index_add_frombuffer: only accept files/links · 5f774dbf

Ensure that the buffer given to `git_index_add_frombuffer` represents a
regular blob, an executable blob, or a link. Explicitly reject commit
entries (submodules) - it makes little sense to allow users to add a
submodule from a string; there's no possible path to success.

committed 6 years ago

5f774dbf Browse File

16 Feb, 2018 1 commit

index: shut up warning on uninitialized variable · 7c6e9175

Even though the `entry` variable will always be initialized when
`read_entry` returns success and even though we never dereference
`entry` in case `read_entry` fails, GCC prints a warning about
uninitialized use. Just initialize the pointer to `NULL` in order to
shut GCC up.

committed 6 years ago

7c6e9175 Browse File

03 Jul, 2017 1 commit

Make sure to always include "common.h" first · 0c7f49dd

Next to including several files, our "common.h" header also declares
various macros which are then used throughout the project. As such, we
have to make sure to always include this file first in all
implementation files. Otherwise, we might encounter problems or even
silent behavioural differences due to macros or defines not being
defined as they should be. So in fact, our header and implementation
files should make sure to always include "common.h" first.

This commit does so by establishing a common include pattern. Header
files inside of "src" will now always include "common.h" as its first
other file, separated by a newline from all the other includes to make
it stand out as special. There are two cases for the implementation
files. If they do have a matching header file, they will always include
this one first, leading to "common.h" being transitively included as
first file. If they do not have a matching header file, they instead
include "common.h" as first file themselves.

This fixes the outlined problems and will become our standard practice
for header and source files inside of the "src/" from now on.

committed 7 years ago

0c7f49dd Browse File

06 Jun, 2017 7 commits

index: verify we have enough space left when writing index entries · 064a60e9

In our code writing index entries, we carry around a `disk_size`
representing how much memory we have in total and pass this value to
`git_encode_varint` to do bounds checks. This does not make much sense,
as at the time when passing on this variable it is already out of date.
Fix this by subtracting used memory from `disk_size` as we go along.
Furthermore, assert we've actually got enough space left to do the final
path memcpy.

committed 7 years ago

064a60e9 Browse File

index: fix shared prefix computation when writing index entry · c71dff7e

When using compressed index entries, each entry's path is preceded by a
varint encoding how long the shared prefix with the previous index entry
actually is. We currently encode a length of `(path_len - same_len)`,
which is doubly wrong. First, `path_len` is already set to `path_len -
same_len` previously. Second, we want to encode the shared prefix rather
than the un-shared suffix length.

Fix this by using `same_len` as the varint value instead.

committed 7 years ago

c71dff7e Browse File

index: also sanity check entry size with compressed entries · 83e0392c

We have a check in place whether the index has enough data left for the
required footer after reading an index entry, but this was only used for
uncompressed entries. Move the check down a bit so that it is executed
for both compressed and uncompressed index entries.

committed 7 years ago

83e0392c Browse File

index: remove file-scope entry size macros · 350d2c47

All index entry size computations are now performed in
`index_entry_size`. As such, we do not need the file-scope macros for
computing these sizes anymore. Remove them and move the `entry_size`
macro into the `index_entry_size` function.

committed 7 years ago

350d2c47 Browse File

index: don't right-pad paths when writing compressed entries · 46b67034

Our code to write index entries to disk does not check whether the
entry that is to be written should use prefix compression for the path.
As such, we were overallocating memory and added bogus right-padding
into the resulting index entries. As there is no padding allowed in the
index version 4 format, this should actually result in an invalid index.

Fix this by re-using the newly extracted `index_entry_size` function.

committed 7 years ago

46b67034 Browse File

index: move index entry size computation into its own function · 29f498e0

Create a new function `index_entry_size` which encapsulates the logic to
calculate how much space is needed for an index entry, whether it is
simple/extended or compressed/uncompressed. This can later be re-used by
our code writing index entries.

committed 7 years ago

29f498e0 Browse File

index: set last written index entry in foreach-entry-loop · 8ceb890b

The last written disk entry is currently being written inside of the
function `write_disk_entry`. Make behavior a bit more obviously by
instead setting it inside of `write_entries` while iterating all
entries.

committed 7 years ago

8ceb890b Browse File