Commits · a080037cde55ce3dccdc664b5bfaf4362d67acbb · lvzhengyang / git2

15 Jun, 2019 1 commit

wildmatch: import wildmatch from git.git · a9f57629

In commit 70a8fc999d (stop using fnmatch (either native or
compat), 2014-02-15), upstream git has switched over all code
from their internal fnmatch copy to its new wildmatch code. We
haven't followed suit, and thus have developed some
incompatibilities in how we match regular expressions.

Import git's wildmatch from v2.22.0 and add a test suite based on
their t3070-wildmatch.sh tests.

committed 5 years ago

14 Jun, 2019 3 commits

posix: remove `p_fallocate` abstraction · 2d85c7e8

By now, we have repeatedly failed to provide a nice
cross-platform implementation of `p_fallocate`. Recent tries to
do that escalated quite fast to a set of different CMake checks,
implementations, fallbacks, etc., which started to look real
awkward to maintain. In fact, `p_fallocate` had only been
introduced in commit 4e3949b7 (tests: test that largefiles can
be read through the tree API, 2019-01-30) to support a test with
large files, but given the maintenance costs it just seems not to
be worth it.

As we have removed the sole user of `p_fallocate` in the previous
commit, let's drop it altogether.

committed 5 years ago

apply: add an options struct initializer · c0dd7122
Edward Thomson committed 5 years ago

Rename opt init functions to `options_init` · 0b5ba0d7

In libgit2 nomenclature, when we need to verb a direct object, we name
a function `git_directobject_verb`.  Thus, if we need to init an options
structure named `git_foo_options`, then the name of the function that
does that should be `git_foo_options_init`.

The previous name, `git_foo_init_options`, is close - it _sounds_ as
if it's initializing the options of a `foo`, but in fact
`git_foo_options` is its own noun that should be respected.

Deprecate the old names; they'll now call directly to the new ones.
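
The naming rule above can be sketched as follows. This is a minimal illustration, not libgit2's actual code: the struct layout, the `GIT_FOO_OPTIONS_VERSION` constant, and the error value `-1` are assumptions made for the example, and `git_foo_options` is the placeholder name the message itself uses.

```c
#include <string.h>

/* Hypothetical version constant for the placeholder struct below. */
#define GIT_FOO_OPTIONS_VERSION 1

/* Placeholder options struct; "git_foo_options" is treated as one noun. */
typedef struct {
	unsigned int version;
	unsigned int flags;
} git_foo_options;

/* New-style name: git_<directobject>_<verb>, i.e. git_foo_options + init. */
int git_foo_options_init(git_foo_options *opts, unsigned int version)
{
	if (opts == NULL || version != GIT_FOO_OPTIONS_VERSION)
		return -1;

	memset(opts, 0, sizeof(*opts));
	opts->version = version;
	return 0;
}

/* Old-style name, kept only as a thin forwarder to the new function. */
int git_foo_init_options(git_foo_options *opts, unsigned int version)
{
	return git_foo_options_init(opts, version);
}
```

Keeping the deprecated name as a one-line forwarder means both spellings behave identically while callers migrate.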

committed 5 years ago

19 May, 2019 8 commits

core::posix: skip some locale tests on win32 · 09902985
Edward Thomson committed 5 years ago

tests: regcomp: use proper character classes · 8877d7d3
```
The '[[:digit:]]' and '[[:alpha:]]' classes require double brackets, not
single.
```
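
A small sketch of the difference, using the plain POSIX `regcomp`/`regexec` calls rather than libgit2's `p_` wrappers. The helper name `sketch_regex_matches` is hypothetical. With single brackets, `[:digit:]` is typically parsed as an ordinary bracket expression listing the characters `:`, `d`, `i`, `g` and `t`, so it does not match digits.

```c
#include <regex.h>
#include <stddef.h>

/*
 * Returns 1 if `pattern` matches somewhere in `subject`, 0 if it does not,
 * and -1 if the pattern fails to compile. Failure of regcomp is detected
 * with `!= 0`, since the error codes are nonzero but not necessarily
 * negative.
 */
int sketch_regex_matches(const char *pattern, const char *subject)
{
	regex_t re;
	int rc;

	if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) != 0)
		return -1;

	rc = regexec(&re, subject, 0, NULL, 0);
	regfree(&re);

	return rc == 0 ? 1 : 0;
}
```

Calling `sketch_regex_matches("[[:digit:]]", "7")` reports a match, while the single-bracket pattern `"[:digit:]"` is not expected to match `"7"`.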
Edward Thomson committed 5 years ago
tests: regcomp: test that regex functions succeed · ca1b07a2
```
The regex functions return nonzero (not necessarily negative values) on
failure.
```
Edward Thomson committed 5 years ago

tests: regcomp: assert character groups do match normal alphabet · aea9a712

In order to avoid us being unable to match characters which are part of
the normal US alphabet in certain weird languages, add two tests to
catch this behavior.

committed 5 years ago

tests: regex: restructure setup of locales · e207b2a2

In order to make it easier to add more locale-related tests, add a
generalized framework handling initial setup of languages as well as the
cleanup of them afterwards.

committed 5 years ago

tests: regex: add test with LC_COLLATE being set · b055a6b5

While we already have a test for `p_regexec` with `LC_CTYPE` being
modified, `regexec` also alters behavior as soon as `LC_COLLATE` is
being modified. Most importantly, `LC_COLLATE` changes how ranges are
interpreted, to the point of not handling them at all. Thus, ensure
that either we use `regcomp_l` to avoid this, or that we've fallen back
to our builtin regex functionality which also behaves properly.

committed 5 years ago

tests: fix p_regcomp test not checking return type · ad4ede91
```
While the test asserts that the error value indicates a non-value, it is
actually never getting assigned to. Fix this.
```
Patrick Steinhardt committed 5 years ago

regexec: prefix all regexec function calls with p_ · 02683b20

Prefix all the calls to the regexec family of functions with `p_`.
This allows us to swap out all the regular expression functions with our
own implementation.  Move the declarations to `posix_regex.h` for
simpler inclusion.

committed 5 years ago

22 Feb, 2019 1 commit

p_fallocate: add a test for our implementation · 0345a380
Edward Thomson committed 5 years ago

15 Feb, 2019 9 commits

oidmap: remove legacy low-level interface · bd66925a

Remove the low-level interface that was exposing implementation details of
`git_oidmap` to callers. From now on, only the high-level functions shall be
used to retrieve or modify values of a map. Adjust remaining existing callers.

committed 6 years ago

strmap: remove legacy low-level interface · fdfabdc4

Remove the low-level interface that was exposing implementation details of
`git_strmap` to callers. From now on, only the high-level functions shall be
used to retrieve or modify values of a map. Adjust remaining existing callers.

committed 6 years ago

maps: provide high-level iteration interface · 18cf5698

Currently, our headers need to leak some implementation details of maps due to
their direct use of indices in the implementation of their foreach macros. This
makes it impossible to completely hide the map structures away, and also makes
it impossible to include the khash implementation header in the C files of the
respective map only.

This is now being fixed by providing a high-level iteration interface
`map_iterate`, which takes as inputs the map that shall be iterated over, an
iterator as well as the locations where keys and values shall be put into. For
simplicity's sake, the iterator is a simple `size_t` that shall be initialized to
`0` on the first call. All existing foreach macros are then adjusted to make use
of this new function.
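
A rough sketch of such an iteration interface over a toy map. Everything here is hypothetical: `sketch_slotmap`, `sketch_slotmap_iterate`, the parameter order, and the `SKETCH_ITEROVER` code are illustration only and not the real map implementation, which is built on khash.

```c
#include <stddef.h>

#define SKETCH_ITEROVER (-1)   /* hypothetical "iteration finished" code */
#define SKETCH_SLOTS 8

/* Toy map: a fixed table of slots, some of which are unused. */
typedef struct {
	const char *keys[SKETCH_SLOTS];   /* NULL marks an empty slot */
	void *values[SKETCH_SLOTS];
} sketch_slotmap;

/*
 * Advance `*iter` to the next occupied slot and report its key and value.
 * The caller initializes `*iter` to 0 before the first call and otherwise
 * treats it as opaque, so no map internals leak into calling code.
 */
int sketch_slotmap_iterate(void **value, const sketch_slotmap *map,
			   size_t *iter, const char **key)
{
	size_t i = *iter;

	/* Skip over empty slots. */
	while (i < SKETCH_SLOTS && map->keys[i] == NULL)
		i++;

	if (i >= SKETCH_SLOTS)
		return SKETCH_ITEROVER;

	if (key)
		*key = map->keys[i];
	if (value)
		*value = map->values[i];

	*iter = i + 1;
	return 0;
}
```

A caller would declare `size_t iter = 0;` and loop while the function returns `0`; a foreach macro can wrap exactly that loop.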

committed 6 years ago

oidmap: introduce high-level setter for key/value pairs · 2e0a3048

Currently, one would use either `git_oidmap_insert` to insert key/value pairs
into a map or `git_oidmap_put` to insert a key only. These functions have
historically been macros, which is why their syntax is kind of weird: instead of
returning an error code directly, they instead have to be passed a pointer to
where the return value shall be stored. This does not match libgit2's common
idiom of directly returning error codes. Furthermore, `git_oidmap_put` is tightly
coupled with implementation details of the map as it exposes the index of
inserted entries.

Introduce a new function `git_oidmap_set`, which takes as parameters the map,
key and value and directly returns an error code. Convert all trivial callers of
`git_oidmap_insert` and `git_oidmap_put` to make use of it.
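
To illustrate the shape of such a setter, here is a sketch over a toy fixed-capacity map. The `sketch_` types and the linear search are assumptions for the example, not the real `git_oidmap`. The point is only that the status comes back as the return value and no index is exposed.

```c
#include <string.h>

#define SKETCH_OID_SIZE 20
#define SKETCH_CAPACITY 16

/* Hypothetical stand-ins for an object ID and an OID-keyed map. */
typedef struct { unsigned char id[SKETCH_OID_SIZE]; } sketch_oid;

typedef struct {
	const sketch_oid *keys[SKETCH_CAPACITY];
	void *values[SKETCH_CAPACITY];
	size_t size;
} sketch_oidmap;

/* Insert or overwrite `key`; returns 0 on success and -1 on error. */
int sketch_oidmap_set(sketch_oidmap *map, const sketch_oid *key, void *value)
{
	size_t i;

	for (i = 0; i < map->size; i++) {
		if (memcmp(map->keys[i]->id, key->id, SKETCH_OID_SIZE) == 0) {
			map->values[i] = value;   /* existing key: overwrite */
			return 0;
		}
	}

	if (map->size == SKETCH_CAPACITY)
		return -1;                    /* toy stand-in for out-of-memory */

	map->keys[map->size] = key;
	map->values[map->size] = value;
	map->size++;
	return 0;
}
```

A call site then reads `if (sketch_oidmap_set(&map, &oid, obj) < 0) return -1;`, which matches the usual error-propagation idiom.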

committed 6 years ago

oidmap: introduce high-level getter for values · 9694ef20

The current way of looking up an entry from a map is tightly coupled with the
map implementation, as one first has to look up the index of the key and then
retrieve the associated value by using the index. As a caller, you usually do
not care about any indices at all, though, so this is more complicated than
really necessary. Furthermore, it invites errors if the correct
error checking sequence is not being followed.

Introduce a new high-level function `git_oidmap_get` that takes a map and a key
and returns a pointer to the associated value if such a key exists. Otherwise,
a `NULL` pointer is returned. Adjust all callers that can trivially be
converted.
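
A sketch of what such a getter looks like from the caller's side, again over a toy structure. `sketch_table`, `sketch_id` and the linear scan are hypothetical and stand in for the real hash map.

```c
#include <stddef.h>
#include <string.h>

#define SKETCH_ID_SIZE 20

/* Hypothetical stand-ins for an object ID and a map entry. */
typedef struct { unsigned char id[SKETCH_ID_SIZE]; } sketch_id;

typedef struct {
	const sketch_id *key;
	void *value;
} sketch_entry;

typedef struct {
	const sketch_entry *entries;
	size_t size;
} sketch_table;

/* Return the value stored under `key`, or NULL if the key is absent. */
void *sketch_table_get(const sketch_table *table, const sketch_id *key)
{
	size_t i;

	for (i = 0; i < table->size; i++) {
		if (memcmp(table->entries[i].key->id, key->id, SKETCH_ID_SIZE) == 0)
			return table->entries[i].value;
	}

	return NULL;
}
```

One consequence of this design is that a `NULL` result cannot distinguish a missing key from a key whose stored value is `NULL`, so callers that store `NULL` values need a separate existence check.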

committed 6 years ago

strmap: introduce high-level setter for key/value pairs · 03555830

Currently, one would use the function `git_strmap_insert` to insert key/value
pairs into a map. This function has historically been a macro, which is why its
syntax is kind of weird: instead of returning an error code directly, it instead
has to be passed a pointer to where the return value shall be stored. This does
not match libgit2's common idiom of directly returning error codes.

Introduce a new function `git_strmap_set`, which takes as parameters the map,
key and value and directly returns an error code. Convert all callers of
`git_strmap_insert` to make use of it.

committed 6 years ago

strmap: introduce `git_strmap_get` and use it throughout the tree · ef507bc7

Introduce a new high-level function `git_strmap_get` that takes a map and a key
and returns a pointer to the associated value if such a key exists. Otherwise,
a `NULL` pointer is returned. Adjust all callers that can trivially be
converted.

committed 6 years ago

maps: provide a uniform entry count interface · 7e926ef3

There currently exist two different function names for getting the entry count
of maps, where offset and string maps use `num_entries` and OID maps use
`size`. In most programming languages with built-in map types, this is simply
called `size`, which is also shorter to type. Thus, this commit renames the
other two functions `num_entries` to match the common way and adjusts all
callers.

committed 6 years ago

maps: use uniform lifecycle management functions · 351eeff3

Currently, the lifecycle functions for maps (allocation, deallocation, resize)
are not named in a uniform way and do not have a uniform function signature.
Rename the functions to fix that, and stick to libgit2's naming scheme of saying
`git_foo_new`. This results in the following new interface for allocation:

- `int git_<t>map_new(git_<t>map **out)` to allocate a new map, returning an
  error code if we ran out of memory

- `void git_<t>map_free(git_<t>map *map)` to free a map

- `void git_<t>map_clear(git_<t>map *map)` to remove all entries from a map

This commit also fixes all existing callers.
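
The three signatures listed above can be sketched for a toy map type as follows. `sketch_map` and its fields are hypothetical and only the function shapes mirror the interface described here.

```c
#include <stdlib.h>

/* Toy map that owns one heap-allocated entry array. */
typedef struct {
	void **entries;
	size_t size;
} sketch_map;

/* Allocate an empty map into `*out`; returns -1 if allocation fails. */
int sketch_map_new(sketch_map **out)
{
	sketch_map *map = calloc(1, sizeof(*map));

	if (map == NULL)
		return -1;

	*out = map;
	return 0;
}

/* Remove all entries but keep the map itself usable. */
void sketch_map_clear(sketch_map *map)
{
	if (map == NULL)
		return;

	free(map->entries);
	map->entries = NULL;
	map->size = 0;
}

/* Release the map and everything it owns; NULL is accepted. */
void sketch_map_free(sketch_map *map)
{
	if (map == NULL)
		return;

	sketch_map_clear(map);
	free(map);
}
```

Returning the error code and passing the new object through an out-parameter keeps allocation failures on the same error path as every other call.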

committed 6 years ago

25 Jan, 2019 2 commits

test: cast to a char the zstream test · 3fba5891
Edward Thomson committed 6 years ago

deprecation: move deprecated tests into their own file · 9c5e05ad
```
Move the deprecated stream tests into their own compilation unit.  This
will allow us to disable any preprocessor directives that apply to
deprecation just for these tests (eg, disabling `GIT_DEPRECATED_HARD`).
```
Edward Thomson committed 6 years ago

22 Jan, 2019 1 commit

git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
Edward Thomson committed 6 years ago

06 Jan, 2019 1 commit

Attempt at fixing the MingW64 compilation · b5e8272f
```
It seems like MingW64's size_t is defined differently than in Linux.
```
lhchavez committed 6 years ago

28 Nov, 2018 4 commits

stream registration: take an enum type · 02bb39f4

Accept an enum (`git_stream_t`) during custom stream registration that
indicates whether the registration structure should be used for standard
(non-TLS) streams or TLS streams.

committed 6 years ago

stream: provide generic registration API · df2cc108

Update the new stream registration API to be `git_stream_register`
which takes a registration structure and a TLS boolean.  This allows
callers to register non-TLS streams as well as TLS streams.

Provide `git_stream_register_tls` that takes just the init callback for
backward compatibility.

committed 6 years ago

tls: introduce a wrap function · 43b592ac

Introduce `git_tls_stream_wrap` which will take an existing `stream`
with an already connected socket and begin speaking TLS on top of it.
This is useful if you've built a connection to a proxy server and you
wish to begin CONNECT over it to tunnel a TLS connection.

Also update the pluggable TLS stream layer so that it can accept a
registration structure that provides an `init` and `wrap` function,
instead of a single initialization function.

committed 6 years ago

khash: remove intricate knowledge of khash types · 852bc9f4

Instead of using the `khiter_t`, `git_strmap_iter` and `khint_t` types,
simply use `size_t` instead. This decouples code from the khash stuff
and makes it possible to move the khash includes into the implementation
files.

committed 6 years ago

14 Nov, 2018 1 commit

strntol: fix out-of-bounds reads when parsing numbers with leading sign · 4209a512

When parsing a number, we accept a leading plus or minus sign to return
a positive or negative number. When the parsed string has such a leading
sign, we set up a flag indicating that the number is negative and
advance the pointer to the next character in that string. This misses
updating the number of bytes in the string, though, which is why the
parser may later on do an out-of-bounds read.

Fix the issue by correctly updating both the pointer and the number of
remaining bytes. Furthermore, we need to check whether we actually have
any bytes left after having advanced the pointer, as otherwise the
auto-detection of the base may do an out-of-bounds access. Add a test
that detects the out-of-bound read.
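
The fix described above can be sketched as a small helper that consumes the sign. The name `sketch_consume_sign` and its return convention are hypothetical and this is not the actual `git__strntol` code. What matters is that the pointer and the remaining length move together, and that an exhausted buffer is reported before anything else looks at it.

```c
#include <stddef.h>

/*
 * Consume an optional leading '+' or '-' from a length-bounded buffer.
 * Both the pointer and the remaining length are advanced together.
 * Returns -1 if nothing is left to parse afterwards, 0 otherwise.
 */
int sketch_consume_sign(const char **p, size_t *len, int *negative)
{
	*negative = 0;

	if (*len == 0)
		return -1;

	if (**p == '-' || **p == '+') {
		*negative = (**p == '-');
		(*p)++;
		(*len)--;
	}

	/* A bare sign leaves no bytes for base detection or digits. */
	return *len == 0 ? -1 : 0;
}
```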

Note that this is not actually security critical. While there are a lot
of places where the function is called, all of these places are guarded
or irrelevant:

- commit list: this operates on objects from the ODB, which are always
  NUL terminated and may thus not trigger the off-by-one OOB read.

- config: the configuration is NUL terminated.

- curl stream: user input is being parsed that is always NUL terminated

- index: the index is read via `git_futils_readbuffer`, which always NUL
  terminates it.

- loose objects: used to parse the length from the object's header. As
  we check previously that the buffer contains a NUL byte, this is safe.

- rebase: this parses numbers from the rebase instruction sheet. As the
  rebase code uses `git_futils_readbuffer`, the buffer is always NUL
  terminated.

- revparse: this parses a user provided buffer that is NUL terminated.

- signature: this parses the header information of objects. As objects
  read from the ODB are always NUL terminated, this is a non-issue. The
  constructor `git_signature_from_buffer` does not accept a length
  parameter for the buffer, so the buffer needs to be NUL terminated, as
  well.

- smart transport: the buffer that is parsed is NUL terminated

- tree cache: this parses the tree cache from the index extension. The
  index itself is read via `git_futils_readbuffer`, which always NUL
  terminates it.

- winhttp transport: user input is being parsed that is always NUL
  terminated

committed 6 years ago

02 Nov, 2018 2 commits

strntol: fix detection and skipping of base prefixes · 50d09407

The `git__strntol` family of functions has the ability to auto-detect
a number's base if the string has either the common '0x' prefix for
hexadecimal numbers or '0' prefix for octal numbers. The detection of
such prefixes and following handling has two major issues though that are
being fixed in one go now.

- We do not do any bounds checking previous to verifying the '0x' base.
  While we do verify that there is at least one digit available
  previously, we fail to verify that there are two digits available and
  thus may do an out-of-bounds read when parsing this
  two-character-prefix.

- When skipping the prefix of such numbers, we only update the pointer
  without also updating the number of remaining bytes. Thus if we
  try to parse a number '0x1' of total length 3, we will first skip the
  first two bytes and then try to read 3 bytes starting at '1'.

Fix both issues by disentangling the logic. Instead of doing the
detection and skipping of such prefixes in one go, we will now first try
to detect the base while also honoring how many bytes are left. Only if
we have a valid base that is either 8 or 16 and have one of the known
prefixes, we will now advance the pointer and update the remaining bytes
in one step.

Add some tests that verify that no out-of-bounds parsing happens and
that autodetection works as advertised.
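
A sketch of the disentangled logic: detect the base first while respecting the number of bytes left, then skip a known prefix by moving pointer and length in one step. `sketch_detect_base` is a hypothetical helper, not the real function, and it leaves the handling of whatever follows the prefix to the digit loop.

```c
#include <stddef.h>

/*
 * Detect the base of a length-bounded number and skip a "0x"/"0X" or "0"
 * prefix. `base` is 0 for auto-detection, or an explicit base.
 * Returns the base that the digit loop should use.
 */
int sketch_detect_base(const char **p, size_t *len, int base)
{
	const char *s = *p;
	size_t n = *len;

	/* Detection: never look at s[1] unless two bytes are available. */
	if (base == 0) {
		if (n >= 2 && s[0] == '0' && (s[1] == 'x' || s[1] == 'X'))
			base = 16;
		else if (n >= 1 && s[0] == '0')
			base = 8;
		else
			base = 10;
	}

	/* Skipping: pointer and remaining length are updated together. */
	if (base == 16 && n >= 2 && s[0] == '0' && (s[1] == 'x' || s[1] == 'X')) {
		*p += 2;
		*len -= 2;
	} else if (base == 8 && n >= 1 && s[0] == '0') {
		*p += 1;
		*len -= 1;
	}

	return base;
}
```

For the `'0x1'` example of total length 3 from the message, this leaves the pointer at `'1'` with exactly one byte remaining.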

committed 6 years ago

strntol: fix out-of-bounds read when skipping leading spaces · 41863a00

The `git__strntol` family of functions accepts leading spaces and will
simply skip them. The skipping will not honor the provided buffer's
length, though, which may lead it to read outside of the provided
buffer's bounds if it is not a simple NUL-terminated string.
Furthermore, if leading space is trimmed, the function will further
advance the pointer but not update the number of remaining bytes, which
may also lead to out-of-bounds reads.

Fix the issue by properly paying attention to the buffer length and
updating it when stripping leading whitespace characters. Add a test
that verifies that we won't read past the provided buffer length.
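
In sketch form, the whitespace skipping then becomes a loop that checks the remaining length before every read. The helper name `sketch_skip_spaces` is hypothetical.

```c
#include <ctype.h>
#include <stddef.h>

/* Skip leading whitespace without ever reading past `*len` bytes. */
void sketch_skip_spaces(const char **p, size_t *len)
{
	/* The length check comes first, so `**p` is only read in bounds. */
	while (*len > 0 && isspace((unsigned char)**p)) {
		(*p)++;
		(*len)--;
	}
}
```

The cast to `unsigned char` avoids undefined behavior in `isspace` for bytes with the high bit set.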

committed 6 years ago

25 Oct, 2018 1 commit

util: provide `git__memmem` function · 83e8a6b3

Unfortunately, neither the `memmem` nor the `strnstr` functions are part
of any C standard but are merely extensions of C that are implemented by
e.g. glibc. Thus, there is no standardized way to search for a string in
a block of memory with a limited size, and using `strstr` is to be
considered unsafe in case where the buffer has not been sanitized. In
fact, there are some uses of `strstr` in exactly that unsafe way in our
codebase.

Provide a new function `git__memmem` that implements the `memmem`
semantics. That is, in a given haystack of `n` bytes, search for the
occurrence of a byte sequence of `m` bytes and return a pointer to the
first occurrence. The implementation chosen is the "Not So Naive"
algorithm from [1]. It was chosen as the implementation is comparably
simple while still being reasonably efficient in most cases.
Preprocessing happens in constant time and space, searching has a time
complexity of O(n*m) with a slightly sub-linear average case.

[1]: http://www-igm.univ-mlv.fr/~lecroq/string/
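
As an illustration, the following is a sketch of a length-bounded search in the style of the "Not So Naive" algorithm as described in [1]. It is not the `git__memmem` implementation. The name `sketch_memmem`, the special-casing of one-byte needles via `memchr`, and returning `NULL` for an empty needle are choices made for this example.

```c
#include <stddef.h>
#include <string.h>

/*
 * Find the first occurrence of `needle` (nlen bytes) in `haystack`
 * (hlen bytes). Neither buffer needs to be NUL terminated.
 */
const void *sketch_memmem(const void *haystack, size_t hlen,
			  const void *needle, size_t nlen)
{
	const unsigned char *h = haystack, *n = needle;
	size_t j, k, l;

	if (nlen == 0 || hlen < nlen)
		return NULL;
	if (nlen == 1)
		return memchr(h, n[0], hlen);

	/* Constant-time preprocessing on the first two needle bytes. */
	if (n[0] == n[1]) {
		k = 2;   /* shift after a mismatch on the second byte */
		l = 1;   /* shift after the second byte matched */
	} else {
		k = 1;
		l = 2;
	}

	j = 0;
	while (j <= hlen - nlen) {
		if (n[1] != h[j + 1]) {
			j += k;
		} else {
			/* Compare the tail first, then the first byte. */
			if (memcmp(n + 2, h + j + 2, nlen - 2) == 0 && n[0] == h[j])
				return h + j;
			j += l;
		}
	}

	return NULL;
}
```

Because both lengths are explicit, the search works on buffers with embedded NUL bytes, which is exactly the case where `strstr` is unsafe.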

committed 6 years ago

19 Oct, 2018 1 commit

util: fix out of bounds read in error message · ea19efc1

When an integer that is parsed with `git__strntol32` is too big to fit
into an int32, we will generate an error message that includes the
actual string that failed to parse. This does not acknowledge the fact
that the string may either not be NUL terminated or alternatively include
additional characters after the number that is to be parsed. We may thus
end up printing characters into the buffer that aren't the number or,
worse, read out of bounds.

Fix the issue by utilizing the `endptr` that was set by
`git__strntol64`. This pointer is guaranteed to be set to the first
character following the number, and we can thus use it to compute the
width of the number that shall be printed. Create a test to verify that
we correctly truncate the number.
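
The width computation can be sketched with the `%.*s` conversion, which prints at most the given number of bytes and does not require a NUL terminator within them. The function name `sketch_format_overflow` and the exact message wording are illustrative assumptions.

```c
#include <stdio.h>

/*
 * Format an overflow message that prints only the bytes between `nptr`
 * and `endptr`, i.e. the number itself, never the rest of the buffer.
 */
int sketch_format_overflow(char *out, size_t outlen,
			   const char *nptr, const char *endptr)
{
	/* Width of the parsed number, as reported by the 64-bit parser. */
	int width = (int)(endptr - nptr);

	return snprintf(out, outlen,
			"failed to convert: '%.*s' is too large", width, nptr);
}
```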

committed 6 years ago

18 Oct, 2018 3 commits

tests: core::strtol: test for some more edge-cases · 39087ab8

Some edge cases were currently completely untested, e.g. parsing numbers
greater than INT64_{MIN,MAX}, truncating buffers by length and invalid
characters. Add tests to verify that the system under test performs as
expected.

committed 6 years ago

util: remove `git__strtol32` · 8d7fa88a

The function `git__strtol32` can easily be misused when untrusted data
is passed to it that may not have been sanitized with trailing `NUL`
bytes. As all usages of this function have now been removed, we can
remove this function altogether to avoid future misuse of it.

committed 6 years ago

util: remove unsafe `git__strtol64` function · 68deb2cc

The function `git__strtol64` does not take a maximum buffer length as
parameter. This has led to some unsafe usages of this function, and as
such we may consider it as being unsafe to use. As we have now
eradicated all usages of this function, let's remove it completely to
avoid future misuse.

committed 6 years ago

05 Oct, 2018 2 commits

tests: sanitize file hierarchy after running rmdir tests · ad273718

Currently, we do not clean up after ourselves after tests in core::rmdir
have created new files in the directory hierarchy. This may leave stale
files and/or directories after having run tests, confusing subsequent
tests that expect a pristine test environment. Most importantly, it may
cause the test initialization to fail which expects being able to
re-create the testing hierarchy before each test in case where another
test hasn't cleaned up after itself.

Fix the issue by adding a cleanup function that removes the temporary
testing hierarchy after each test if it still exists.

committed 6 years ago

tests: Add some more tests for git_futils_rmdir_r · e886ab46
```
Signed-off-by: Sven Strickroth <email@cs-ware.de>
```
Sven Strickroth committed 6 years ago