Commits · 1d683c1d2e36631cfe7ff7e9fa930b0773604000 · lvzhengyang / git2

02 Nov, 2016 1 commit

pack: fix race in pack_entry_find_offset · 0cf15e39

In `pack_entry_find_offset`, we try to find the offset of a
certain object in the pack file. To do so, we first check whether the
packfile has already been opened and open it if not. Opening the
packfile is guarded with a mutex, so concurrent access to this is
in fact safe.

What is not thread-safe though is our calculation of offsets
inside the packfile. Assume two threads calling
`pack_entry_find_offset` at the same time. We first calculate the
offset and index location and only then determine if the pack has
already been opened. If so, we re-calculate the offset and index
address.

Now the case for two threads: thread 1 first calculates the
addresses and is subsequently suspended. The second thread will
now call `pack_index_open` and initialize the pack file,
calculating its addresses correctly. When the first thread is
resumed, it will see that the pack file has already been
initialized and will happily proceed with the addresses it has
already calculated before the check. As the pack file was not
initialized before, these addresses are bogus.

Fix the issue by only calculating the addresses after having
checked if the pack file is open.

committed 8 years ago
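
The ordering described in this message can be illustrated with a small single-threaded sketch. Everything here (`toy_pack`, `toy_index_open`, `toy_find_entry`, the 4-byte header) is a hypothetical stand-in rather than the library's actual structures, and the mutex that would guard the open step is left out to keep the example in one file.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a pack whose index is mapped lazily. */
struct toy_pack {
    const unsigned char *index_data; /* NULL until the index is opened */
    size_t index_len;
};

static const unsigned char toy_index[] = { 0xff, 0x74, 0x4f, 0x63, 10, 20, 30, 40 };

/* In real code this step would run under a mutex; locking is elided here. */
static int toy_index_open(struct toy_pack *p)
{
    if (p->index_data == NULL) {
        p->index_data = toy_index;
        p->index_len = sizeof(toy_index);
    }
    return 0;
}

/* Fixed ordering: make sure the index is open, then derive addresses. */
static int toy_find_entry(struct toy_pack *p, size_t pos, unsigned char *out)
{
    const unsigned char *entries;

    assert(p != NULL && out != NULL);

    if (toy_index_open(p) < 0)
        return -1;

    /* Only now is index_data valid, so only now compute offsets from it. */
    entries = p->index_data + 4; /* skip an assumed 4-byte header */
    if (pos >= p->index_len - 4)
        return -1;

    *out = entries[pos];
    return 0;
}
```

Computing `entries` before the call to `toy_index_open` would derive it from a `NULL` base pointer, which is the kind of bogus address the message describes.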

26 May, 2016 1 commit

delta: move delta application to delta.c · 6a2d2f8a

Move the delta application functions into `delta.c`, next to the
similar delta creation functions.  Make the `git__delta_apply`
functions adhere to other naming and parameter style within the
library.

committed 8 years ago

02 May, 2016 1 commit

odb: avoid inflating the full delta to read the header · a97b769a

When we read the header, we want to know the size and type of the
object. We're currently inflating the full delta in order to read the
first few bytes. This can mean hundreds of kB needlessly inflated for
large objects.

Instead use a packfile stream to read just enough so we can read the two
varints in the header and avoid inflating most of the delta.

committed 8 years ago
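
As a rough illustration of "the two varints in the header", the sketch below decodes the two size fields at the start of a delta buffer (7 bits per byte, least significant group first, high bit set while more bytes follow). It only covers the parsing step on bytes that have already been inflated; the streaming inflate itself is not shown, and the function names are made up for this example.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Decode one size field. Returns 0 on success, -1 if the buffer ends
 * early or the encoding is longer than a size_t can hold.
 */
static int delta_header_size(size_t *out, const unsigned char **cursor,
                             const unsigned char *end)
{
    const unsigned char *p;
    size_t size = 0;
    unsigned int shift = 0;
    unsigned char c;

    assert(out && cursor && *cursor && end);
    p = *cursor;

    do {
        if (p == end || shift >= sizeof(size_t) * 8)
            return -1;
        c = *p++;
        size |= (size_t)(c & 0x7f) << shift;
        shift += 7;
    } while (c & 0x80);

    *out = size;
    *cursor = p;
    return 0;
}

/* Read the base size and result size; only the header bytes are touched. */
static int delta_read_header(size_t *base_size, size_t *result_size,
                             const unsigned char *delta, size_t len)
{
    const unsigned char *p = delta, *end = delta + len;

    if (delta_header_size(base_size, &p, end) < 0)
        return -1;
    return delta_header_size(result_size, &p, end);
}
```

Since each size rarely needs more than a handful of bytes, a caller only has to inflate a small prefix of the delta before handing it to `delta_read_header`.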

07 Mar, 2016 1 commit

odb: improved not found error messages · e10144ae

When looking up an abbreviated oid, show the actual (abbreviated) oid
the caller passed instead of a full (but ambiguously truncated) oid.

committed 8 years ago
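
One way to produce such a message is to format only as many hex digits as the caller supplied. The helper below is a hypothetical sketch (its name and the 20-byte id size are assumptions for illustration), not the function the commit touches.

```c
#include <assert.h>
#include <stddef.h>

#define TOY_OID_RAWSZ 20
#define TOY_OID_HEXSZ (TOY_OID_RAWSZ * 2)

/* Write the first hex_len hex digits of id into out (NUL-terminated). */
static int toy_oid_fmt_prefix(char *out, size_t out_size,
                              const unsigned char *id, size_t hex_len)
{
    static const char hex[] = "0123456789abcdef";
    size_t i;

    assert(out && id);

    if (hex_len > TOY_OID_HEXSZ || out_size < hex_len + 1)
        return -1;

    for (i = 0; i < hex_len; i++) {
        unsigned char byte = id[i / 2];
        /* Even positions take the high nibble, odd positions the low one. */
        out[i] = hex[(i % 2 == 0) ? (byte >> 4) : (byte & 0x0f)];
    }
    out[hex_len] = '\0';
    return 0;
}
```

With a 5-digit lookup, the error would then show the 5 digits the caller typed rather than 40 digits whose tail is padding.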

25 Feb, 2016 2 commits
- pack: don't allow a negative offset · 6d97beb9
  Carlos Martín Nieto committed 8 years ago
  
- pack: make sure we don't go out of bounds for extended entries · ea9e00cb
  
  A corrupt index might have data that tells us to go look past the end of
  the file for data. Catch these cases and return an appropriate error
  message.
  
  Carlos Martín Nieto committed 8 years ago
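
The two checks in the commits above can be sketched together. In a version 2 pack index, a 32-bit offset entry with its top bit set selects an entry in a table of 8-byte big-endian offsets. The function below is an illustrative sketch under that layout; `resolve_offset` and the `ext_count` parameter (how many extended entries actually fit in the mapped file) are assumptions, not the library's code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Returns 0 and sets *out on success, -1 if the entry is corrupt. */
static int resolve_offset(int64_t *out, uint32_t entry,
                          const unsigned char *ext_table, size_t ext_count)
{
    uint64_t wide = 0;
    uint32_t idx;
    size_t i;

    assert(out);

    /* Top bit clear: the entry is the offset itself. */
    if (!(entry & 0x80000000u)) {
        *out = (int64_t)entry;
        return 0;
    }

    idx = entry & 0x7fffffffu;
    if (ext_table == NULL || idx >= ext_count)
        return -1; /* corrupt index: would read past the table */

    /* Assemble the 8-byte big-endian value. */
    for (i = 0; i < 8; i++)
        wide = (wide << 8) | ext_table[(size_t)idx * 8 + i];

    if (wide > (uint64_t)INT64_MAX)
        return -1; /* would turn into a negative offset */

    *out = (int64_t)wide;
    return 0;
}
```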
09 Feb, 2016 1 commit

pack: do not free passed in pointer on error · a53d2e39

The function `git_packfile_stream_open` tries to free the passed
in stream when an error occurs. The only call site is
`git_indexer_append`, though, which passes in the address of a
stream struct which has not been allocated on the heap.

Fix the issue by simply removing the call to free. In case of an
error we did not allocate any memory yet and otherwise it should
be the caller's responsibility to manage its object's lifetime.

committed 8 years ago

13 Jan, 2016 2 commits
- Remove duplicated calls to git_mwindow_close · d4e4f272
  P.S.V.R committed 9 years ago
  
- Make packfile_unpack_compressed a private API · b644e223
  P.S.V.R committed 9 years ago
  
31 Jul, 2015 1 commit

Remove extra semicolon outside of a function · c369b379

Without this change, compiling with gcc and `-pedantic` generates the warning:
ISO C does not allow extra ‘;’ outside of a function.

committed 9 years ago

10 Jun, 2015 1 commit

pack: use git_buf when building the index name · 878293f7

The way we currently do it depends on the subtlety of strlen vs sizeof
and the fact that .pack is one longer than .idx. Let's use a git_buf so
we can express the manipulation we want much more clearly.

committed 9 years ago
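
To make the suffix manipulation concrete, here is a sketch in plain C that checks for the `.pack` extension and swaps it for `.idx`, with no reliance on the two suffix lengths. It uses `snprintf` as a stand-in and does not reproduce the `git_buf` calls the commit uses; `index_name_from_pack` is a hypothetical name.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Build "<base>.idx" from "<base>.pack"; -1 on bad input or a short buffer. */
static int index_name_from_pack(char *out, size_t out_size, const char *pack_path)
{
    static const char pack_ext[] = ".pack";
    const size_t ext_len = sizeof(pack_ext) - 1;
    size_t path_len, base_len;
    int written;

    assert(out && pack_path);

    path_len = strlen(pack_path);
    if (path_len < ext_len ||
        strcmp(pack_path + path_len - ext_len, pack_ext) != 0)
        return -1;

    /* Copy everything before the extension, then append the new one. */
    base_len = path_len - ext_len;
    written = snprintf(out, out_size, "%.*s.idx", (int)base_len, pack_path);
    if (written < 0 || (size_t)written >= out_size)
        return -1;
    return 0;
}
```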

22 May, 2015 1 commit

indexer: don't look for the index we're creating · 38c10ecd

When creating an index, know that we do not have an index for
our own packfile, preventing some unnecessary file opens and
error reporting.

committed 9 years ago

11 Mar, 2015 1 commit

Reorder some khash declarations · b63b76e0

Keep the definitions in the headers, while putting the declarations in
the C files. Putting the function definitions in headers causes
them to be duplicated if you include two headers with them.

committed 9 years ago

15 Feb, 2015 1 commit

Fix race in git_packfile_unpack. · 8588cb0c

Increment refcount of newly added cache entries just like existing
entries looked up from the cache. Otherwise the new entry can be
evicted from the cache and destroyed while it's still in use.

committed 9 years ago

13 Feb, 2015 2 commits

Make our overflow check look more like gcc/clang's · f1453c59

Make our overflow checking look more like gcc and clang's, so that
we can substitute it out with the compiler intrinsics on platforms
that support them.  This means dropping the ability to pass `NULL` as
an out parameter.

As a result, the macros get updated to reflect this as well.

committed 9 years ago
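
A sketch of what "looking like the compiler's" check can mean: the helper returns true on overflow and writes the result through a non-`NULL` out pointer, so `__builtin_add_overflow` and `__builtin_mul_overflow` can be dropped in where available. The `toy_` names and the version test are assumptions for this example, not the macros the commit introduces.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__GNUC__) && __GNUC__ >= 5
/* The builtins take (a, b, &result) and return true on overflow. */
# define toy_add_sizet_overflow(out, a, b) __builtin_add_overflow(a, b, out)
# define toy_mul_sizet_overflow(out, a, b) __builtin_mul_overflow(a, b, out)
#else
/* Portable fallback with the same return convention. */
static bool toy_add_sizet_overflow(size_t *out, size_t a, size_t b)
{
    assert(out);
    if (SIZE_MAX - a < b)
        return true;
    *out = a + b;
    return false;
}

static bool toy_mul_sizet_overflow(size_t *out, size_t a, size_t b)
{
    assert(out);
    if (a != 0 && b > SIZE_MAX / a)
        return true;
    *out = a * b;
    return false;
}
#endif
```

Note that the two branches differ in what `*out` holds after an overflow (the builtins store the wrapped value, the fallback leaves it untouched), so callers should only read it when the function returns false.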

allocations: test for overflow of requested size · 392702ee

Introduce some helper macros to test integer overflow from arithmetic
and set error message appropriately.

Edward Thomson committed 9 years ago

29 Dec, 2014 1 commit
- Plug some leaks · 6f73e026
  Jacques Germishuys committed 10 years ago
  
21 Nov, 2014 1 commit
- Fix for misleading "missing delta bases" error - Fix #2721. · ec7e680c
  Ravindra Patel committed 10 years ago
  
27 Oct, 2014 1 commit
- Removed some useless variable assignments · ea66215d
  Pierre-Olivier Latour committed 10 years ago
  
26 Sep, 2014 1 commit
- Silence uninitialized warning · e640a77c
  Jacques Germishuys committed 10 years ago
  
02 Sep, 2014 1 commit
- Several CppCat warnings fixed · 5cd81bb3
  Arkady Shapkin committed 10 years ago
  
26 Aug, 2014 1 commit

pack: return the correct final offset · b3d3459f

The callers of git_packfile_unpack() expect the obj_offset argument to
be set to the beginning of the next object. We were mistakenly returning
the offset of the object's data, which causes the CRC function to
try to use the wrong offset.

Set obj_offset to curpos instead of elem->offset to point to the next
element and bring back expected behaviour.

committed 10 years ago

25 Jun, 2014 1 commit

pack: free the new pack struct if we fail to insert · 5e0f47c3

If we fail to insert the packfile in the map, make sure to free it.

This makes the free function only attempt to remove its mwindows from
the global list if we have opened the packfile to avoid accessing the
list unlocked.

committed 10 years ago

23 Jun, 2014 1 commit

Share packs across repository instances · b3b66c57

Opening the same repository multiple times will currently open the same
file multiple times, as well as map the same region of the file multiple
times. This is not necessary, as the packfile data is immutable.

Instead of opening and closing packfiles directly, introduce an
indirection and allocate packfiles globally. This does mean locking on
each packfile open, but we already use this lock for the global mwindow
list so it doesn't introduce a new contention point.

committed 10 years ago

15 May, 2014 1 commit

pack: init the cache on packfile alloc · 649214be

When running multithreaded, it is not enough to check for the offmap
allocation. Move the call to cache_init() to packfile allocation so we
can be sure it is always allocated free of races.

This fixes #2355.

committed 10 years ago

13 May, 2014 3 commits
- pack: don't forget to cache the base object · c968ce2c
  
  The base object is a good cache candidate, so we shouldn't forget to add
  it to the cache.
  
  Carlos Martín Nieto committed 10 years ago
  
- pack: use stack allocation for smaller delta chains · 15bcced2
  
  This avoids allocating the array on the heap for relatively small
  chains. The expected performance increase is sadly not really
  noticeable.
  
  Carlos Martín Nieto committed 10 years ago
  
- pack: expose a cached delta base directly · a3ffbf23
  
  Instead of going through a special entry in the chain, let's pass it as
  an output parameter.
  
  Carlos Martín Nieto committed 10 years ago
09 May, 2014 7 commits

pack: simplify delta chain code · 9dbd150f

The switch makes the loop somewhat unwieldy. Let's assume it's fine and
perform the check when we're accessing the data.

This makes our code look a lot more like git's.

committed 10 years ago

pack: preallocate a 64-element chain · b2559f47

Dependency chains are often large and require a few
reallocations. Allocate a 64-element chain before doing anything else to
avoid allocations during the loop.

This value comes from the stack-allocated one git uses. We still
allocate this on the heap, but it does help performance a little bit.

committed 10 years ago

pack: make sure not to leak the dep chain · e6d10c58
Carlos Martín Nieto committed 10 years ago

pack: use a cache for delta bases when unpacking · a332e91c

Bring back the use of the delta base cache for unpacking objects. When
generating the delta chain, we stop when we find a delta base in the
pack's cache and use that as the starting point.

committed 10 years ago

pack: unpack using a loop · 2acdf4b8

We currently make use of recursive function calls to unpack an object,
resolving the deltas as we come back down the chain. This means that we
have unbounded stack growth as we look up objects in a pack.

This is now done in two steps: first we figure out what the dependency
chain is by looking up the delta bases until we reach a non-delta
object, pushing the information we need onto a stack and then we pop
from that stack and apply the deltas until there are no more left.

This version of the code does not make use of the delta base cache so it
is slower than what's in the mainline. A later commit will reintroduce
it.

committed 10 years ago
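
The two-step approach can be shown on a toy model. In the sketch below an "object" is either a full value or a "delta" that adds a number to its base; real delta application is far more involved, and every name here is hypothetical. The chain starts in a 64-element stack array and moves to the heap only if it grows past that, which loosely mirrors the sizing discussed in the neighbouring commits.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

#define SMALL_CHAIN 64

/* Toy object: base < 0 marks a full object; otherwise payload is a "delta". */
struct toy_obj {
    int base;     /* index of the delta base, or -1 */
    long payload; /* full value, or amount to add to the base */
};

/* Resolve objs[id] without recursion. Returns 0 on success, -1 on error. */
static int toy_unpack(long *out, const struct toy_obj *objs, size_t count, size_t id)
{
    size_t small[SMALL_CHAIN], *chain = small, *grown;
    size_t used = 0, cap = SMALL_CHAIN;
    long value;
    int error = -1;

    assert(out && objs);

    /* Step 1: walk towards the base, pushing each delta onto the chain. */
    for (;;) {
        if (id >= count || used > count)
            goto done; /* dangling base or a cycle */
        if (objs[id].base < 0)
            break;
        if (used == cap) {
            /* Chain outgrew its storage: copy it into a larger heap array. */
            grown = malloc(cap * 2 * sizeof(*grown));
            if (grown == NULL)
                goto done;
            memcpy(grown, chain, used * sizeof(*grown));
            if (chain != small)
                free(chain);
            chain = grown;
            cap *= 2;
        }
        chain[used++] = id;
        id = (size_t)objs[id].base;
    }

    /* Step 2: start from the full object and pop deltas until none are left. */
    value = objs[id].payload;
    while (used > 0)
        value += objs[chain[--used]].payload;

    *out = value;
    error = 0;
done:
    if (chain != small)
        free(chain);
    return error;
}
```

Stack usage stays constant no matter how long the delta chain is, which is the property the recursive version lacked.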

pack: do not repeat the same error message four times · ae081739

Repeating this error message makes it harder to find out where we
actually are finding the error, and they don't really describe what
we're trying to do.

committed 10 years ago

pack: remove misleading comment · 86d5810b
Carlos Martín Nieto committed 10 years ago

23 Jan, 2014 1 commit
- Drop parsing pack filename SHA1 part, no one cares about the filename · 8610487c
  Linquize committed 10 years ago
  
11 Dec, 2013 4 commits

One more rename/cleanup for callback err functions · 26c1cb91
Russell Belfer committed 11 years ago

Some callback error check style cleanups · c7b3e1b3

I find this easier to read...

Russell Belfer committed 11 years ago

Remove converting user error to GIT_EUSER · 25e0b157

This changes the behavior of callbacks so that the callback error
code is not converted into GIT_EUSER and instead we propagate the
return value through to the caller.  Instead of using the
giterr_capture and giterr_restore functions, we now rely on all
functions to pass back the return value from a callback.

To avoid having a return value with no error message, the user
can call the public giterr_set_str or some such function to set
an error message.  There is a new helper 'giterr_set_callback'
that functions can invoke after making a callback which ensures
that some error message was set in case the callback did not set
one.

In places where the sign of the callback return value is
meaningful (e.g. positive to skip, negative to abort), only the
negative values are returned back to the caller, obviously, since
the other values allow for continuing the loop.

The hardest parts of this were in the checkout code where positive
return values were overloaded as meaningful values for checkout.
I fixed this by adding an output parameter to many of the internal
checkout functions and removing the overload.  This added some
code, but it is probably a better implementation.

There is some funkiness in the network code where user provided
callbacks could be returning a positive or a negative value and
we want to rely on that to cancel the loop.  There are still a
couple places where a user error might get turned into GIT_EUSER
there, I think, though none exercised by the tests.

committed 11 years ago
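
The convention described here (negative aborts and is passed through unchanged, zero or positive continues) might look like the following in a generic iteration helper. `toy_foreach` and `sum_until_zero` are invented for this sketch and do not correspond to a specific library function; error-message handling is left out.

```c
#include <assert.h>
#include <stddef.h>

typedef int (*toy_foreach_cb)(int item, void *payload);

/*
 * Call cb for each item. A negative return aborts the loop and is handed
 * back to the caller as-is; zero or a positive value lets the loop continue.
 */
static int toy_foreach(const int *items, size_t count,
                       toy_foreach_cb cb, void *payload)
{
    size_t i;
    int error;

    assert(items && cb);

    for (i = 0; i < count; i++) {
        error = cb(items[i], payload);
        if (error < 0)
            return error; /* the callback's own code, not a generic one */
    }
    return 0;
}

/* Example callback: sum even items, skip odd ones, abort with -42 on zero. */
static int sum_until_zero(int item, void *payload)
{
    long *sum = payload;

    if (item == 0)
        return -42;
    if (item % 2)
        return 1; /* positive: skip this item, keep going */
    *sum += item;
    return 0;
}
```

A caller can then tell its own `-42` apart from any error the helper itself might produce, which was not possible while every callback failure was collapsed into a single code.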

Further EUSER and error propagation fixes · dab89f9b

This continues auditing all the places where GIT_EUSER is being
returned and making sure to clear any existing error using the
new giterr_user_cancel helper.  As a result, places that relied
on intercepting GIT_EUSER but having the old error preserved also
needed to be cleaned up to correctly stash and then retrieve the
actual error.

Additionally, as I encountered places where error codes were not
being propagated correctly, I tried to fix them up.  A number of
those fixes are included in this commit as well.

committed 11 years ago
