Commits · a1ef995dc03379fb1f5151b5d98d16644218c95e · lvzhengyang / git2

25 Jan, 2019 1 commit

blob: validate that blob sizes fit in a size_t · c6cac733

Our blob size is a `git_off_t`, which is a signed 64 bit int. This may
be erroneously negative or larger than `SIZE_MAX`. Ensure that the blob
size fits into a `size_t` before casting.

committed 6 years ago

c6cac733 Browse File

22 Jan, 2019 1 commit
- git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
  Edward Thomson committed 6 years ago
  f673e232 Browse File
01 Dec, 2018 1 commit
- object_type: use new enumeration names · 168fe39b
```
Use the new object_type enumeration names within the codebase.
```
  Edward Thomson committed 6 years ago
  168fe39b Browse File
22 Jun, 2018 2 commits

blob: implement function to parse raw data · 9ac79ecc

Currently, parsing objects is strictly tied to having an ODB object
available. This makes it hard to parse an object when all that is
available is its raw object and size. Furthermore, hacking around that
limitation by directly creating an ODB structure either on stack or on
heap does not really work that well due to ODB objects being reference
counted and then automatically free'd when reaching a reference count of
zero.

In some occasions parsing raw objects without touching the ODB
is actually recuired, though. One use case is for example object
verification, where we want to assure that an object is valid before
inserting it into the ODB or writing it into the git repository.

Asa first step towards that, introduce a distinction between raw and ODB
objects for blobs. Creation of ODB objects stays the same by simply
using `git_blob__parse`, but a new function `git_blob__parse_raw` has
been added that creates a blob from a pair of data and size. By setting
a new flag inside of the blob, we can now distinguish whether it is a
raw or ODB object now and treat it accordingly in several places.

Note that the blob data passed in is not being copied. Because of that,
callers need to make sure to keep it alive during the blob's life time.
This is being used to avoid unnecessarily increasing the memory
footprint when parsing largish blobs.

committed 6 years ago

9ac79ecc Browse File

blob: use getters to get raw blob content and size · bbbe8441

Going forward, we will have to change how blob sizes are calculated
based on whether the blob is a cahed object part of the ODB or not. In
order to not have to distinguish between those two object types
repeatedly when accessing the blob's data or size, encapsulate all
existing direct uses of those fields by instead using
`git_blob_rawcontent` and `git_blob_rawsize`.

committed 6 years ago

bbbe8441 Browse File

10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
  ecf4f33a Browse File
03 Jul, 2017 1 commit

Make sure to always include "common.h" first · 0c7f49dd

Next to including several files, our "common.h" header also declares
various macros which are then used throughout the project. As such, we
have to make sure to always include this file first in all
implementation files. Otherwise, we might encounter problems or even
silent behavioural differences due to macros or defines not being
defined as they should be. So in fact, our header and implementation
files should make sure to always include "common.h" first.

This commit does so by establishing a common include pattern. Header
files inside of "src" will now always include "common.h" as its first
other file, separated by a newline from all the other includes to make
it stand out as special. There are two cases for the implementation
files. If they do have a matching header file, they will always include
this one first, leading to "common.h" being transitively included as
first file. If they do not have a matching header file, they instead
include "common.h" as first file themselves.

This fixes the outlined problems and will become our standard practice
for header and source files inside of the "src/" from now on.

committed 7 years ago

0c7f49dd Browse File

13 Feb, 2017 1 commit

repository: use `git_repository_item_path` · c5f3da96

The recent introduction of the commondir variable of a repository
requires callers to distinguish whether their files are part of
the dot-git directory or the common directory shared between
multpile worktrees. In order to take the burden from callers and
unify knowledge on which files reside where, the
`git_repository_item_path` function has been introduced which
encapsulate this knowledge.

Modify most existing callers of `git_repository_path` to use
`git_repository_item_path` instead, thus making them implicitly
aware of the common directory.

committed 8 years ago

c5f3da96 Browse File

29 Dec, 2016 1 commit

giterr_set: consistent error messages · 909d5494

Error messages should be sentence fragments, and therefore:

1. Should not begin with a capital letter,
2. Should not conclude with punctuation, and
3. Should not end a sentence and begin a new one

committed 8 years ago

909d5494 Browse File

22 Mar, 2016 2 commits

blob: remove _fromchunks() · 6669e3e8

The callback mechanism makes it awkward to write data from an IO
source; move to `_fromstream()` which lets the caller remain in control,
in the same vein as we prefer iterators over foreach callbacks.

committed 8 years ago

6669e3e8 Browse File

blob: introduce creating a blob by writing into a stream · 0a5c6028

The pair of `git_blob_create_frombuffer()` and
`git_blob_create_frombuffer_commit()` is meant to replace
`git_blob_create_fromchunks()` by providing a way for a user to write a
new blob when they want filtering or they do not know the size.

This approach allows the caller to retain control over when to add data
to this buffer and a more natural fit into higher-level language's own
stream abstractions instead of having to handle IO wait in the callback.

The in-memory buffer size of 2MB is chosen somewhat arbitrarily to be a
round multiple of usual page sizes and a value where most blobs seem
likely to be either going to be way below or way over that size. It's
also a round number of pages.

This implementation re-uses the helper we have from `_fromchunks()` so
we end up writing everything to disk, but hopefully more efficiently
than with a default filebuf. A later optimisation can be to avoid
writing the in-memory contents to disk, with some extra complexity.

committed 8 years ago

0a5c6028 Browse File

12 Jul, 2015 1 commit

blob: fail to create a blob from a dir with EDIRECTORY · 0d98af09

This also affects `git_index_add_bypath()` by providing a better error
message and a specific error code when a directory is passed.

committed 9 years ago

0d98af09 Browse File

13 May, 2015 1 commit

odb: make the writestream's size a git_off_t · 77b339f7

Restricting files to size_t is a silly limitation. The loose backend
writes to a file directly, so there is no issue in using 63 bits for the
size.

We still assume that the header is going to fit in 64 bytes, which does
mean quite a bit smaller files due to the run-length encoding, but it's
still a much larger size than you would want Git to handle.

committed 9 years ago

77b339f7 Browse File

11 May, 2015 1 commit
- centralizing all IO buffer size values · 7dd22538
  J Wyman committed 9 years ago
  
  7dd22538 Browse File
19 Feb, 2015 2 commits

git_filter_opt_t -> git_filter_flag_t · 795eaccd
```
For consistency with the rest of the library, where an opt is an
options *structure*.
```
Edward Thomson committed 10 years ago
795eaccd Browse File

buffer: introduce git_buf_attach_notowned · d4cf1675

Provide a convenience function that creates a buffer that can be provided
to callers but will not be freed via `git_buf_free`, so the buffer
creator maintains the allocation lifecycle of the buffer's contents.

committed 10 years ago

d4cf1675 Browse File

16 May, 2014 1 commit
- Increase binary detection len to 8k · d0f00de4
  Russell Belfer committed 10 years ago
  
  d0f00de4 Browse File
06 May, 2014 1 commit

Add filter options and ALLOW_UNSAFE · 5269008c

Diff and status do not want core.safecrlf to actually raise an
error regardless of the setting, so this extends the filter API
with an additional options flags parameter and adds a flag so that
filters can be applied with GIT_FILTER_OPT_ALLOW_UNSAFE, indicating
that unsafe filter application should be downgraded from a failure
to a warning.

committed 10 years ago

5269008c Browse File

03 Apr, 2014 1 commit
- Const correctness! · 3b4ba278
  Jacques Germishuys committed 10 years ago
  
  3b4ba278 Browse File
30 Jan, 2014 1 commit
- Some missing oid to id renames · 5572d2b8
  Russell Belfer committed 11 years ago
  
  5572d2b8 Browse File
08 Jan, 2014 1 commit
- Handle git_buf's from users more liberally · 6adcaab7
  Edward Thomson committed 11 years ago
  
  6adcaab7 Browse File
11 Dec, 2013 1 commit

Update git_blob_create_fromchunks callback behavr · 19853bdd

The callback to supply data chunks could return a negative value
to stop creation of the blob, but we were neither using GIT_EUSER
nor propagating the return value. This makes things use the new
behavior of returning the negative value back to the user.

committed 11 years ago

19853bdd Browse File

05 Nov, 2013 1 commit
- move mode_t to filebuf_open instead of _commit · 1d3a8aeb
  Edward Thomson committed 11 years ago
  
  1d3a8aeb Browse File
17 Sep, 2013 5 commits

Merge git_buf and git_buffer · a9f51e43

This makes the git_buf struct that was used internally into an
externally available structure and eliminates the git_buffer.

As part of that, some of the special cases that arose with the
externally used git_buffer were blended into the git_buf, such as
being careful about git_buf objects that may have a NULL ptr and
allowing for bufs with a valid ptr and size but zero asize as a
way of referring to externally owned data.

committed 11 years ago

a9f51e43 Browse File

Add ident filter · 4b11f25a

This adds the ident filter (that knows how to replace $Id$) and
tweaks the filter APIs and code so that git_filter_source objects
actually have the updated OID of the object being filtered when
it is a known value.

committed 11 years ago

4b11f25a Browse File

Extend public filter api with filter lists · 2a7d224f

This moves the git_filter_list into the public API so that users
can create, apply, and dispose of filter lists.  This allows more
granular application of filters to user data outside of libgit2
internals.

This also converts all the internal usage of filters to the public
APIs along with a few small tweaks to make it easier to use the
public git_buffer stuff alongside the internal git_buf.

committed 11 years ago

2a7d224f Browse File

Create public filter object and use it · 85d54812

This creates include/sys/filter.h with a basic definition of a
git_filter and then converts the internal code to use it.  There
are related internal objects (git_filter_list) that we will want
to publish at some point, but this is a first step.

committed 11 years ago

85d54812 Browse File

Start of filter API + git_blob_filtered_content · 0cf77103

This begins the process of exposing git_filter objects to the
public API.  This includes:

* new public type and API for `git_buffer` through which an
  allocated buffer can be passed to the user
* new API `git_blob_filtered_content`
* make the git_filter type and GIT_FILTER_TO_... constants public

committed 11 years ago

0cf77103 Browse File

15 Aug, 2013 1 commit

odb: wrap the stream reading and writing functions · 376e6c9f

This is in preparation for moving the hashing to the frontend, which
requires us to handle the incoming data before passing it to the
backend's stream.

committed 11 years ago

376e6c9f Browse File

26 Jul, 2013 1 commit
- Fix some warnings · 8dd8aa48
  Russell Belfer committed 11 years ago
  
  8dd8aa48 Browse File
25 Jul, 2013 1 commit

Fix rename detection to use actual blob size · a16e4172

The size data in the index may not reflect the actual size of the
blob data from the ODB when content filtering comes into play.
This commit fixes rename detection to use the actual blob size when
calculating data signatures instead of the value from the index.

Because of a misunderstanding on my part, I first converted the
git_index_add_bypath API to use the post-filtered blob data size
in creating the index entry.  I backed that change out, but I
kept the overall refactoring of that routine and the new internal
git_blob__create_from_paths API because it eliminates an extra
stat() call from the code that adds a file to the index.

The existing tests actually cover this code path, at least when
running on Windows, so at this point I'm not adding new tests to
cover the changes.

committed 11 years ago

a16e4172 Browse File

10 Jun, 2013 1 commit

Reorganize diff and add basic diff driver · 114f5a6c

This is a significant reorganization of the diff code to break it
into a set of more clearly distinct files and to document the new
organization.  Hopefully this will make the diff code easier to
understand and to extend.

This adds a new `git_diff_driver` object that looks of diff driver
information from the attributes and the config so that things like
function content in diff headers can be provided.  The full driver
spec is not implemented in the commit - this is focused on the
reorganization of the code and putting the driver hooks in place.

This also removes a few #includes from src/repository.h that were
overbroad, but as a result required extra #includes in a variety
of places since including src/repository.h no longer results in
pulling in the whole world.

committed 11 years ago

114f5a6c Browse File

30 Apr, 2013 2 commits
- object: Explicitly define helper API methods for all obj types · 0b726701
  Vicent Marti committed 11 years ago
  
  0b726701 Browse File
- Some cleanups · 203d5b0e
```
Removed useless prototype and renamed object typecast functions
declaration macro.
```
  Russell Belfer committed 11 years ago
  203d5b0e Browse File
29 Apr, 2013 1 commit

Standardize cast versions of git_object accessors · d7761102

This removes the GIT_INLINE versions of the simple git_object
accessors and standardizes them with a helper macro in src/object.h
to build the function bodies.

committed 11 years ago

d7761102 Browse File

22 Apr, 2013 4 commits

Simplify object table parse functions · 3f27127d
```
This unifies the object parse functions into one signature that
takes an odb_object.
```
Russell Belfer committed 11 years ago
3f27127d Browse File

Add callback to git_objects_table · 78606263

This adds create and free callback to the git_objects_table so
that more of the creation and destruction of objects can be table
driven instead of using switch statements.  This also makes the
semantics of certain object creation functions consistent so that
we can make better use of function pointers.  This also fixes a
theoretical error case where an object allocation fails and we
end up storing NULL into the cache.

committed 11 years ago

78606263 Browse File

Use git_odb_object_data/_size whereever possible · badd85a6
```
This uses the odb object accessors so we can change the internals
more easily...
```
Russell Belfer committed 11 years ago
badd85a6 Browse File
What has science done. · 8842c75f
Vicent Marti committed 11 years ago

8842c75f Browse File

21 Apr, 2013 1 commit

Move odb_backend implementors stuff into git2/sys · 83cc70d9

This moves some of the odb_backend stuff that is related to the
internals of an odb_backend implementation into include/git2/sys.

Some of the stuff related to streaming I left in include/git2
because it seemed like it would be reasonably needed by a normal
user who wanted to stream objects into and out of the ODB.

Also, I added APIs for traversing the list of backends so that
some of the tests would not need to access ODB internals.

committed 11 years ago

83cc70d9 Browse File