Commits · 8749655745d06b3be01222d512218dc16a359731 · lvzhengyang / git2

12 Feb, 2023 1 commit

object: lookup sha256 objects · fe2ee3a0

This is much of the plumbing for the object database to support SHA256,
and for objects to be able to parse SHA256 versions of themselves.

committed 2 years ago

fe2ee3a0 Browse File

23 Feb, 2022 2 commits

refactor: make util an object library · c3b7ace9

Instead of simply including the utility files directly, make them a
cmake object library for easy reusability between other projects within
libgit2.

Now the top-level `src` is responsible for platform selection, while the
next-level `libgit2` and `util` configurations are responsible for
identifying what objects they include.

committed 3 years ago

c3b7ace9 Browse File

refactor: `src` is now `src/libgit2` · ef4ab298
Edward Thomson committed 3 years ago

ef4ab298 Browse File

10 Dec, 2021 1 commit

blob: identify binary content · 4591e76a

Introduce `git_blob_data_is_binary` to examine a blob's data, instead of
the blob itself.  A replacement for `git_buf_is_binary`.

committed 3 years ago

4591e76a Browse Directory

09 Nov, 2021 1 commit

path: separate git-specific path functions from util · 95117d47

Introduce `git_fs_path`, which operates on generic filesystem paths.
`git_path` will be kept for only git-specific path functionality (for
example, checking for `.git` in a path).

committed 3 years ago

95117d47 Browse Directory

17 Oct, 2021 1 commit

str: introduce `git_str` for internal, `git_buf` is external · f0e693b1

libgit2 has two distinct requirements that were previously solved by
`git_buf`.  We require:

1. A general purpose string class that provides a number of utility APIs
   for manipulating data (eg, concatenating, truncating, etc).
2. A structure that we can use to return strings to callers that they
   can take ownership of.

By using a single class (`git_buf`) for both of these purposes, we have
confused the API to the point that refactorings are difficult and
reasoning about correctness is also difficult.

Move the utility class `git_buf` to be called `git_str`: this represents
its general purpose, as an internal string buffer class.  The name also
is an homage to Junio Hamano ("gitstr").

The public API remains `git_buf`, and has a much smaller footprint.  It
is generally only used as an "out" param with strict requirements that
follow the documentation.  (Exceptions exist for some legacy APIs to
avoid breaking callers unnecessarily.)

Utility functions exist to convert a user-specified `git_buf` to a
`git_str` so that we can call internal functions, then converting it
back again.

committed 3 years ago

f0e693b1 Browse Directory

25 Sep, 2021 1 commit

blob: improve `create_from_disk` attribute lookups · 848bd009

Resolve absolute paths to be working directory relative when looking up
attributes.  Importantly, now we will _never_ pass an absolute path down
to attribute lookup functions.

committed 3 years ago

848bd009 Browse Directory

21 Sep, 2021 1 commit

filter: use a `git_oid` in filter options, not a pointer · 90656858

Using a `git_oid *` in filter options was a mistake; it is a deviation
from our typical pattern, and callers in some languages that GC may need
very special treatment in order to pass both an options structure and a
pointer outside of it.

committed 3 years ago

90656858 Browse Directory

14 Sep, 2021 1 commit

If longpaths is true and filters are enabled, pass git_repository through the… · 26bf94c0

If longpaths is true and filters are enabled, pass git_repository through the filtering code to ensure the cached longpath setting is returned.

Fixes: #6054

committed 3 years ago

26bf94c0 Browse Directory

22 Jul, 2021 1 commit
- filter: introduce GIT_BLOB_FILTER_ATTRIBUTES_FROM_COMMIT · 1439b9ff
```
Provide a mechanism to filter using attribute data from a specific
commit (making use of `GIT_ATTR_CHECK_INCLUDE_COMMIT`).
```
  Edward Thomson committed 3 years ago
  1439b9ff Browse Directory
11 May, 2021 1 commit

buf: remove internal `git_buf_text` namespace · d525e063

The `git_buf_text` namespace is unnecessary and strange.  Remove it,
just keep the functions prefixed with `git_buf`.

committed 3 years ago

d525e063 Browse Directory

28 Apr, 2021 1 commit
- use git_repository_workdir_path to generate paths · 578aeba9
```
Use `git_repository_workdir_path` to generate workdir paths since it
will validate the length.
```
  Edward Thomson committed 3 years ago
  578aeba9 Browse Directory
05 Jan, 2021 2 commits

blob: fix name of `GIT_BLOB_FILTER_ATTRIBUTES_FROM_HEAD` · 855f2998

`GIT_BLOB_FILTER_ATTTRIBUTES_FROM_HEAD` is misspelled, it should be
`GIT_BLOB_FILTER_ATTRIBUTES_FROM_HEAD`, and it would be if it were not
for the MacBook Pro keyboard and my inattentiveness.

committed 4 years ago

855f2998 Browse Directory

blob: add git_blob_filter_options_init · d9c15387

The `git_blob_filter_options_init` function should be included, to allow
callers in FFI environments to let us initialize an options structure
for them.

committed 4 years ago

d9c15387 Browse Directory

25 Nov, 2020 2 commits
- buffer: git_buf_sanitize should return a value · cb4bfbc9
```
`git_buf_sanitize` is called with user-input, and wants to sanity-check
that input.  Allow it to return a value if the input was malformed in a
way that we cannot cope.
```
  Edward Thomson committed 4 years ago
  cb4bfbc9 Browse Directory
- blob: use GIT_ASSERT · abb7a7e4
  Edward Thomson committed 4 years ago
  
  abb7a7e4 Browse Directory
09 Jun, 2020 1 commit

tree-wide: do not compile deprecated functions with hard deprecation · c6184f0c

When compiling libgit2 with -DDEPRECATE_HARD, we add a preprocessor
definition `GIT_DEPRECATE_HARD` which causes the "git2/deprecated.h"
header to be empty. As a result, no function declarations are made
available to callers, but the implementations are still available to
link against. This has the problem that function declarations also
aren't visible to the implementations, meaning that the symbol's
visibility will not be set up correctly. As a result, the resulting
library may not expose those deprecated symbols at all on some platforms
and thus cause linking errors.

Fix the issue by conditionally compiling deprecated functions, only.
While it becomes impossible to link against such a library in case one
uses deprecated functions, distributors of libgit2 aren't expected to
pass -DDEPRECATE_HARD anyway. Instead, users of libgit2 should manually
define GIT_DEPRECATE_HARD to hide deprecated functions. Using "real"
hard deprecation still makes sense in the context of CI to test we don't
use deprecated symbols ourselves and in case a dependant uses libgit2 in
a vendored way and knows it won't ever use any of the deprecated symbols
anyway.

committed 4 years ago

c6184f0c Browse Directory

22 Nov, 2019 1 commit
- blob: use `git_object_size_t` for object size · 4334b177
```
Instead of using a signed type (`off_t`) use a new `git_object_size_t`
for the sizes of objects.
```
  Edward Thomson committed 5 years ago
  4334b177 Browse Directory
11 Aug, 2019 4 commits

blob: optionally read attributes from repository · fba3bf79

When `GIT_BLOB_FILTER_ATTTRIBUTES_FROM_HEAD` is passed to
`git_blob_filter`, read attributes from `gitattributes` files that
are checked in to the repository at the HEAD revision.  This passes
the flag `GIT_FILTER_ATTRIBUTES_FROM_HEAD` to the filter functions.

committed 5 years ago

fba3bf79 Browse Directory

blob: allow blob filtering to ignore system gitattributes · a5392eae

Introduce `GIT_BLOB_FILTER_NO_SYSTEM_ATTRIBUTES`, which tells
`git_blob_filter` to ignore the system-wide attributes file, usually
`/etc/gitattributes`.

This simply passes the appropriate flag to the attribute loading code.

committed 5 years ago

a5392eae Browse Directory

blob: deprecate `git_blob_filtered_content` · fa1a4c77
```
Users should now use `git_blob_filter`.
```
Edward Thomson committed 5 years ago
fa1a4c77 Browse Directory

blob: introduce git_blob_filter · a32ab076

Provide a function to filter blobs that allows for more functionality
than the existing `git_blob_filtered_content` function.

committed 5 years ago

a32ab076 Browse Directory

15 Jun, 2019 1 commit

blob: add underscore to `from` functions · 08f39208

The majority of functions are named `from_something` (with an
underscore) instead of `fromsomething`.  Update the blob functions for
consistency with the rest of the library.

committed 5 years ago

08f39208 Browse Directory

25 Jan, 2019 1 commit

blob: validate that blob sizes fit in a size_t · c6cac733

Our blob size is a `git_off_t`, which is a signed 64 bit int. This may
be erroneously negative or larger than `SIZE_MAX`. Ensure that the blob
size fits into a `size_t` before casting.

committed 6 years ago

c6cac733 Browse Directory

22 Jan, 2019 1 commit
- git_error: use new names in internal APIs and usage · f673e232
```
Move to the `git_error` name in the internal API for error-related
functions.
```
  Edward Thomson committed 6 years ago
  f673e232 Browse Directory
01 Dec, 2018 1 commit
- object_type: use new enumeration names · 168fe39b
```
Use the new object_type enumeration names within the codebase.
```
  Edward Thomson committed 6 years ago
  168fe39b Browse Directory
22 Jun, 2018 2 commits

blob: implement function to parse raw data · 9ac79ecc

Currently, parsing objects is strictly tied to having an ODB object
available. This makes it hard to parse an object when all that is
available is its raw object and size. Furthermore, hacking around that
limitation by directly creating an ODB structure either on stack or on
heap does not really work that well due to ODB objects being reference
counted and then automatically free'd when reaching a reference count of
zero.

In some occasions parsing raw objects without touching the ODB
is actually recuired, though. One use case is for example object
verification, where we want to assure that an object is valid before
inserting it into the ODB or writing it into the git repository.

Asa first step towards that, introduce a distinction between raw and ODB
objects for blobs. Creation of ODB objects stays the same by simply
using `git_blob__parse`, but a new function `git_blob__parse_raw` has
been added that creates a blob from a pair of data and size. By setting
a new flag inside of the blob, we can now distinguish whether it is a
raw or ODB object now and treat it accordingly in several places.

Note that the blob data passed in is not being copied. Because of that,
callers need to make sure to keep it alive during the blob's life time.
This is being used to avoid unnecessarily increasing the memory
footprint when parsing largish blobs.

committed 6 years ago

9ac79ecc Browse Directory

blob: use getters to get raw blob content and size · bbbe8441

Going forward, we will have to change how blob sizes are calculated
based on whether the blob is a cahed object part of the ODB or not. In
order to not have to distinguish between those two object types
repeatedly when accessing the blob's data or size, encapsulate all
existing direct uses of those fields by instead using
`git_blob_rawcontent` and `git_blob_rawsize`.

committed 6 years ago

bbbe8441 Browse Directory

10 Jun, 2018 1 commit
- Convert usage of `git_buf_free` to new `git_buf_dispose` · ecf4f33a
  Patrick Steinhardt committed 6 years ago
  
  ecf4f33a Browse Directory
03 Jul, 2017 1 commit

Make sure to always include "common.h" first · 0c7f49dd

Next to including several files, our "common.h" header also declares
various macros which are then used throughout the project. As such, we
have to make sure to always include this file first in all
implementation files. Otherwise, we might encounter problems or even
silent behavioural differences due to macros or defines not being
defined as they should be. So in fact, our header and implementation
files should make sure to always include "common.h" first.

This commit does so by establishing a common include pattern. Header
files inside of "src" will now always include "common.h" as its first
other file, separated by a newline from all the other includes to make
it stand out as special. There are two cases for the implementation
files. If they do have a matching header file, they will always include
this one first, leading to "common.h" being transitively included as
first file. If they do not have a matching header file, they instead
include "common.h" as first file themselves.

This fixes the outlined problems and will become our standard practice
for header and source files inside of the "src/" from now on.

committed 7 years ago

0c7f49dd Browse Directory

13 Feb, 2017 1 commit

repository: use `git_repository_item_path` · c5f3da96

The recent introduction of the commondir variable of a repository
requires callers to distinguish whether their files are part of
the dot-git directory or the common directory shared between
multpile worktrees. In order to take the burden from callers and
unify knowledge on which files reside where, the
`git_repository_item_path` function has been introduced which
encapsulate this knowledge.

Modify most existing callers of `git_repository_path` to use
`git_repository_item_path` instead, thus making them implicitly
aware of the common directory.

committed 8 years ago

c5f3da96 Browse Directory

29 Dec, 2016 1 commit

giterr_set: consistent error messages · 909d5494

Error messages should be sentence fragments, and therefore:

1. Should not begin with a capital letter,
2. Should not conclude with punctuation, and
3. Should not end a sentence and begin a new one

committed 8 years ago

909d5494 Browse Directory

22 Mar, 2016 2 commits

blob: remove _fromchunks() · 6669e3e8

The callback mechanism makes it awkward to write data from an IO
source; move to `_fromstream()` which lets the caller remain in control,
in the same vein as we prefer iterators over foreach callbacks.

committed 8 years ago

6669e3e8 Browse Directory

blob: introduce creating a blob by writing into a stream · 0a5c6028

The pair of `git_blob_create_frombuffer()` and
`git_blob_create_frombuffer_commit()` is meant to replace
`git_blob_create_fromchunks()` by providing a way for a user to write a
new blob when they want filtering or they do not know the size.

This approach allows the caller to retain control over when to add data
to this buffer and a more natural fit into higher-level language's own
stream abstractions instead of having to handle IO wait in the callback.

The in-memory buffer size of 2MB is chosen somewhat arbitrarily to be a
round multiple of usual page sizes and a value where most blobs seem
likely to be either going to be way below or way over that size. It's
also a round number of pages.

This implementation re-uses the helper we have from `_fromchunks()` so
we end up writing everything to disk, but hopefully more efficiently
than with a default filebuf. A later optimisation can be to avoid
writing the in-memory contents to disk, with some extra complexity.

committed 8 years ago

0a5c6028 Browse Directory

12 Jul, 2015 1 commit

blob: fail to create a blob from a dir with EDIRECTORY · 0d98af09

This also affects `git_index_add_bypath()` by providing a better error
message and a specific error code when a directory is passed.

committed 9 years ago

0d98af09 Browse Directory

13 May, 2015 1 commit

odb: make the writestream's size a git_off_t · 77b339f7

Restricting files to size_t is a silly limitation. The loose backend
writes to a file directly, so there is no issue in using 63 bits for the
size.

We still assume that the header is going to fit in 64 bytes, which does
mean quite a bit smaller files due to the run-length encoding, but it's
still a much larger size than you would want Git to handle.

committed 9 years ago

77b339f7 Browse Directory

11 May, 2015 1 commit
- centralizing all IO buffer size values · 7dd22538
  J Wyman committed 9 years ago
  
  7dd22538 Browse Directory
19 Feb, 2015 2 commits

git_filter_opt_t -> git_filter_flag_t · 795eaccd
```
For consistency with the rest of the library, where an opt is an
options *structure*.
```
Edward Thomson committed 10 years ago
795eaccd Browse Directory

buffer: introduce git_buf_attach_notowned · d4cf1675

Provide a convenience function that creates a buffer that can be provided
to callers but will not be freed via `git_buf_free`, so the buffer
creator maintains the allocation lifecycle of the buffer's contents.

committed 10 years ago

d4cf1675 Browse Directory

16 May, 2014 1 commit
- Increase binary detection len to 8k · d0f00de4
  Russell Belfer committed 10 years ago
  
  d0f00de4 Browse Directory