Add SHA256TREE: a version of SHA-256 that supports chunking #235
Conversation
Hey @EricBurnett @bergsieker @mostynb @sstriker @moroten,
This PR implements the first part. I will update PR #233 to provide the second part. It would be nice if this change could land in the near future, as that will already allow me to release changes on both the Bazel and Buildbarn side to add the new hashing algorithm. Any further changes to do actual chunking can then be limited to just Buildbarn components.
I have managed to implement this on the Buildbarn side. I will publish these changes as soon as this proposal gets accepted/merged. While working on that, I also did some benchmarking. Notice how for objects of 8 KiB or smaller, BLAKE3CONCAT is about as fast as SHA256 and SHA512. Above 8 KiB it tends to become a lot faster. This is because BLAKE3CONCAT is able to use AVX2 to hash up to eight 1 KiB chunks in parallel. On my laptop, I manage to compute BLAKE3CONCAT hashes at up to 3.7 GB/s, whereas SHA256 maxes out at 500 MB/s. SHA384 and SHA512 obviously perform equally, as they use the same algorithm; SHA384 is just truncated. These measurements are all without using any multi-threading, which would give BLAKE3CONCAT an even bigger advantage.
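For anyone who wants to reproduce rough single-stream throughput numbers like the ones above, here is a minimal Go benchmark sketch. It only measures the standard library's SHA-256; the object sizes are illustrative and the BLAKE3CONCAT side is omitted, so this is not the exact harness behind the graphs.

```go
package hashbench

import (
	"crypto/sha256"
	"fmt"
	"testing"
)

// BenchmarkSHA256 measures single-stream SHA-256 throughput for a range of
// object sizes, similar in spirit to the comparison described above. The
// chosen sizes are illustrative; swap in a BLAKE3/SHA256TREE implementation
// to compare against other digest functions.
func BenchmarkSHA256(b *testing.B) {
	for _, size := range []int{1 << 10, 8 << 10, 64 << 10, 1 << 20} {
		buf := make([]byte, size)
		b.Run(fmt.Sprintf("%dB", size), func(b *testing.B) {
			b.SetBytes(int64(size))
			for i := 0; i < b.N; i++ {
				sha256.Sum256(buf)
			}
		})
	}
}
```

Running it with `go test -bench=.` reports MB/s per object size, which makes it easy to see where a given digest function starts to pull ahead.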
While doing test builds against a Buildbarn cluster supporting BLAKE3CONCAT, also using an updated version of bb_clientd, I noticed that having colon characters in pathnames isn't always safe. For example,
So I've been doing some more benchmarking over the last couple of days. On my MacBookPro16,1 with a 2.3 GHz 8-core Intel Core i9, I see that BLAKE3CONCAT (blue) provides acceptable performance compared to SHA-256 (green). On ARM64, it's a different story. Implementing BLAKE3CONCAT in NEON and running it on a Mac Studio (model Mac13,2), I see that BLAKE3CONCAT (blue) is significantly slower than SHA-256 (green); about 10 times as slow for <8 KB objects. The reason is that the M1 CPU in the Mac Studio implements the ARMv8 cryptographic extensions, meaning that native CPU instructions are available for computing SHA-256 hashes. My assumption is that we will see similar performance characteristics on x86 systems that implement the Intel SHA extensions. So my takeaway is that BLAKE3 is optimized towards large objects, making it a bad fit for our use case: the CAS tends to contain lots of small objects as well. Because of that, I have decided to update my proposal to add a hashing algorithm that's essentially a mixture of SHA-256 and BLAKE3, named SHA256TREE. It uses SHA-256's compression function in combination with BLAKE3's tree structure. This means that we can make use of native hardware instructions where available, while still achieving a significant speedup when only generic SIMD features (such as AVX2) are available. In the worst case, SHA256TREE is (16+1+1)/16 = +12.5% slower than plain SHA-256. It is shown in orange in the graphs above.
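To make the idea concrete, below is a minimal Go sketch of a SHA-256-based tree hash in the spirit of what is described above. It is not the actual SHA256TREE specification: the chunk size, the split rule, and the parent-node encoding are assumptions for illustration, and a real tree hash also needs domain separation between leaf and parent nodes, which is omitted here.

```go
package sha256tree

import "crypto/sha256"

// chunkSize is an assumption for illustration; the real SHA256TREE
// specification fixes its own chunk size and node encodings.
const chunkSize = 1024

// hashTree hashes data as a binary tree: leaves are SHA-256 over
// fixed-size chunks, and each parent is SHA-256 over the concatenation of
// its two children's digests. This mirrors BLAKE3's tree shape while
// reusing SHA-256, so hardware SHA instructions can still be used.
func hashTree(data []byte) [sha256.Size]byte {
	if len(data) <= chunkSize {
		return sha256.Sum256(data)
	}
	// Put the largest possible complete subtree (a power-of-two number of
	// chunks) on the left, as BLAKE3 does.
	split := chunkSize
	for split*2 < len(data) {
		split *= 2
	}
	left := hashTree(data[:split])
	right := hashTree(data[split:])
	return sha256.Sum256(append(left[:], right[:]...))
}
```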
As discussed during Tuesday's meeting, the preference was to not alter the shape of
For the Bytestream methods, the plan was to still use the scheme where we prefix the hashes with
The reason we used the short
@juergbi @mostynb @EricBurnett Please let me know whether this approach is acceptable for you. If not, I will fall back to using
Google's internal build backend uses a content hash function called "PSHA2", which is a tree derivative of SHA-256. Inputs < 1 KB are hashed with plain SHA-256. Inputs >= 1 KB are striped 4 bytes at a time over 16 lanes, each lane is hashed with SHA-256, and then the lane hashes are combined. Inputs > 2 MB are split into 2 MB blocks, the blocks are hashed with SHA256x16, and then the block hashes are combined. A vectorized implementation of the SHA256x16 level is available at https://github.com/intel/isa-l_crypto/tree/master/mh_sha256. One peculiarity of the PSHA2 format is that the input length appears in the digest, and the whole-file hash for an input > 2 MB is identical to the summary hash except for this length field. (The idea was to allow CAS operations to use the input to the summary hash as a manifest of chunks, and trivially derive the name of the chunk manifest from the name of the whole file, but we haven't taken much advantage of this yet.) This means it isn't a general-purpose cryptographic hash, as there are applications where you'd want to conceal the input length. If you're contemplating something like SHA256TREE, perhaps we should publish the spec and the code for PSHA2, since it's already part of the internal branch of Bazel.
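To make the striping concrete, here is a rough Go sketch of the >= 1 KB case as described above (4-byte stripes over 16 round-robin lanes, each lane hashed with plain SHA-256). How PSHA2 actually folds the lane digests and the input length into the final digest is not spelled out here, so the combining step below is an assumption, not the published format.

```go
package psha2sketch

import (
	"crypto/sha256"
	"encoding/binary"
	"hash"
)

const (
	lanes       = 16
	stripeWidth = 4 // bytes written to a lane before moving to the next lane
)

// laneHashes stripes the input 4 bytes at a time over 16 lanes and hashes
// each lane with plain SHA-256, roughly as described for PSHA2 above.
func laneHashes(data []byte) [][]byte {
	states := make([]hash.Hash, lanes)
	for i := range states {
		states[i] = sha256.New()
	}
	for off, lane := 0, 0; off < len(data); off, lane = off+stripeWidth, (lane+1)%lanes {
		end := off + stripeWidth
		if end > len(data) {
			end = len(data)
		}
		states[lane].Write(data[off:end])
	}
	sums := make([][]byte, lanes)
	for i, s := range states {
		sums[i] = s.Sum(nil)
	}
	return sums
}

// combine is an assumption for illustration: the real PSHA2 format encodes
// the input length in the digest and defines its own way of folding the
// lane digests. Here we simply hash the concatenated lane digests plus the
// length.
func combine(laneSums [][]byte, inputLen uint64) [sha256.Size]byte {
	h := sha256.New()
	for _, s := range laneSums {
		h.Write(s)
	}
	var lenBuf [8]byte
	binary.LittleEndian.PutUint64(lenBuf[:], inputLen)
	h.Write(lenBuf[:])
	var out [sha256.Size]byte
	copy(out[:], h.Sum(nil))
	return out
}
```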
Intel's own library switches from the dedicated SHA-256 instructions to a vectorized implementation at 6 lanes. My understanding is that the SHA-256 helper instructions on the M1 have twice the throughput of SHA256RNDS2, so the gap with a generic vectorized approach is larger.
Can you use a cheaper variant for very small inputs? (That's what we did in PSHA2; the constant-factor overhead of SHA256x16 isn't worth it for messages where every lane has only one block.) I think BLAKE2bp or BLAKE3 are also reasonable choices here. Your BLAKE3CONCAT is relatively slow on the M1 because it only has a 128-bit vector unit. In Google's builds, most of the content hashing is performed on servers with AVX2 or AVX-512, and clients hash primarily small messages (source files and remote action descriptions). I would guess that server CPUs will continue to have relatively wide vector units, so I'm not sure you should expect accelerated single-stream SHA-256 to catch up. One virtue of single-stream SHA-256 in this context is that you might get a fast implementation without JNI.
Hey a6f, First of all, thanks for your insightful feedback. I find it interesting how we are all running into these issues, but have had to resort to coming up with in-house solutions. AWS Glacier also has a tree hash that is based on SHA-256. Let me respond to your message in random order.
I think this is fine for the remote execution protocol itself, as CAS objects are identified by their hash and size. Unfortunately, I have seen a fair share of tooling around REv2 that assumes that the hash itself is good enough to act as a key. Maybe we ought to protect users of the protocol from that, though I don't care strongly.
Interesting. Could you tell me more about how the 2 MB block hashes are combined? Is it done like VSO PagedHash, where the block hashes are combined sequentially? Or is some tree structure used here? For SHA256TREE I decided to use a tree-based solution, as I don't really want to commit to a certain block size up front. What constitutes a good block size tends to depend on latency, storage media, etc.
Great! I found some other sources online, but wasn't aware that Intel also published an implementation.
So there's no denying that BLAKE2bp, BLAKE3, etc. can be made to run fast on server hardware that has AVX2/AVX-512. However, for us it's also pretty important to have a hashing function that works well at desks and on workers. As we obviously use a lot of Mac hardware for that, we need something that performs well both on x86 and ARM64. I think that something based on SHA-256 is the only serious option there. With regards to single-stream vs. striped, I fully understand that the 16-way striping that PSHA2 performs is a good fit for AVX-512. My only minor concern is that this is likely a pessimisation on ARM64, where the striping needs to be undone before the data can be fed to the SHA-256 hardware instructions.
Exactly. And I think we also need to keep in mind how algorithms like these can be implemented in languages like Python, JavaScript (Node.js), etc. Those languages have good intrinsics for computing plain SHA-256, but anything else becomes problematic. Even applying/undoing striping might be expensive there.
I'm at this point seriously contemplating it. The nice thing about PSHA2 is that there's already code for Bazel readily available. Being able to reuse that would be nice. My only concerns/unknowns about PSHA2 at this point are:
If it wouldn't be too much of a hassle, could you publish a spec on PSHA2, including some test vectors? I'd be happy to take a look. It doesn't need to be 100% polished; just good enough for me to reproduce. Thanks!
Apologies for the delay. Here is the PSHA2 spec [Edit: corrected link] that a6f references (and led the creation of), as discussed in the last monthly. I also personally lean towards favouring SHA256 over adopting something from the Blake family, for reasons of it being already in wide use, getting hardware implementations, and having been strongly validated by the security community. To that end, PSHA2 could be suitable for use as-is if it meets your needs, probably with a differently-defined framing for use in the REAPI - e.g. I think we could define a PSHA2_RAW encoding that's the CHUNK_HASH value directly without the framing (the
Hey! The spec linked above doesn't seem to be public yet. I just requested access for myself, but maybe it should be opened up altogether.
Apologies, please check again - I accidentally had the wrong link originally but it should be openable now. For convenience, here's the link again.
Hi @a6f and @EricBurnett, Thanks for sending me a copy of the PSHA2 specification. Really appreciated! Even though I think the design of PSHA2 makes a lot of sense, it looks like it lacks some of the flexibility that I am looking for. To clarify, I would like to use some hashing scheme that allows decomposition at many different levels of 2^k, and gives both the client and server the freedom to pick the 2^k that works best for them. For example, for a centralised CAS it is likely sufficient to decompose large objects into ~2 MB chunks. But on a client that offers access to the CAS through a virtual file system (FUSE/NFSv4), it would be nice if objects on the client could get decomposed all the way down to the page/block device sector size, just like how modern file systems such as ZFS/Btrfs provide data integrity and deduplication. With PSHA2, we would only be capable of decomposing objects into chunks that are either 2 MiB, 128 GiB or 4 PB in size, which is far too coarse for what I'm trying to achieve. With that in mind, I would like to go ahead with this PR as is, namely to add the SHA256TREE digest function. If Google is at some point interested in adding support for PSHA2 to the Remote Execution protocol, I would be in favour, but it would not be in scope for this PR. That said, once both #235 and #236 land, it should be easier to add new algorithms going forward.
Buildbarn has invested heavily in using virtual file systems. Both on the worker and client side it's possible to lazily fault in data from the CAS. As Buildbarn implements checksum verification where needed, randomly accessing large files may be slow. To address this, this change adds support for composing and decomposing CAS objects, using newly added ConcatenateBlobs() and SplitBlobs() operations. If implemented naively (e.g., using SHA-256), these operations would not be verifiable. To rephrase: when merely given the checksums of smaller objects, there is no way to obtain that of their concatenation. This is why we suggest that these operations only be used in combination with SHA256TREE (see bazelbuild#235). With these new operations present, there is no true need to use the Bytestream protocol any longer. Writes can be performed by uploading smaller parts through BatchUpdateBlobs(), followed by calling ConcatenateBlobs(). Conversely, reads of large objects can be performed by calling SplitBlobs() and downloading individual parts through BatchReadBlobs(). For compatibility, we still permit the Bytestream protocol to be used. This is a decision we can revisit in REv3.
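To sketch why a tree digest makes operations like SplitBlobs()/ConcatenateBlobs() verifiable: if a composed object's digest is defined in terms of its children's digests, a client can check a split or a concatenation without rehashing the underlying data. The parent rule below reuses the illustrative convention from the earlier hashTree() sketch and is an assumption, not the actual SHA256TREE parent-node encoding.

```go
package sha256tree

import "crypto/sha256"

// verifySplit checks that two child digests combine to the expected parent
// digest, using the same illustrative parent rule as the hashTree() sketch
// above: SHA-256 over the concatenation of the child digests. With a plain
// flat SHA-256 digest, no such check is possible without redownloading and
// rehashing all of the data.
func verifySplit(parent, left, right [sha256.Size]byte) bool {
	combined := sha256.Sum256(append(left[:], right[:]...))
	return combined == parent
}
```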
I deliberated quite a bit about the block size for PSHA2. I ended up picking 2MB for a few reasons:
I also debated whether to use 4-byte or 64-byte lanes. It cost about 4% more to transpose the input (when using AVX2), and I expected that only slow fallback implementations would prefer 64-byte lanes. This was decided in 2018; if I had the M1 as a target, I might have chosen differently.
In PR bazelbuild#233 I proposed the addition of two new ContentAddressableStorage methods (ConcatenateBlobs and SplitBlobs) that allow one to gain random access to large CAS objects, while still providing a way to verify data integrity. As part of that change, I added a new digest function to help with that, named SHA256TREE. This PR adds just this digest function, without bringing in any support for chunking. This will be done separately, as it was requested that both these features land independently. I have also included test vectors for the SHA256TREE digest function. I have derived these by implementing three different versions in the Go programming language:
- One version that uses regular arithmetic in Go.
- One version for x86-64 that uses AVX2.
- One version for ARM64 that uses the ARMv8 cryptography extensions.
All three versions behave identically.
Looking forward to this PR as well -- both for the SHA256TREE support and for the digest_function changes. Does remote_asset.proto need changes as well to support disambiguating the digest function?
👍
That sounds like a good idea. It looks like the protocol is currently oblivious to digest functions. It also offers no way for clients/servers to negotiate a digest function (read: there is no equivalent of GetCapabilities()). How is a digest function selected right now? Is there some implicit assumption that the CAS only supports exactly one digest function, and that the Remote Asset API uses that one unconditionally? Unfortunately, I don't use this protocol myself, so I'm slightly out of the loop here. Maybe it's best if this PR is merged as is, and people interested in using the Remote Asset API have a wider discussion on this topic? I suppose the best solution would be if the Remote Asset API gained its own GetCapabilities() method, and clients interested in using both the CAS and a Remote Asset service intersected the lists of supported digest functions to pick the one they want.
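As a small sketch of that negotiation idea, assuming the Remote Asset API were to expose its own list of supported digest functions, a client could intersect the two capability lists and pick its preferred common entry. The function below uses plain strings as placeholders for the DigestFunction enum values; it is not based on any existing field in the protos.

```go
package negotiation

// intersectDigestFunctions returns the digest functions supported by both
// the CAS and a hypothetical Remote Asset capabilities response, preserving
// the client's preference order (the order of the first argument).
func intersectDigestFunctions(preferred, assetSupported []string) []string {
	supported := make(map[string]bool, len(assetSupported))
	for _, f := range assetSupported {
		supported[f] = true
	}
	var common []string
	for _, f := range preferred {
		if supported[f] {
			common = append(common, f)
		}
	}
	return common
}
```

A client would then pick the first element of the result, e.g. preferring SHA256TREE over SHA256 when both services support it.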
I'm happy to take a look at the remote_asset API once this PR is merged.
In the following PR we extended most of the REv2 *Request messages to take an explicit digest function: bazelbuild/remote-apis#235 This eliminates the ambiguity between digest functions that have the same hash length (e.g., MD5 and MURMUR3, SHA256 and SHA256TREE). This change extends the REv2 client in Bazel to set the digest_function field explicitly, so that the server does not need to use any heuristics to pick a digest function when processing requests. As we are now going to call DigestUtil.getDigestFunction() far more frequently, alter DigestUtil to precompute the value upon construction. This change was tested by letting Bazel forward all traffic through an instance of bb_clientd that was patched up to require the use of `digest_function`. Closes #17772. PiperOrigin-RevId: 520074622 Change-Id: Ie92dc368c502b9bc3fddef56a5eb0a238760f673
In PR #233 I proposed the addition of two new ContentAddressableStorage methods (ConcatenateBlobs and SplitBlobs) that allow one to gain random access to large CAS objects, while still providing a way to verify data integrity. As part of that change, I added a new digest function to help with that, named ~~BLAKE3CONCAT~~ SHA256TREE.

This PR adds just this digest function, without bringing in any support for chunking. This will be done separately, as it was requested that both these features land independently.

I have also included test vectors for the ~~BLAKE3CONCAT~~ SHA256TREE digest function. I have derived these by modifying the BLAKE3 reference implementation written in Rust, and rerunning the tool that emits the official test vectors: https://github.com/BLAKE3-team/BLAKE3/blob/master/test_vectors/test_vectors.json

Furthermore, I have been able to validate the newly obtained test vectors using a custom ~~BLAKE3CONCAT~~ SHA256TREE implementation that I have written in Go, which will become part of Buildbarn.