Chore: Remove one allocate per hash by using generics. #5829

jannotti · 2023-11-13T20:18:43Z

The way we currently hash various objects is with:

// HashRep appends the correct hashid before the message to be hashed.
func HashRep(h Hashable) []byte {
	hashid, data := h.ToBeHashed()
	return append([]byte(hashid), data...)
}

This means that every callers generally have to allocate in order to convert their argument, which might be a BlockHeader, for example, into a Hashable. (This happens transparently, of course.)

However, by writing HashRep as:

func HashRep[H Hashable](h H) []byte {
	hashid, data := h.ToBeHashed()
	return append([]byte(hashid), data...)
}

We use generics to make a HashRep for each Hashable type. So a Hashable need not be created to call it. Thus we we get one fewer allocations most of the time.

As an example, BenchmarkGenesisHash gives:

BenchmarkGenesisHash/new-10         	 5553607	       213.5 ns/op	     256 B/op	       2 allocs/op
BenchmarkGenesisHash/old-10         	 4810734	       252.2 ns/op	     400 B/op	       3 allocs/op

One fewer alloc, and a small little speedup.

Summary

Test Plan

The way we currently hash various objects is with: ``` // HashRep appends the correct hashid before the message to be hashed. func HashRep(h Hashable) []byte { hashid, data := h.ToBeHashed() return append([]byte(hashid), data...) } ``` This means that every callers generally have to allocate in order to convert their argument, which might be a `BlockHeader`, for example, into a Hashable. (This happens transparently, of course.) However, by writing HashRep as: ``` func HashRep[H Hashable](h H) []byte { hashid, data := h.ToBeHashed() return append([]byte(hashid), data...) } ``` We use generics to make a HashRep for each Hashable type. So a `Hashable` need not be created to call it. Thus we we get one fewer allocations most of the time. For this PR, I did this by writing `HashRepFast` instead, so that I could commit some benchmarks. They show one for allocation. I had to create several copies of existsing functions to make the Benchmarks work. In the real PR, I'll remove all that extra stuff.

codecov · 2023-11-13T20:42:30Z

Codecov Report

Attention: 7 lines in your changes are missing coverage. Please review.

Comparison is base (1bb78de) 55.72% compared to head (25698af) 55.71%.
Report is 7 commits behind head on master.

Files	Patch %	Lines
cmd/goal/application.go	0.00%	4 Missing ⚠️
cmd/goal/interact.go	0.00%	2 Missing ⚠️
cmd/goal/tealsign.go	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5829      +/-   ##
==========================================
- Coverage   55.72%   55.71%   -0.01%     
==========================================
  Files         476      476              
  Lines       67131    67130       -1     
==========================================
- Hits        37408    37403       -5     
- Misses      27203    27204       +1     
- Partials     2520     2523       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

algorandskiy

Could post some benchmark results into the PR description?

crypto/util.go

jannotti · 2023-11-16T21:19:41Z

Could post some benchmark results into the PR description?

Added

BenchmarkGenesisHash/new-10 5553607 213.5 ns/op 256 B/op 2 allocs/op
BenchmarkGenesisHash/old-10 4810734 252.2 ns/op 400 B/op 3 allocs/op

zeldovich

Looks like a good idea!

Another opportunity to save on allocations would be to call protocol.EncodeMsgp() instead of protocol.Encode() in ToBeHashed(). The extra allocation is coming from the check that protocol.Encode() is doing through CanMarshalMsg() to check whether obj directly implements msgp.Marshaler, or whether its msgp.Marshaler methods are promoted from some embedded struct field.

Alternatively, we could just decide that we don't have any more dangling embedded fields whose parent structs haven't gone through msgp. This used to happen when I was first incrementally adding support for msgp, but by now, everything has been msgp'ed already. So, we could change protocol.Encode() to drop that CanMarshalMsg() check and save an extra allocation.

jannotti changed the title ~~Remove one allocate per hash by using generics.~~ Chore: Remove one allocate per hash by using generics. Nov 13, 2023

jannotti self-assigned this Nov 13, 2023

jannotti added the Enhancement label Nov 13, 2023

jannotti force-pushed the hash-no-allocate branch from b28ee68 to 427c33a Compare November 15, 2023 22:06

Remove (most) of the code that exists only for benchmarking

eb94b80

jannotti force-pushed the hash-no-allocate branch from 427c33a to eb94b80 Compare November 15, 2023 22:20

jannotti marked this pull request as ready for review November 16, 2023 19:30

jannotti requested review from algorandskiy, ohill and cce November 16, 2023 19:30

algorandskiy reviewed Nov 16, 2023

View reviewed changes

crypto/util.go Outdated Show resolved Hide resolved

Move benchmark functions to single place used

25698af

algorandskiy approved these changes Nov 16, 2023

View reviewed changes

algorandskiy requested a review from zeldovich November 17, 2023 00:07

zeldovich approved these changes Nov 17, 2023

View reviewed changes

algorandskiy merged commit 06790fe into algorand:master Nov 17, 2023

This was referenced Nov 29, 2023

go-algorand 3.20.1-beta Release PR #5849

Merged

go-algorand 3.20.1-stable Release PR #5852

Merged

PhearZero pushed a commit to PhearNet/crypto that referenced this pull request Jan 17, 2025

Chore: Remove one allocate per hash by using generics (algorand#5829)

b359a5f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chore: Remove one allocate per hash by using generics. #5829

Chore: Remove one allocate per hash by using generics. #5829

jannotti commented Nov 13, 2023 •

edited

Loading

codecov bot commented Nov 13, 2023 •

edited

Loading

algorandskiy left a comment

jannotti commented Nov 16, 2023

zeldovich left a comment

Chore: Remove one allocate per hash by using generics. #5829

Chore: Remove one allocate per hash by using generics. #5829

Conversation

jannotti commented Nov 13, 2023 • edited Loading

Summary

Test Plan

codecov bot commented Nov 13, 2023 • edited Loading

Codecov Report

algorandskiy left a comment

Choose a reason for hiding this comment

jannotti commented Nov 16, 2023

zeldovich left a comment

Choose a reason for hiding this comment

jannotti commented Nov 13, 2023 •

edited

Loading

codecov bot commented Nov 13, 2023 •

edited

Loading