Hashing Performance Considerations (Allocations) #9

solonovamax · 2025-03-04T16:10:00Z

While implementing the two PRs that I just recently made, I noticed that hashing a ByteArray via ByteArray.hash(HasherFactory) some unnecessary allocations are performed, such as allocating a new Hasher every time.

In performance-critical code this is undesirable.
Since the HasherFactory is guaranteed to always invoke .digest() which finalizes the hash value, perhaps it would be best to pool the Hashers.

If the pool is empty, a new hasher is created via the val create: () -> Hasher property, however if the pool has a hasher in it, then it can be used instead.
When a hasher is used, it will be removed from the pool.
After digest() is called, reset() will always be called before then returning the hasher to the pool.
The pool could have a 'reasonable' upper cap on the number of entries, and when attempting to insert any entries while already at the cap, they will instead just be discarded.
The pool would only be local to the HasherFactory, and any external users of it would have no awareness of it. operator fun invoke(): Hasher will continue to always return a new Hasher.

The text was updated successfully, but these errors were encountered:

soywiz · 2025-03-04T17:24:29Z

HasherFactory.digest methods are guaranteed to end calling the digest. It could also call the .reset method if required.
HasherFactory also exposes an invoke method that creates a new Hasher, that shouldn't be Pooled.

A private Pool implementation could be used like allocHasher { digest() }. It would require to be Thread-Safe, so atomicfu or per platform code would be needed.

In any case, for performance-critical code, you can still construct a Hasher and do the reset on that code, right?

solonovamax changed the title ~~Hashing Performance Considerations~~ Hashing Performance Considerations (Allocations) Mar 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hashing Performance Considerations (Allocations) #9

Hashing Performance Considerations (Allocations) #9

solonovamax commented Mar 4, 2025

soywiz commented Mar 4, 2025

Hashing Performance Considerations (Allocations) #9

Hashing Performance Considerations (Allocations) #9

Comments

solonovamax commented Mar 4, 2025

soywiz commented Mar 4, 2025