Switch serialization #1166

ilblackdragon · 2019-08-13T23:39:27Z

Unfortunately, protos are not deterministically serialized across languages and platforms.
We are also not using the main benefits of protos, e.g. upgradability and backward compatibility: if the transaction or block proto changes, all the responsible nodes must still compute the same state transition, which means they must run the same code.

Bincode is also known to be faster than protos. We added CBOR for comparison as well:
https://gist.github.com/ilblackdragon/88ab377bf0da7ff721af80cb8a11fe28

Generally, CBOR has support across many languages and a proper standard. Bincode is more specific to Rust and still doesn't have a formal specification.

frol · 2019-08-14T09:35:28Z

Here are some more benchmarks: erickt/rust-serialization-benchmarks#7

FlatBuffers do quite well. I recall we have had an issue with bincode performance.

ilblackdragon · 2019-08-14T20:01:51Z

Notes on protobuf not being deterministic:

FlatBuffers are not deterministic:
https://groups.google.com/forum/#!msg/flatbuffers/v2RkM3KB1Qw/emQDRTgmCQAJ

ilblackdragon · 2019-08-14T20:02:41Z

Bincode is a very basic serialization format, but at the same time it has performance issues due to the visitor pattern inside serde.

ilblackdragon · 2019-08-14T20:08:01Z

CBOR is a bit slower than Bincode and is self-describing, which we don't really need (and which adds extra wire bytes).

An alternative is to just use a simple serialization scheme:

 u8 | u16 | u32 | u64 | u128 -> write little endian
 [u8; size] -> for _ in 0..size write byte
 Vec<T> -> len() as u32 + for _ in 0..len() write T
 struct -> for each field -> write field

We already use this in store/trie, where it gave a 2x improvement over bincode.

This is similar to the Simple Serialization suggested by Vitalik here: https://github.com/ethereum/beacon_chain/blob/master/ssz/ssz.py
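As a rough illustration, the four rules listed above could be written in Rust along these lines. This is only a sketch: the `Encode` trait and the `Transfer` struct are hypothetical names made up for the example, not the code used in store/trie or in BORsh.

```rust
/// Hypothetical trait for this sketch; not an actual nearcore or BORsh API.
trait Encode {
    fn encode(&self, out: &mut Vec<u8>);
}

// u8 | u16 | u32 | u64 | u128 -> write little endian
macro_rules! impl_encode_int {
    ($($t:ty),*) => {$(
        impl Encode for $t {
            fn encode(&self, out: &mut Vec<u8>) {
                out.extend_from_slice(&self.to_le_bytes());
            }
        }
    )*};
}
impl_encode_int!(u8, u16, u32, u64, u128);

// [u8; size] -> write the bytes as they are, with no length prefix
impl<const N: usize> Encode for [u8; N] {
    fn encode(&self, out: &mut Vec<u8>) {
        out.extend_from_slice(self);
    }
}

// Vec<T> -> len() as u32, then every element in order
impl<T: Encode> Encode for Vec<T> {
    fn encode(&self, out: &mut Vec<u8>) {
        assert!(self.len() <= u32::MAX as usize, "length does not fit in u32");
        (self.len() as u32).encode(out);
        for item in self {
            item.encode(out);
        }
    }
}

/// Hypothetical struct, used only to show the "for each field" rule.
struct Transfer {
    nonce: u64,
    receiver: [u8; 4],
    amounts: Vec<u16>,
}

// struct -> write each field in declaration order
impl Encode for Transfer {
    fn encode(&self, out: &mut Vec<u8>) {
        self.nonce.encode(out);
        self.receiver.encode(out);
        self.amounts.encode(out);
    }
}
```

Encoding `Transfer { nonce: 1, receiver: [0xAA, 0xBB, 0xCC, 0xDD], amounts: vec![0x0102, 3] }` this way yields 20 bytes: 8 for the nonce, 4 for the fixed array, a 4-byte length prefix, and 2 bytes for each `u16`.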

vgrichina · 2019-08-14T20:25:33Z

Here are some more benchmarks: erickt/rust-serialization-benchmarks#7

FlatBuffers do quite well. I recall we have had an issue with bincode performance.

FlatBuffers aren't deterministic:
https://groups.google.com/forum/#!msg/flatbuffers/v2RkM3KB1Qw/emQDRTgmCQAJ

frol · 2019-08-15T18:55:42Z

Just keep in mind that the deserialization should be secure (e.g. if there is a field storing the length of an array, we should check that the number is sane and that the array would fit into memory).
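One way to do such a check, sketched here under the assumption of a u32 little-endian length prefix followed by raw bytes: compare the claimed length against the bytes actually left in the input before allocating anything. `read_bytes` and `DecodeError` are hypothetical names for this example.

```rust
#[derive(Debug, PartialEq)]
enum DecodeError {
    UnexpectedEof,
    LengthTooLarge,
}

/// Reads a u32 little-endian length prefix followed by that many bytes.
/// Returns the decoded bytes and the unread remainder of the input.
fn read_bytes(input: &[u8]) -> Result<(Vec<u8>, &[u8]), DecodeError> {
    if input.len() < 4 {
        return Err(DecodeError::UnexpectedEof);
    }
    let (prefix, rest) = input.split_at(4);
    let len = u32::from_le_bytes([prefix[0], prefix[1], prefix[2], prefix[3]]) as usize;

    // The claimed length can never legitimately exceed the bytes that remain,
    // so reject it here, before any allocation sized by untrusted input.
    if len > rest.len() {
        return Err(DecodeError::LengthTooLarge);
    }
    let (data, rest) = rest.split_at(len);
    Ok((data.to_vec(), rest))
}
```

With this check, a 5-byte message whose prefix claims 0xFFFFFFFF bytes is rejected immediately instead of triggering a 4 GiB allocation.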

maxzaver · 2019-08-15T20:26:19Z

Performance improvement ideas:

Speed up serialization
Add size_of to the Serializable trait so that we can compute the size and pre-allocate the byte array before serializing the object into it.

Speed up deserialization
Use repr(C) (or rather repr(packed), but this is currently UB, see rust-lang/rust#27060) for structs/enums to make the memory layout deterministic. Then arrange all fields in the struct so that primitive types and types composed strictly of primitive types go first and everything else goes last. Deserialize the primitive types and the user-defined types composed of them all at once by mapping the memory from the byte array directly onto the struct, then deserialize everything else one field at a time.
This removes the need to allocate objects for primitive types just for them to be copied into the struct fields. Further reading:
https://doc.rust-lang.org/reference/type-layout.html#the-c-representation
https://doc.rust-lang.org/nomicon/other-reprs.html
https://doc.rust-lang.org/std/index.html#primitives
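The pre-allocation idea under "Speed up serialization" might look roughly like the sketch below. The `Serializable` and `size_of` names come from the comment above; the method signatures, the `to_bytes` helper, and the `Header` struct are assumptions made for the example.

```rust
/// Sketch of a Serializable trait with a size hint (signatures are assumed).
trait Serializable {
    /// Exact number of bytes that `write` will append.
    fn size_of(&self) -> usize;

    /// Appends the serialized form of `self` to `out`.
    fn write(&self, out: &mut Vec<u8>);

    /// Allocates the output buffer once, using the size computed up front.
    fn to_bytes(&self) -> Vec<u8> {
        let mut out = Vec::with_capacity(self.size_of());
        self.write(&mut out);
        out
    }
}

/// Hypothetical struct used only for illustration.
struct Header {
    height: u64,
    hashes: Vec<[u8; 32]>,
}

impl Serializable for Header {
    fn size_of(&self) -> usize {
        // 8 bytes for height, 4 bytes for the u32 length prefix,
        // then 32 bytes per hash.
        8 + 4 + 32 * self.hashes.len()
    }

    fn write(&self, out: &mut Vec<u8>) {
        out.extend_from_slice(&self.height.to_le_bytes());
        out.extend_from_slice(&(self.hashes.len() as u32).to_le_bytes());
        for hash in &self.hashes {
            out.extend_from_slice(hash);
        }
    }
}
```

The trade-off is that `size_of` and `write` must be kept in sync by hand (or generated together by a derive macro); a mismatch only costs a reallocation, not correctness.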

frol · 2019-08-16T18:51:02Z

It seems that speedy is a project very similar in its ideas to BORsh. The benchmarks are also promising.

maxzaver · 2019-08-16T19:12:40Z

It seems like speedy has a different set of priorities than us. Speedy optimizes for speed above all, while our serializer optimizes for consistency of binary representation, safety, and only then speed.

As a result, our code does not contain unsafe blocks, while speedy heavily utilizes unsafe and, even more so, methods that toy with undefined behavior, such as mem::uninitialized and constructing slices from raw memory.

ilblackdragon · 2019-08-20T06:25:05Z

Started BORsh & implemented it - https://github.com/nearprotocol/borsh

Moving details of the spec from near/nearcore#1166 to here.

ilblackdragon mentioned this issue Aug 14, 2019

Remove protos, replace with custom byte serialization #1170

Merged

weekly-digest bot mentioned this issue Aug 16, 2019

Weekly Digest (9 August, 2019 - 16 August, 2019) #1178

Closed

ilblackdragon closed this as completed Aug 20, 2019

ilblackdragon added a commit to near/borsh that referenced this issue Aug 20, 2019

Adding initial specification #4

a04cb87

Moving details of the spec from near/nearcore#1166 to here.

volovyks pushed a commit to near/borsh-js that referenced this issue Nov 30, 2020

Adding initial specification #4

676eac3

Moving details of the spec from near/nearcore#1166 to here.

macalinao mentioned this issue Jun 3, 2021

Switch from Borsh to Protobuf coral-xyz/anchor#357

Open

zfedoran mentioned this issue Jan 9, 2024

Feature: Request Login with Payment code-payments/code-sdk#14

Merged

romac mentioned this issue Jan 22, 2025

code: Compute signatures over a deterministic and canonical representation of the data informalsystems/malachite#798

Open
