Contract Spec: Experiment with compressed or optimized form of XDR #415

leighmcculloch · 2022-08-11T00:08:13Z

We use XDR for the contract spec, but it has the downsides that all XDR has, lots of unused space due to small integers, small valued enums, and padding.

Take the following contract spec as an example:

[0, 0, 0, 0, 0, 0, 0, 3, 97, 100, 100, 0, 0, 0, 0, 2, 0, 0, 0, 2, 0, 0, 0, 2, 0, 0, 0, 1, 0, 0, 0, 2]

Horizon uses a simplistic compressed form of XDR for reducing significant memory usage. It reduces some values such as integers, enums, etc to 1 byte instead of 4 on the assumption they will never escape the limits of 1 byte. The above example employing the same or similar approach would become:

[3, 97, 100, 100, 2, 2, 2, 1, 2]

There are other schemes for how to variably encode integers that would likely be better since they would allow us to make less assumptions about the contents and simply encode them using an alternative scheme.

This has huge drawbacks in that we'd be generating non-standard XDR, or rather we'd be generating something that isn't XDR at all, but the benefits probably outweigh the downside of that.

An alternative to this is for us to find an entirely different format for the contract spec that is optimized for size better than XDR is, although that would likely require us to add an entirely new toolset, rather than modifying our existing toolset. Modifying our existing XDR toolset with our own extension would probably make it easier to apply the same pattern to Stellar XDR elsewhere as well.

jonjove · 2022-08-11T14:07:39Z

An alternative would be to compress the XDR with a known and relatively simple compression algorithm like LZ77. We've also discussed compressing WASM blobs (eg. when creating a contract) on the wire with LZ77, although I can't find the discussion of that right now. I'm going to open an issue to track that too, and reference this.

leighmcculloch · 2023-05-01T20:55:00Z

@graydon It'd be good if we could still do this. Simply gzipping the contract spec would probably reduce things. Wdyt?

graydon · 2023-05-01T22:30:07Z

I don't think it's worth bothering at this level. 1 contract wasm is shared among N contract instances, each instance stores M contract-data entries. M*N is way larger than the 1 wasm contract. Moreover there's just not that much to win on a single wasm: if I look at our example contracts, our largest example is soroban_liquidity_pool_contract.wasm which is 11422 bytes, of which 560 bytes are the meta section. lz4 takes it down to 333 bytes, deflate/gzip takes it to 223 bytes, but .. either way we're talking about maybe winning 2-3% on the wasm blob which is itself the smallest piece of the storage puzzle.

I'd be modestly supportive of, say, applying lz4 to all ledger entries, just to minimize load on the BL; but I think it's too intrusive to specify this at a protocol level, it'd break a ton of stuff and would also be double-compressing when storing long term (archives already gzip buckets). I think the way we'd do this (if we did) would be to teach @SirTyson's new BucketDB layer to store a new flavor of buckets as indexed lz4 frames rather than indexed bucket entries. Basically imitating what we'd get if we were on a filesystem with transparent lz4 compression (several such filesystems exist but they're not what you'll get on your stock linux VM typically running core).

leighmcculloch · 2023-05-02T04:15:37Z

👍🏻 All great points, I agree, not worth doing.

This is also not something that is urgent to look into before the first release. It's easy for us to add new spec formats, the way meta/specs are stored in the WASM is designed to support that.

Closing.

leighmcculloch · 2023-12-01T09:10:14Z

I did two quick hacks to see if some of the ideas I'd had about this were worth slipping in to the release.

For both I used the token interface as a guide, which has a contract spec that is over 5336 bytes today with doc comments, and 1004 bytes without doc comments.

Compress contract specs with gzip and store in contractspecv0gzip #1178 – Easy to impl, very low effort.

Gzipping reduced 5336 to 3000s.

Gzipping without doc comments reduced 1004 to 963.
Quick hack experiment with removing padding in XDR for contract specs xdrgen#187 – Hard to impl, high effort, would be unlikely to actually do this, but a quick hack to see if it works didn't take long.

Using the padless format reduced 5336 to 5139.

Using the padless format without doc comments reduced 1004 to 826.

Both underwhelming. The token contract has a large number of functions. For contracts with less functions it is even more underwhelming.

If writing long docs on functions was desirable, gzipping would be worth it, but I also expect folks will write brief docs, or no docs and document elsewhere. Of course, doing nothing here might cause people to never write docs, but I don't think 3000 vs 5000 will make a difference to that decision.

I don't think it's worth bothering at this level.

@graydon you are still right 😄.

jonjove mentioned this issue Aug 11, 2022

CAP-0047 / CAP-0052: Should we compress WASM in transactions/operations? stellar/stellar-protocol#1292

Closed

MonsieurNicolas mentioned this issue Jan 18, 2023

Soroban: should we support compressed transactions? stellar/stellar-core#3645

Open

leighmcculloch closed this as not planned Won't fix, can't repro, duplicate, stale May 2, 2023

leighmcculloch reopened this Dec 1, 2023

leighmcculloch self-assigned this Dec 1, 2023

leighmcculloch mentioned this issue Dec 1, 2023

Compress contract specs with gzip and store in contractspecv0gzip #1178

Closed

leighmcculloch closed this as not planned Won't fix, can't repro, duplicate, stale Dec 1, 2023

leighmcculloch removed their assignment Dec 1, 2023

leighmcculloch reopened this Dec 1, 2023

leighmcculloch self-assigned this Dec 1, 2023

leighmcculloch mentioned this issue Dec 1, 2023

Quick hack experiment with removing padding in XDR for contract specs stellar/xdrgen#187

Closed

leighmcculloch changed the title ~~Contract Spec: Use compressed optimized form of XDR~~ Contract Spec: Experiment with compressed or optimized form of XDR Dec 1, 2023

leighmcculloch closed this as completed Dec 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contract Spec: Experiment with compressed or optimized form of XDR #415

Contract Spec: Experiment with compressed or optimized form of XDR #415

leighmcculloch commented Aug 11, 2022 •

edited

Loading

jonjove commented Aug 11, 2022

leighmcculloch commented May 1, 2023

graydon commented May 1, 2023

leighmcculloch commented May 2, 2023 •

edited

Loading

leighmcculloch commented Dec 1, 2023 •

edited

Loading

Contract Spec: Experiment with compressed or optimized form of XDR #415

Contract Spec: Experiment with compressed or optimized form of XDR #415

Comments

leighmcculloch commented Aug 11, 2022 • edited Loading

jonjove commented Aug 11, 2022

leighmcculloch commented May 1, 2023

graydon commented May 1, 2023

leighmcculloch commented May 2, 2023 • edited Loading

leighmcculloch commented Dec 1, 2023 • edited Loading

leighmcculloch commented Aug 11, 2022 •

edited

Loading

leighmcculloch commented May 2, 2023 •

edited

Loading

leighmcculloch commented Dec 1, 2023 •

edited

Loading