Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add support for sync car reading #121

Merged
merged 8 commits into from
Jan 27, 2023
Merged

feat: add support for sync car reading #121

merged 8 commits into from
Jan 27, 2023

Conversation

hugomrdias
Copy link
Member

@hugomrdias hugomrdias commented Jan 12, 2023

This PR adds a new class CarReaderSync to support syncronous car reading.

Changes:

  • Extracted some utils from the async decoder
    • decodeV2Header, decodeVarint and getMultihashLength
  • Created a new sync decoder
  • BytesReader interface has a new arg in the exactly method to seek after reading, defaults to false (previous behavior)
  • Async decoder was tweaked to used the extracted utils and the new types.
  • New types were added for buffer (sync) methods

I was hoping to be able to extract more from the async decoder than i did, but i think this is useful and does not introduce any breaking change.

The new CarReaderSync is exported in package.json.

@hugomrdias hugomrdias requested review from alanshaw and rvagg January 12, 2023 16:11
src/api.ts Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/coding.ts Outdated Show resolved Hide resolved
src/decoder-sync.js Outdated Show resolved Hide resolved
@hugomrdias hugomrdias requested review from alanshaw and Gozala January 18, 2023 15:34
Copy link
Contributor

@Gozala Gozala left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Other than interface for the sync reader everything looks good to me, although I did provide some additional feedback which you can address or disregard as you find fit.

Please add a sync interface though, or if you really disagree let's find a compromise.

src/api.ts Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/coding.ts Outdated Show resolved Hide resolved
src/decoder-sync.js Outdated Show resolved Hide resolved
src/reader-browser.js Outdated Show resolved Hide resolved
* @param {Uint8Array} bytes
* @returns {CarReader} blip blop
*/
static fromBytesSync (bytes) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it might be a good idea to align naming convention with https://github.com/ipld/js-car/blob/master/src/buffer-writer.js which is same idea but a writer interface.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what do you means ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Things like CarBufferReader as opposed to CarReaderSync, same thing for method names and stuff.

@@ -139,6 +140,22 @@ export class CarReader {
}
}

/**
*
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it should say something about the fact that no verification occurs with this reader, meaning that block that follows CID may not in fact have that CID.

this._header = header
this._blocks = blocks
this._keys = blocks.map((b) => b.cid.toString())
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we have a method to perform verification that will verify that blocks match claimed CIDs ?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Gozala this would be worthwhile but needs to be an option and should be tackled in a separate PR. Opened an issue here and it'd be great if someone could pick this off: #123

As noted there, this requires having the hashers available to do the validation, which is always a problem in JS—what do you do if someone shows up with blake2b CIDs?

Also worth having notes in the README, although pretty much all of the JS stack doesn't perform this kind of validation, so there's a bit of a caveat emptor all round on that front unfortunately.

@rvagg
Copy link
Member

rvagg commented Jan 19, 2023

I'm not sure jsdoc2readme is working properly on this anymore, npm run docs is supposed to take the jsdoc updates and insert it into the readme; if that doesn't work, you might have to do a bit of manual sync, but getting something on the README would be good for this.

As long as the 3 of you agree on the details don't let me be a blocker, other than the desire for decent docs.

@hugomrdias
Copy link
Member Author

I'm not sure jsdoc2readme is working properly on this anymore, npm run docs is supposed to take the jsdoc updates and insert it into the readme; if that doesn't work, you might have to do a bit of manual sync, but getting something on the README would be good for this.

As long as the 3 of you agree on the details don't let me be a blocker, other than the desire for decent docs.

i was able to make jsdoc2readme work readme is updated and i also added a reference to the sync reader in the usage section.

@hugomrdias hugomrdias requested a review from Gozala January 19, 2023 19:05
Copy link
Member

@alanshaw alanshaw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's so many small things here I'm not even sure I got all of them.

src/api.ts Outdated Show resolved Hide resolved
src/buffer-reader-browser.js Outdated Show resolved Hide resolved
src/buffer-reader-browser.js Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
README.md Outdated Show resolved Hide resolved
src/api.ts Outdated Show resolved Hide resolved
src/buffer-reader-browser.js Outdated Show resolved Hide resolved
src/buffer-reader-browser.js Outdated Show resolved Hide resolved
src/buffer-reader-browser.js Outdated Show resolved Hide resolved
@hugomrdias hugomrdias requested a review from alanshaw January 20, 2023 15:50
Copy link
Member

@alanshaw alanshaw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a few minor problems but I think nothing blocking...🚀

README.md Outdated
- [`CarBufferReader.fromBytes(bytes)`](#carbufferreaderfrombytesbytes)
- [`CarBufferReader.readRaw(fd, blockIndex)`](#carbufferreaderreadrawfd-blockindex)
- [License](#license)
- [Contribution](#contribution)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is just the API listing not the ToC for the whole README...

* @returns {boolean}
*/
has (key) {
return Boolean(this._cids.find((cid) => cid.equals(key)))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
return Boolean(this._cids.find((cid) => cid.equals(key)))
return this._blocks.some(b => b.cid.equals(key))

Comment on lines 98 to 99
const index = this._keys.indexOf(key.toString())
return index > -1 ? this._blocks[index] : undefined
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why not just find the block?

Suggested change
const index = this._keys.indexOf(key.toString())
return index > -1 ? this._blocks[index] : undefined
return this_blocks.find(b -> b.cid.equals(key))

this._header = header
this._blocks = blocks
this._keys = blocks.map((b) => b.cid.toString())
this._cids = this._blocks.map(b => b.cid)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure I understand the benefit of _keys and _cids, or why in has we do .equals() and in get we do a string comparison?

I guess this._cids is so we don't have to recompute the array every time .cids() it's called - but personally I'd just cache the result after the first call.

@hugomrdias
Copy link
Member Author

@rvagg seems ready to merge.

Copy link
Member

@rvagg rvagg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm, very nice work all!

@hugomrdias hugomrdias merged commit c3917aa into master Jan 27, 2023
@hugomrdias hugomrdias deleted the feat/car-sync branch January 27, 2023 07:19
github-actions bot pushed a commit that referenced this pull request Jan 27, 2023
## [5.1.0](v5.0.3...v5.1.0) (2023-01-27)

### Features

* add support for sync car reading ([#121](#121)) ([c3917aa](c3917aa))
@github-actions
Copy link

🎉 This PR is included in version 5.1.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

olizilla added a commit to storacha/w3up that referenced this pull request Feb 8, 2023
Remove sync car reader code that has moved upstream into [@ipld/car](ipld/js-car#121 (comment))

Fixes #401

License: MIT
Signed-off-by: Oli Evans <oli@protocol.ai>
alanshaw pushed a commit to storacha/w3up that referenced this pull request Mar 27, 2023
Remove sync car reader code in favour of CarBufferReader that landed
upstream in
[@ipld/car](ipld/js-car#121 (comment))

Fixes #401


❤️

License: MIT
Signed-off-by: Oli Evans <oli@protocol.ai>

---------

Signed-off-by: Oli Evans <oli@protocol.ai>
Co-authored-by: Benjamin Goering <171782+gobengo@users.noreply.github.com>
Co-authored-by: Alan Shaw <alan.shaw@protocol.ai>
gobengo added a commit to storacha/w3up that referenced this pull request Apr 11, 2023
Remove sync car reader code in favour of CarBufferReader that landed
upstream in
[@ipld/car](ipld/js-car#121 (comment))

Fixes #401


❤️

License: MIT
Signed-off-by: Oli Evans <oli@protocol.ai>

---------

Signed-off-by: Oli Evans <oli@protocol.ai>
Co-authored-by: Benjamin Goering <171782+gobengo@users.noreply.github.com>
Co-authored-by: Alan Shaw <alan.shaw@protocol.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: 🎉 Done
Development

Successfully merging this pull request may close these issues.

4 participants