-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Doc change: Remove mention of fnv
in HashMap
#77996
Conversation
…ve hasher. Signed-off-by: Tom Kaitchuck <tom.kaitchuck@emc.com>
Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @LukasKalbertodt (or someone else) soon. If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. Due to the way GitHub handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes. Please see the contribution instructions for more information. |
I'd rather not use libstd to promote hash functions that haven't seen much review, especially since aHash makes claims about DOS-resistance. |
cc @rust-lang/libs Historically we've tried to not mention external crates in std docs; I think it may make sense to drop fnv here. One of the options might be to point at some summary post, though I worry that it will be an endless treadmill of updating it. I guess one option is something like https://en.wikipedia.org/wiki/List_of_hash_functions. |
I would also prefer to link to either hash functions in general or a widely used and understood one. |
@tkaitchuck FYI I tried changing
Consider relaxing that bound, I think you shouldn't need it for constructing the hashmap, only for operations involving the elements (get/set/entry). |
Same for various other functions, |
The error message there indicates the code is using an FxHashmap which is by definition using FxHash. If you want to switch that code, you will need to update it. (If there are suggestions please open an issue on the repo rather than adding more here.) |
Or perhaps an external list of Rust crates that implement the interface? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree with the previous comments: I'd rather not mention aHash
in these docs. However, removing the mention of fnv
makes sense to me. I'm fine with either of those:
- Just removing the mention of
fnv
. - Linking Wikipedia.
- Linking to the "hash" keyword on crates.io (this includes lots of hashing libraries that cannot be used with the std hashmap; so maybe that would confuse readers.)
fnv
in HashMap as an alternative hasher to aHash
fnv
in HashMap
Signed-off-by: Tom Kaitchuck <Tom.Kaitchuck@gmail.com>
@LukasKalbertodt I have linked to the "hasher" keyword on crates.io. This avoids the problem of "hash" which contains lots of crates that don't implement the interface. Currently there are only 6 crates which have this keyword all but 1 provides an implementation of the interface. Of course many (most) of the popular alternatives do not have this keyword tagged to them, but that is easy to fix on the individual crates. |
The @jonas-schievink @Mark-Simulacrum @sfackler What do you think about linking to this keyword? |
hasher keyword seems to contain hashers, but it's... very short (6 crates) and neither ahash, fnv, or any of the other "immediate" candidates I'd reach for are in there, despite being available in general. I think the hash keyword is closer to the right one right now, but maybe the Wikipedia link is preferable. Edit: I also think that we can link to "hasher" but we should update crates first, and it creates a sort of "first mover" problem in some sense (e.g., if ahash ends up being first and only when this is merged). But maybe that's not too big a concern. |
@Mark-Simulacrum I have opened several PRs. |
Co-authored-by: Mara Bos <m-ou.se@m-ou.se>
Thanks! @bors r+ rollup |
📌 Commit 4e58483 has been approved by |
…laumeGomez Rollup of 6 pull requests Successful merges: - rust-lang#77151 (Add regression test for issue rust-lang#76042) - rust-lang#77996 (Doc change: Remove mention of `fnv` in HashMap) - rust-lang#78463 (Add type to `ConstKind::Placeholder`) - rust-lang#78984 (Rustdoc check option) - rust-lang#78985 (add dropck test for const params) - rust-lang#78996 (add explicit test for const param promotion) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup
Disclaimer: I am the author of aHash.
This changes the Rustdoc in
HashMap
from mentioning thefnv
crate to mentioning theaHash
crate, as an alternativeHasher
implementation.Why
Fnv has poor hash quality, is slow for larger keys, and does not provide dos resistance, because it is unkeyed (this can also cause other problems).
Fnv has acceptable performance for integers and has very poor performance with keys >32 bytes. This is the reason it was removed from the standard library in #37229 .
Because regardless of which dimension you value, there are better alternatives, it does not make sense for anyone to consider using
fnv
.The text mentioning
fnv
in the standard library continues to create confusion: rust-lang/hashbrown#153 rust-lang/hashbrown#9 . There are also a number of crates using it a great many of which are hashing strings (Which is when Fnv is the worst, possible, choice.)I think aHash makes the most sense to mention as an alternative because it is the most credible option (in my obviously biased opinion). It offers good performance on numbers and strings, is of high quality, and provides dos resistance. It is popular (see stats) and is the default hasher for hashbrown and dashmap which are the most popular alternative hashmaps. Finally it does not have any of the
gotcha
cases thatFxHash
suffers from. (Which is the other popular hashing option when DOS attacks are not a concern)Signed-off-by: Tom Kaitchuck tom.kaitchuck@emc.com