Determinism in hash tables #339

tolikzinovyev · 2019-06-25T04:07:03Z

Hello. I really like absl hash tables but non-determinism in the iteration order is not desirable for my purpose: simulations. Would you accept a patch that disables non-determinism with a compiler flag? I can imagine, it will also be useful for debugging a program.

derekmauro · 2019-06-26T15:26:37Z

This isn't something we'd accept.

Part of Abseil's philosophy is engineering at scale. In addition to making it harder to execute a hash flooding attack, another reason we randomize iteration order is that is makes it easier for use to change the underlying implementation if we need to. When we wanted to change our hash function (for example), we found thousands of unit tests that depended on iteration order. We could not just break these tests and say "sorry, you are violating Hyrum's Law" since we would have thousands of angry Google developers. So instead, we fixed the tests and implemented randomization so that next time we wanted to change something in the implementation, this would not be an issue. The need for iteration order determinism is relatively rare compared to the potential wins we get by being free to change the implementation (maybe to make the hash table faster, which could save millions of cycles at scale, for example). By giving users a knob to disable the randomness, we'd be in the same situation all over again.

tolikzinovyev · 2019-06-26T18:14:08Z

Makes sense, thank you. I have another proposal then.
We could make a boolean constant in a separate file that when flipped would disable non-determinism. So, when a user needs to debug their program, they would still need to change the absl code but only one line.

derekmauro · 2019-06-27T13:48:16Z

I think that is a better suggestion, but there are costs to keeping superfluous code around, so we generally don't do it.

tolikzinovyev · 2019-06-27T16:34:54Z

I understand.

tolikzinovyev closed this as completed Jun 27, 2019

derekmauro mentioned this issue Jul 7, 2019

Performance of flat_hash_set copy constructor #346

Closed

wrowe mentioned this issue Jun 30, 2020

Replace std::unordered_map with absl::node_hash_map? envoyproxy/envoy#11825

Closed

MaskRay mentioned this issue Jun 21, 2024

[Hashing] Use a non-deterministic seed if LLVM_ENABLE_ABI_BREAKING_CHECKS llvm/llvm-project#96282

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Determinism in hash tables #339

Determinism in hash tables #339

tolikzinovyev commented Jun 25, 2019

derekmauro commented Jun 26, 2019

Uh oh!

tolikzinovyev commented Jun 26, 2019

Uh oh!

derekmauro commented Jun 27, 2019

Uh oh!

tolikzinovyev commented Jun 27, 2019

Uh oh!

Determinism in hash tables #339

Determinism in hash tables #339

Comments

tolikzinovyev commented Jun 25, 2019

derekmauro commented Jun 26, 2019

Uh oh!

tolikzinovyev commented Jun 26, 2019

Uh oh!

derekmauro commented Jun 27, 2019

Uh oh!

tolikzinovyev commented Jun 27, 2019

Uh oh!