Improve discrimination of database keys #3867

jobh · 2024-01-30T09:37:13Z

We don't (and can't) guarantee that our database keys are unique. If two test methods have identical keys, the consequences are limited: We replay arbitrary examples instead of previously-failing ones. This is a low-priority issue, but recorded here as a reminder.

While the downsides are limited, they do exist: a slight slowdown plus loss of the database benefits. Hence, we should try hard (within reason) to actually generate unique-yet-stable database keys.

Some ideas are:

Mixing in __qualname__ is cheap and simple. Hypothesis Seed not reproducing "max allowable size" HealthCheck #3446 (comment)

One common failure mode in our own tests is inner tests used in multiple contexts, either by parametrized strategies or inside a helper function (tests.common.debug).

For the parametrized strategies, we might mix in the @given arguments (if we recognize the need - see Clean up use of example in tests #3865 (comment)).
Plus, mix in __qualname__ because inner tests are often called... inner.
This might not be enough for the helper functions though, as they typically also take an asserted predicate as input — for those, maybe a @database_key decorator where we can mix in the predicate as well? Do note, we already set database=None in at least some of these, which fixes the slowdown issue.

The text was updated successfully, but these errors were encountered:

Zac-HD · 2024-01-30T10:25:43Z

I agree that we should pick up any low-hanging improvements here, and this list looks pretty good to me.

Specific to the last point, we have a magic attribute used to distinguish @pytest.mark.parametrize()d tests, which could also be used here. Just assign whatever bytestring you like, and it'll be hashed into the database key along with everything else, e.g.:

hypothesis/hypothesis-python/src/_hypothesis_pytestplugin.py

Lines 297 to 299 in 7b63483

    
           # Give every parametrized test invocation a unique database key 
        
           key = item.nodeid.encode() 
        
           item.obj.hypothesis.inner_test._hypothesis_internal_add_digest = key

jobh · 2024-01-30T18:49:33Z

Hmm, I think we might add to the list

Pick up _hypothesis_internal_add_digest from the currently executing test function instead of the function actually passed to given.

otherwise we'd duplicate keys for f.x.

@pytest.mark.parametrize("v", [1, 2])
def test_something(v):

    @given(st.integers(v))
    def inner(x):
        assert x >= v

    inner()

since the attribute is added to test_something and not inner. At that point, it might be more straightforward to just store the add_digest as a global/threadlocal rather than as an attribute.

[edit] ...actually, we could mix in currently executing nodeid regardless of parametrization... and call it a day. Too pytest specific?

Zac-HD · 2024-01-31T08:20:32Z

[edit] ...actually, we could mix in currently executing nodeid regardless of parametrization... and call it a day. Too pytest specific?

Too specific - the problem is that if you execute a particular test function manually or via pytest, or a method with unittest or pytest, we want to have the same database key in either case. So mixing in the nodeid is fine when it's a parametrized test because there's a solid benefit and we're unlikely to execute it without pytest, but I'd prefer to avoid doing that unconditionally.

jobh · 2024-01-31T10:09:40Z

Understood, thanks! I have just one more question:

We don't use the strategy definition in our db key, and it looks intentional (e.g., _clean_source in get_digest and ignoring specifier in find()). This leads to collisions in find (naturally), and also f.x. test_attrs_inference_builds which is defined by the same name but slightly different strategy in two test modules.

Why is this? Is it beneficial to have key stability across changes in strategy?

Zac-HD · 2024-01-31T11:11:04Z

It's actually pretty hard to derive a stable-across-runs bytestring from an arbitrary strategy; for example it's easy to end up with a repr that includes some object's memory address, or less often something like the current time or date or machine name or OS.

We strip the decorators from source code before hashing it so that adding or removing @settings() and @example() decorators doesn't affect the database key.

jobh added enhancement it's not broken, but we want it to be better performance go faster! use less memory! labels Jan 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve discrimination of database keys #3867

Improve discrimination of database keys #3867

jobh commented Jan 30, 2024 •

edited

Loading

Zac-HD commented Jan 30, 2024

jobh commented Jan 30, 2024 •

edited

Loading

Zac-HD commented Jan 31, 2024

jobh commented Jan 31, 2024 •

edited

Loading

Zac-HD commented Jan 31, 2024

Improve discrimination of database keys #3867

Improve discrimination of database keys #3867

Comments

jobh commented Jan 30, 2024 • edited Loading

Zac-HD commented Jan 30, 2024

jobh commented Jan 30, 2024 • edited Loading

Zac-HD commented Jan 31, 2024

jobh commented Jan 31, 2024 • edited Loading

Zac-HD commented Jan 31, 2024

jobh commented Jan 30, 2024 •

edited

Loading

jobh commented Jan 30, 2024 •

edited

Loading

jobh commented Jan 31, 2024 •

edited

Loading