GH-106485: "Un-materialize" `dict`s in `LOAD_ATTR_WITH_HINT` #106496

brandtbucher · 2023-07-06T23:06:21Z

There's a failure path in the specialized bytecode that is often hit by objects that have a materialized __dict__, but probably don't need it anymore.

I'm running the benchmarks and gathering stats to see how promising this approach is.

Issue: Reduce the number of materialized instance __dict__s #106485

markshannon · 2023-07-07T10:51:08Z

~~Do you have stats for this PR?~~

Stats: https://github.com/faster-cpython/ideas/blob/main/stats/pystats-2023-07-07-brandtbucher-0ab8274.md

markshannon · 2023-07-07T10:52:31Z

There a few scenarios that we should consider. Here are the two I'm concerned about:

One or more objects have their __dict__s accessed, then an attribute accessed. Alternately
Many objects have their __dict__s accessed (e.g. by copy or pickle), then attributes are accessed repeatedly

In case 1, we will repeatedly materialize and dematerialize the __dict__. Hopefully this case will be rare, so the performance impact will be acceptable.

It is case 2 that matters, IMO. We need to keep the relevant LOAD_ATTRs specialized to LOAD_ATTR_INSTANCE_VALUES and at the same time dematerialize the __dict__s when we can.

That suggests to me that dematerialization should occur in the specializer and, more importantly, in the deopt path of LOAD_ATTR_INSTANCE_VALUES.

In LOAD_ATTR_INSTANCE_VALUES
we could replace
DEOPT_IF(!_PyDictOrValues_IsValues(dorv), LOAD_ATTR);
with
DEOPT_IF(!_PyDictOrValues_IsValues(dorv), LOAD_ATTR_DEMATERIALIZE)
and eliminate LOAD_ATTR_WITH_HINT altogether.

brandtbucher · 2023-07-07T17:53:16Z

According to the stats comparison, the number of __dict__ materializations "on request" increased from 3.7 million to 3.9 million, but the number of dict "un-materializations" is 3.7 million.

So this is incredibly effective, but the results do suggest that some __dict__s are "thrashing" back and forth in the mypy benchmark, which got 17% slower and pulled the (otherwise boring) results down to 0.6% slower.

(Sorry, it looks like there aren't public links for these results.)

brandtbucher · 2023-07-07T17:58:09Z

In case 1, we will repeatedly materialize and dematerialize the __dict__. Hopefully this case will be rare, so the performance impact will be acceptable.

See my comment above: the numbers suggest that the mypy benchmark does this, with quite painful results.

It is case 2 that matters, IMO. We need to keep the relevant LOAD_ATTRs specialized to LOAD_ATTR_INSTANCE_VALUES and at the same time dematerialize the __dict__s when we can.

That suggests to me that dematerialization should occur in the specializer and, more importantly, in the deopt path of LOAD_ATTR_INSTANCE_VALUES.

I think this should stay out of the specializer, since that runs infrequently and only sees the first instance of a class at a given location. I'll try the LOAD_ATTR_INSTANCE_VALUE flavor, though.

brandtbucher · 2023-07-07T19:49:32Z

Something about my merge messed up the diff, I think...

brandtbucher · 2023-07-07T23:21:30Z

#106539 is an alternative to this, using LOAD_ATTR_INSTANCE_VALUE.

brandtbucher · 2023-07-14T23:04:48Z

Closing in favor of #106539.

brandtbucher added 5 commits June 22, 2023 13:41

"Un-materialize" __dict__s if possible

a4e456f

Add stats

716cc5a

Catch up with main

b9ec16f

Add comment

c5f2067

Catch up with main

0ab8274

brandtbucher added performance Performance or resource usage interpreter-core (Objects, Python, Grammar, and Parser dirs) labels Jul 6, 2023

brandtbucher self-assigned this Jul 6, 2023

bedevere-bot mentioned this pull request Jul 6, 2023

Reduce the number of materialized instance __dict__s #106485

Closed

brandtbucher requested a review from markshannon July 6, 2023 23:10

Catch up with main

c3d076b

brandtbucher closed this Jul 7, 2023

brandtbucher reopened this Jul 7, 2023

brandtbucher added 2 commits July 7, 2023 13:08

Catch up with main (again)

ec1dac5

DEOPT_IF -> assert

552b966

brandtbucher changed the title ~~GH-106485: "Un-materialize" __dict__s if possible~~ GH-106485: "Un-materialize" __dict__s in LOAD_ATTR_WITH_HINT Jul 7, 2023

brandtbucher mentioned this pull request Jul 7, 2023

GH-106485: Dematerialize instance dictionaries when possible #106539

Merged

Catch up with main

825a700

brandtbucher closed this Jul 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

GH-106485: "Un-materialize" `dict`s in `LOAD_ATTR_WITH_HINT` #106496

GH-106485: "Un-materialize" `dict`s in `LOAD_ATTR_WITH_HINT` #106496

Uh oh!

brandtbucher commented Jul 6, 2023 •

edited by bedevere-bot

Loading

Uh oh!

markshannon commented Jul 7, 2023 •

edited

Loading

Uh oh!

markshannon commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 7, 2023 •

edited

Loading

Uh oh!

brandtbucher commented Jul 7, 2023 •

edited

Loading

Uh oh!

brandtbucher commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 14, 2023

Uh oh!

Uh oh!

Uh oh!

GH-106485: "Un-materialize" __dict__s in LOAD_ATTR_WITH_HINT #106496

GH-106485: "Un-materialize" __dict__s in LOAD_ATTR_WITH_HINT #106496

Uh oh!

Conversation

brandtbucher commented Jul 6, 2023 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markshannon commented Jul 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markshannon commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brandtbucher commented Jul 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brandtbucher commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 7, 2023

Uh oh!

brandtbucher commented Jul 14, 2023

Uh oh!

Uh oh!

GH-106485: "Un-materialize" `dict`s in `LOAD_ATTR_WITH_HINT` #106496

GH-106485: "Un-materialize" `dict`s in `LOAD_ATTR_WITH_HINT` #106496

brandtbucher commented Jul 6, 2023 •

edited by bedevere-bot

Loading

markshannon commented Jul 7, 2023 •

edited

Loading

brandtbucher commented Jul 7, 2023 •

edited

Loading

brandtbucher commented Jul 7, 2023 •

edited

Loading