[Impeller] Adds the ability to specify a golden threshold #40824

gaaclarke · 2023-03-31T18:55:31Z

Pre-launch Checklist

I read the Contributor Guide and followed the process outlined there for submitting PRs.
I read the Tree Hygiene wiki page, which explains my responsibilities.
I read and followed the Flutter Style Guide and the C++, Objective-C, Java style guides.
I listed at least one issue that this PR fixes in the description above.
I added new tests to check the change I am making or feature I am adding, or Hixie said the PR is test-exempt. See testing the engine for instructions on writing and running engine tests.
I updated/added relevant documentation (doc comments with ///).
I signed the CLA.
All existing and new tests are passing.

If you need help, consider asking for advice on the #hackers-new channel on Discord.

gaaclarke · 2023-03-31T18:55:54Z

chinmaygarde

We don't want thresholds; we want to follow the Skia GPU team's guidelines on having the Gold results be tackled out of band.

gaaclarke · 2023-03-31T19:13:15Z

We don't want thresholds; we want to follow the Skia GPU team's guidelines on having the Gold results be tackled out of band.

I thought those things weren't mutually exclusive? No matter how failures are triaged, we don't want this test to be tripping up all the time, right?

flutter-dashboard · 2023-03-31T19:37:54Z

Golden file changes have been found for this pull request. Click here to view and triage (e.g. because this is an intentional change).

If you are still iterating on this change and are not ready to resolve the images on the Flutter Gold dashboard, consider marking this PR as a draft pull request above. You will still be able to view image results on the dashboard, commenting will be silenced, and the check will not try to resolve itself until marked ready for review.

Changes reported for pull request #40824 at sha bcc52d2

gaaclarke · 2023-03-31T19:56:42Z

I'm actually a bit confused by Skia Gold's dashboard: the max diff pixels is 7864, yet the number of differing pixels is reported as 24. Shouldn't this not have been flagged?

Piinks · 2023-03-31T20:14:45Z

cc @mdebbar

the max diff pixels is 7864, yet the number of differing pixels is reported as 24. Shouldn't this not have been flagged?

I am not familiar with how the engine is configured for Gold.

chinmaygarde · 2023-03-31T21:58:26Z

No matter how failures are triaged we don't want this test to be tripping up all the time, right?

The understanding is that each gold test has multiple valid variants. As long as a triager confirms that two variants are sufficiently alike, the test will pass unless an entirely new variant is detected.
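A rough sketch of the "multiple valid variants" model described above, for illustration only. The class and method names (`GoldenTest`, `triage_positive`, `check`) are hypothetical, and SHA-256 is used here purely as a stand-in digest; none of this is Gold's actual API.

```python
import hashlib


class GoldenTest:
    """Hypothetical model of one golden test with several accepted variants."""

    def __init__(self, name: str) -> None:
        self.name = name
        # Digests of every image a triager has confirmed as acceptable.
        self.accepted: set[str] = set()

    @staticmethod
    def digest(image_bytes: bytes) -> str:
        # Stand-in digest; an assumption, not necessarily what Gold uses.
        return hashlib.sha256(image_bytes).hexdigest()

    def triage_positive(self, image_bytes: bytes) -> None:
        # A triager confirms this variant is sufficiently like the others.
        self.accepted.add(self.digest(image_bytes))

    def check(self, image_bytes: bytes) -> bool:
        # Passes for any accepted variant; fails only for an entirely new one.
        return self.digest(image_bytes) in self.accepted
```

Under this model, a test with two triaged variants passes for either of them, and only an image that has never been triaged produces a failure.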

gaaclarke · 2023-03-31T22:00:03Z

The understanding is that each gold test has multiple valid variants. As long as a triager confirms that two variants are sufficiently alike, the test will pass unless an entirely new variant is detected.

Ahh yea, I've heard mention of such a feature. Who can help us with that?

mdebbar · 2023-04-03T14:18:49Z

The way the fuzzy matcher works in Gold is by doing two checks:

1. How many pixels are different between baseline image and test image? This is controlled by the differentPixelsRate parameter that you are using in this PR. So you are good here.
2. How much is each of these pixels allowed to differ from its corresponding pixel in the baseline image? This is controlled by the pixelColorDelta parameter. The default is 0 and you are overriding this in your PR.

What's happening is your image is passing the 1st condition but it's failing on the 2nd.
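The two checks described above can be sketched roughly as follows. This is an illustration under stated assumptions, not Gold's implementation: the function name `fuzzy_match`, the flat list of RGBA tuples used to represent an image, and measuring a pixel's difference as the sum of absolute per-channel differences are all assumptions. Only the parameter names differentPixelsRate and pixelColorDelta come from the comment.

```python
from typing import Sequence, Tuple

Pixel = Tuple[int, int, int, int]  # RGBA, assumed 0-255 per channel


def fuzzy_match(
    baseline: Sequence[Pixel],
    test: Sequence[Pixel],
    different_pixels_rate: float,
    pixel_color_delta: int,
) -> bool:
    """Hypothetical two-condition fuzzy comparison of two same-sized images."""
    if len(baseline) != len(test):
        return False

    # Collect the pixel pairs that are not identical.
    differing = [(b, t) for b, t in zip(baseline, test) if b != t]

    # Check 1: the share of differing pixels must stay within the allowed rate.
    count_ok = len(differing) <= different_pixels_rate * len(baseline)

    # Check 2: each differing pixel must stay within the allowed color delta.
    # Assumption: delta is the sum of absolute per-channel differences.
    delta_ok = all(
        sum(abs(x - y) for x, y in zip(b, t)) <= pixel_color_delta
        for b, t in differing
    )

    return count_ok and delta_ok
```

With a 200-pixel image where a single pixel is off by 3 in one channel, a rate of 0.01 satisfies the first check, but a color delta of 0 still fails the second, which is the situation described in the comment.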

gaaclarke · 2023-04-03T18:24:30Z

@chinmaygarde there was a lot of talk about this in different channels. I'll try to summarize it:

- Skia Gold can already work against multiple goldens.
- We want to make failures not block the roll (outlined in "The SkiaGold check should not block rolls into flutter/engine", flutter#123979).
- The number of accepted golden variations is limited to around the number of devices running.

I think we still want a tiny bit of fuzzing in addition to the other mechanisms. This PR makes all the tests pass if less than 1% of pixels differ, each by less than 4 color component deltas. (I increased the color delta for the rotated text to 40.) PTAL, let me know if you want to discuss this further and we can try to suss everything out.

chinmaygarde

I think we still want a tiny bit of fuzzing in addition to the other mechanisms...

I think that is totally reasonable. But I'd rather not have the tests get into the business of determining the deltas like in the current version of the patch.

If there is a global fuzz value you are comfortable with, let's set it in the test harness. If not, we can leave it out altogether. I'd rather not have to think about how to figure out maxDiffPixelsPercent and max_color_delta for each screenshot. That is cognitive overhead we don't want to add every time we write a test. Especially if the process is going to be semi-manual anyway.

gaaclarke · 2023-04-03T21:03:03Z

If there is a global fuzz value you are comfortable with, let's set it in the test harness.

Okay, I switched it up so the whole test_runner uses the same values across all the tests.

flutter-dashboard · 2023-04-03T22:21:30Z

Golden file changes are available for triage from new commit. Click here to view.

Changes reported for pull request #40824 at sha d0cf61d

gaaclarke · 2023-04-03T22:49:35Z

I'm a bit confused about what Skia Gold is doing. The color delta is clearly set to 8, but the Skia Gold report shows it set to zero.

Let's land this and keep an eye on it. It may be some latent state or an inability to change the fuzzy threshold after the fact; we'll see.

Piinks · 2023-04-05T21:51:20Z

This does not appear to be working as intended. The correct config for fuzzy matching is being received by Gold, but it looks like this image is just flaky. @camsim99 came across it in #40924, and looking at the digest, we can see the image has a different color dot for every commit. This means it is not producing a consistent image, and it is blocking PRs from landing.

https://flutter-engine-gold.skia.org/search?issue=40924&crs=github&patchsets=3&corpus=flutter-engine

Adds the ability to specify a golden threadshold

bcc52d2

gaaclarke requested a review from dnfield March 31, 2023 18:55

chinmaygarde suggested changes Mar 31, 2023

chinmaygarde added the e: impeller label Mar 31, 2023

flutter-dashboard bot added the will affect goldens label Mar 31, 2023

chinmaygarde assigned gaaclarke Apr 1, 2023

gaaclarke added 3 commits April 3, 2023 10:03

renamed threshold to max_diff_pixels_percent

9dfc4ea

added the pixelColorDelta field

7a28698

updated skia_gold_docs

9d5ee0b

gaaclarke mentioned this pull request Apr 3, 2023

[Impeller] Reason about flakiness in testing rendering results from Impeller pixel tests. flutter/flutter#123954

Closed

gaaclarke requested a review from chinmaygarde April 3, 2023 18:24

chinmaygarde reviewed Apr 3, 2023

gaaclarke added 2 commits April 3, 2023 13:48

made the golden fuzzy parameters test suite specific

3f3be86

made the fuzzy paramters specific to the whole runner, not the test suite

d0cf61d

gaaclarke requested a review from chinmaygarde April 3, 2023 21:03

chinmaygarde approved these changes Apr 3, 2023

gaaclarke added the autosubmit Merge PR when tree becomes green via auto submit App label Apr 3, 2023

auto-submit bot merged commit 6a6d8cc into flutter:main Apr 3, 2023

engine-flutter-autoroll mentioned this pull request Apr 4, 2023

Roll Flutter Engine from aa60150eda7e to 9b19fade3e0b (4 revisions) flutter/flutter#124083

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Apr 4, 2023

6a6d8ccb4 [Impeller] Adds the ability to specify a golden threshold (flutter/engine#40824)

4b21539

engine-flutter-autoroll mentioned this pull request Apr 4, 2023

Roll Flutter Engine from aa60150eda7e to 4e848dc654ff (5 revisions) flutter/flutter#124086

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Apr 4, 2023

6a6d8ccb4 [Impeller] Adds the ability to specify a golden threshold (flutter/engine#40824)

aef677f

engine-flutter-autoroll mentioned this pull request Apr 4, 2023

Roll Flutter Engine from aa60150eda7e to 3711acef3285 (8 revisions) flutter/flutter#124090

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Apr 4, 2023

6a6d8ccb4 [Impeller] Adds the ability to specify a golden threshold (flutter/engine#40824)

a697d25

engine-flutter-autoroll mentioned this pull request Apr 4, 2023

Roll Flutter Engine from aa60150eda7e to f175a94d47f8 (9 revisions) flutter/flutter#124091

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Apr 4, 2023

6a6d8ccb4 [Impeller] Adds the ability to specify a golden threshold (flutter/engine#40824)

2d7a007

engine-flutter-autoroll mentioned this pull request Apr 4, 2023

Roll Flutter Engine from aa60150eda7e to 37ea80f6951b (11 revisions) flutter/flutter#124093

Merged

engine-flutter-autoroll added a commit to engine-flutter-autoroll/flutter that referenced this pull request Apr 4, 2023

6a6d8ccb4 [Impeller] Adds the ability to specify a golden threshold (flutter/engine#40824)

8783037

Piinks mentioned this pull request Apr 5, 2023

[Impeller] Flaky impeller golden test in the engine flutter/flutter#124277

Closed
