Add a separate test pass to run flaky tests that doesn't fail the build #8486

analogrelay · 2019-03-13T17:40:27Z

This doesn't actually add the pass yet, just a sample flaky test to use to verify things are working. Sorry Kestrel CODEOWNERS, you're getting pinged on this one because you're the random victim ;). I'll remove reviewers and re-add ones that make sense when this is ready to review.

TODO:

- It worked on Windows; make sure it works on macOS and Ubuntu too.
- Make sure TRX files are still published.

jkotalik

Seems great! 😄

analogrelay · 2019-03-13T19:27:20Z

eng/helix/vstest/runtests.cmd

@@ -1,3 +1,6 @@
+REM Disable "!Foo!" expansions because they break the filter syntax
+setlocal disableextensions


cc @HaoK

Even with the planned change to ignore the exit code from helix altogether I'd like to get this change in in some form so that when we do start paying attention to the exit code, we don't start getting known-flaky tests causing failures.

(Still working on Unix scripts btw)

Yup, ignoring the exit code is hopefully just a short/medium term thing that gives us some time to get flaky in place

maybe consider passing in the filters to each script (assuming they are the same in the sh version)

Yeah, they would be the same. I'll look at that. There can be some shell escaping problems with the syntax so it might be tricky, but I'll give it a shot.
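For illustration, here is a minimal bash sketch of how the two filters could be built once and then passed to either script. The trait names are hypothetical placeholders, not the ones the flaky attribute actually emits; only the `|`, `&`, `=` and `!=` operators come from the vstest `--TestCaseFilter` expression syntax.

```shell
#!/usr/bin/env bash
# Hypothetical trait names, used only to show the shape of the filters.
traits=("Flaky:All" "Flaky:Helix:All")

flaky_filter=""
non_flaky_filter=""
for trait in "${traits[@]}"; do
    # Flaky pass: select tests carrying ANY of the traits (OR is written `|`).
    flaky_filter="${flaky_filter:+$flaky_filter|}${trait}=true"
    # Non-flaky pass: select tests carrying NONE of the traits (AND is written `&`).
    non_flaky_filter="${non_flaky_filter:+$non_flaky_filter&}${trait}!=true"
done

# `|` and `&` are shell metacharacters, so the filters must stay quoted
# whenever they are passed on as arguments.
echo "flaky:     $flaky_filter"
echo "non-flaky: $non_flaky_filter"
```

The `!=` operator in the non-flaky filter is one place where quoting and expansion rules can bite, so any shared filter argument would likely need per-shell escaping.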

HaoK · 2019-03-13T21:59:14Z

Do you think it's worth separating flaky and non-flaky further apart for helix? Basically, instead of running both flaky and non-flaky tests on the same helix work item (and just ignoring flaky failures), build two helix work items instead of one per test project, only including the additional flaky work item if the test project has any tests marked flaky (and the script could just take a parameter saying to always report success for that run).

For example:
20190313.11/workItem/ProjectTemplates.Tests-netcoreapp3.0 excluding flaky tests
20190313.11/workItem/ProjectTemplates.Tests-netcoreapp3.0-flaky with the flaky tests filter and alwaysSucceed = true (if we can't detect this at build time, can just add a FlakyTests=true msbuild property for those test projects)
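
A rough bash sketch of the "always report success" parameter described above, under the assumption that the script wraps the test command. The `run_work_item` function and its first argument are hypothetical names, not part of this PR or of Helix, and `false`/`true` stand in for a failing or passing test run.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a command and optionally swallow its failure.
#   $1     "true" to always report success (the flaky work item)
#   $2...  the test command to run
run_work_item() {
    local always_succeed="$1"
    shift
    local status=0
    "$@" || status=$?
    if [ "$status" -ne 0 ] && [ "$always_succeed" = "true" ]; then
        echo "Ignoring failure (exit code $status) in flaky work item" >&2
        return 0
    fi
    return "$status"
}

# Stand-in commands: `false` simulates a failing test run.
run_work_item true false && echo "flaky work item reported success"
run_work_item false false || echo "non-flaky work item reported failure"
```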

analogrelay · 2019-03-13T22:08:30Z

We could do that, but adding a FlakyTest msbuild property seems error-prone. One of the things I like about the current system is that you just slap an attribute on a test and get the behavior, you don't have to remember to update an MSBuild file.

analogrelay · 2019-03-14T22:55:53Z

🤞 I think I have the YAML and MSBuild goop in place to run the Flaky test pass separately.

analogrelay · 2019-03-15T03:30:06Z

Helix results look good (https://mc.dot.net/#/user/aspnetcore/pr~2Faspnet~2Faspnetcore/ci/20190314.26/workItem/Libuv.FunctionalTests-netcoreapp3.0/wilogs). Helix is reporting success even though flaky tests failed.

analogrelay · 2019-03-15T17:26:15Z

I believe this is basically ready for review. I will remove the fake-flaky test I added to Kestrel before merging (feel free to withhold final approval until I do that if you'd like) but the rest should be ready to review.

dougbu

Minor comments and, yes, remove the temporary test 😺

dougbu · 2019-03-15T17:42:00Z

eng/helix/vstest/runtests.sh

-$DOTNET_ROOT/dotnet vstest $1 --logger:trx
+# Run non-flaky tests first
+# We need to specify all possible Flaky filters that apply to this environment, because the flaky attribute
+# only puts the explicit filter traits the user provided in


"provided in" what?

dougbu · 2019-03-15T17:44:31Z

eng/helix/vstest/runtests.cmd

+%DOTNET_ROOT%\dotnet vstest %target% --logger:trx --logger:console;verbosity=normal --TestCaseFilter:%FLAKY_FILTER%
+if errorlevel 1 (
+    echo Failure in flaky test 1>&2
+    REM DO NOT EXIT and DO NOT SET EXIT_CODE to 1


Suggest including similar comments in runtests.sh
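
As a sketch of what the equivalent flow and comments might look like in runtests.sh: the snippet below mirrors the two-pass structure of the .cmd hunk above. `run_vstest` is a stand-in for the real `$DOTNET_ROOT/dotnet vstest ...` invocation so the flow can be tried without the .NET SDK, and the filter values are placeholders rather than the PR's actual filters.

```shell
#!/usr/bin/env bash
# Stand-in for the real test runner (an assumption for this sketch).
# It simulates a passing non-flaky pass and a failing flaky pass.
run_vstest() {
    local filter="$1"
    echo "running tests with filter: $filter"
    [ "$filter" != "FLAKY" ]
}

exit_code=0

# Pass 1: non-flaky tests. A failure here must fail the work item.
if ! run_vstest "NON_FLAKY"; then
    echo "Failure in non-flaky test" >&2
    exit_code=1
fi

# Pass 2: flaky tests. Log the failure, but
# DO NOT EXIT and DO NOT SET exit_code to 1.
if ! run_vstest "FLAKY"; then
    echo "Failure in flaky test" >&2
fi

echo "final exit code: $exit_code"
```

A real script would end with `exit $exit_code` so that only the non-flaky pass decides the result of the work item.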

dougbu · 2019-03-15T17:44:58Z

eng/helix/vstest/runtests.cmd

+
+REM Run non-flaky tests first
+REM We need to specify all possible Flaky filters that apply to this environment, because the flaky attribute
+REM only puts the explicit filter traits the user provided in


"provided in" what?

HaoK

Looks good,

one thing that still nags me is running the non-flaky + flaky tests together on helix. There's going to be potentially a lot more noise in the helix logs with all the flaky tests at the end, so it's going to make it even more painful hunting for the non-flaky error messages (compared to a clean helix job with its own mission control link + a flaky helix job with a perma-red mission control link)

But hopefully once we get trx support for the error reporting maybe it won't be as much of an issue

HaoK · 2019-03-15T17:58:10Z

Basically, an alternative to consider doing eventually is two helix jobs:

- a helix job with FailOnMissionControlFailure = true (the non-flaky version)
- a helix job that passes in the flaky filter to run the tests and runs with FailOnMissionControl = false (so the build ignores any failures from that run)

analogrelay · 2019-03-15T17:58:48Z

Yeah, I do agree that it won't be the cleanest results format. I could suppress the console output of the flaky run maybe.

We could create separate Helix work items for flaky tests if we thought that was useful but it feels like that will take up a lot more resources than it's worth...

HaoK · 2019-03-15T17:59:19Z

Well, it's not a big deal right now since we are treating all helix tests as flaky so...there's that...

jkotalik previously approved these changes Mar 13, 2019

analogrelay commented Mar 13, 2019

Eilon added the area-infrastructure label (Includes: MSBuild projects/targets, build scripts, CI, Installers and shared framework) Mar 13, 2019

analogrelay force-pushed the anurse/flaky-pass branch from f007cf0 to 6f59bbb Compare March 14, 2019 22:49

analogrelay marked this pull request as ready for review March 15, 2019 17:25

analogrelay requested review from dougbu and Tratcher as code owners March 15, 2019 17:25

dougbu approved these changes Mar 15, 2019

HaoK approved these changes Mar 15, 2019

analogrelay added 10 commits March 18, 2019 10:16

add flaky tests we can use to verify it's working

a988f6e

add logic to helix windows script and fix improperly-broken test

598bd30

unix support for flaky test pass

a5eddf1

stash

a12084b

update korebuild to get new flaky test fun

c33ad15

enable flaky test pass in AzP

39c8e6f

How does bash work?

c5e622f

debugging missing TRX files

fe6c61d

never mind, I figured it out

6cc5b8e

comment clean-up

b77353f

analogrelay force-pushed the anurse/flaky-pass branch from 9a190b9 to b77353f Compare March 18, 2019 17:17

removing intentionally-failing test tests

db96c1d

analogrelay merged commit 706778d into master Mar 19, 2019

analogrelay deleted the anurse/flaky-pass branch May 1, 2019 16:53

clavecoder mentioned this pull request Jul 29, 2019

Port to 2.1: JsonResult causes thread pool exhaustion via synchronous flushes aspnet/Mvc#8486 #9762

Closed
