ai/live: Use synctest in discovery #3739

j0sh · 2025-09-16T06:10:05Z

The new synctest package, introduced in golang 1.25, makes tests involving time and concurrency much faster and more reliable. All the discovery tests that took more than 100ms on my machine were converted to use synctest.

This surfaced a number of race conditions in both the tests and the code itself so these were also fixed.

Timings with go test -timeout 10s -count 1 -race

Before:
ok github.com/livepeer/go-livepeer/discovery 7.087s

After:
ok github.com/livepeer/go-livepeer/discovery 0.817s

Since syncest depends on a nested call similar to a sub-test, I opted to just rename the top level test function in the interests of not having to re-indent the entire file, or having yet another level of indentation around the file.

j0sh · 2025-09-17T16:23:42Z

Rebased onto latest master to fix merge conflicts, no other changes

mjh1

nice 👍

mjh1 · 2025-09-18T14:46:47Z

discovery/discovery_test.go

 	assert.Nil(err, "Should not be error")
 	assert.Len(infos, 1, "Should return one orchestrator")
 	assert.Equal("transcoderfromtestserver", infos[0].RemoteInfo.Transcoder)
+	wgWait(&wg)


do we need to check the returned bool?

good idea, checked in a6110ec

mjh1 · 2025-09-18T14:48:19Z

discovery/discovery_test.go

-	first := true
 	serverGetOrchInfo = func(ctx context.Context, bcast common.Broadcaster, orchestratorServer *url.URL, params server.GetOrchestratorInfoParams) (*net.OrchestratorInfo, error) {
-		mu.Lock()
-		if first {


why were we pausing on the first request, don't really understand that

me neither - best guess is some early test needed it or had a race condition (probably the deadlock one), and it got copied everywhere else.

I double checked the rest of these tests and seems that I mistakenly removed the sleep from the deadlock test; that apparently was meant to drop one of the orchs so it wouldn't pass discovery. Re-added it, hopefully with a little better explanation in a6110ec (I still have no idea how this could have caused a deadlock though.)

victorges

this synctest is really neat!

The golang 1.25 update turned out into something of an ordeal, so splitting it up from #3739 to allow for better review. While everything compiled in 1.25, some tests and CI steps broke. * Go lint needed to be updated to v2+ to support golang 1.25 * Bumping go vet from v1 to v2 entailed a bunch of CLI changes. Tried to make those 1-to-1 as much as possible, however: Go fmt now needs to be invoked separately for golint. We check go fmt in the CI step right before lint ... so just remove the go fmt step? The revive linter flagged a ton of missing comments. But it seems the linter only applies to the `pm` and `verification` packages which hardly change. Rather than update those packages, remove the use of the `revive` linter. * Some linting step (not sure which one? vet?) now complains about format string functions being used incorrectly, eg using a variable in place of a format string literal. Those were fixed in the code. * The `rand.Seed` function apparently became a no-op as of golang 1.24 and that broke most of our tests that use the RNG. Removed the init() function that sets the seed, and added a package level RNG context (using the same seed as the old init) and updated any failing tests to use that context. Not all uses of the RNG were updated, just ones that broke tests. The global RNG is safely initialized so we can continue to use it by default in the code that still uses it without test coverage.

The new synctest package, introduced in golang 1.25, makes tests much faster and more reliable. All the discovery tests that took more than 100ms on my machine were converted to use synctest. This surfaced a number of race conditions in both the tests and the code itself so these were also fixed. Timings with `go test -timeout 10s -count 1 -race` Before: ok github.com/livepeer/go-livepeer/discovery 7.087s After: ok github.com/livepeer/go-livepeer/discovery 0.817s Since syncest depends on a nested call similar to a sub-test, I opted to just rename the top level test function in the interests of not having to re-indent the entire file, or having yet another level of indentation around the file.

j0sh · 2025-09-22T23:06:24Z

Since the update for golang 1.25 became more involved than I was expecting, separated that out in its own PR in #3745 so this one has just the synctest changes. Once that goes in, this one will go in too.

The golang 1.25 update turned out into something of an ordeal, so splitting it up from #3739 to allow for better review. While everything compiled in 1.25, some tests and CI steps broke. * Go lint needed to be updated to v2+ to support golang 1.25 * Bumping go vet from v1 to v2 entailed a bunch of CLI changes. Tried to make those 1-to-1 as much as possible, however: Go fmt now needs to be invoked separately for golint. We check go fmt in the CI step right before lint ... so just remove the go fmt step? The revive linter flagged a ton of missing comments. But it seems the linter only applies to the `pm` and `verification` packages which hardly change. Rather than update those packages, remove the use of the `revive` linter. * Some linting step (not sure which one? vet?) now complains about format string functions being used incorrectly, eg using a variable in place of a format string literal. Those were fixed in the code. * The `rand.Seed` function apparently became a no-op as of golang 1.24 and that broke most of our tests that use the RNG. Removed the init() function that sets the seed, and added a package level RNG context (using the same seed as the old init) and updated any failing tests to use that context. Not all uses of the RNG were updated, just ones that broke tests. The global RNG is safely initialized so we can continue to use it by default in the code that still uses it without test coverage. * Fix race condition in discovery uncovered by golang 1.25

codecov · 2025-09-25T22:05:14Z

Codecov Report

❌ Patch coverage is 0% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 31.14775%. Comparing base (afef0a6) to head (4099731).
⚠️ Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
common/testutil.go	0.00000%	2 Missing ⚠️

Additional details and impacted files

@@                 Coverage Diff                 @@
##              master       #3739         +/-   ##
===================================================
- Coverage   31.14906%   31.14775%   -0.00131%     
===================================================
  Files            158         158                 
  Lines          47552       47554          +2     
===================================================
  Hits           14812       14812                 
- Misses         31852       31854          +2     
  Partials         888         888

Files with missing lines	Coverage Δ
common/testutil.go	`16.12903% <0.00000%> (-0.53764%)`	⬇️

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update afef0a6...4099731. Read the comment docs.

Files with missing lines	Coverage Δ
common/testutil.go	`16.12903% <0.00000%> (-0.53764%)`	⬇️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

j0sh requested review from victorges, leszko and mjh1 September 16, 2025 06:10

github-actions bot added dependencies Pull requests that update a dependency file go Pull requests that update Go code labels Sep 16, 2025

j0sh mentioned this pull request Sep 16, 2025

ai/live: Support for multiple instances per orchestrator in discovery #3719

Merged

Base automatically changed from ja/multiple-serviceuri to master September 17, 2025 15:30

j0sh force-pushed the ja/discovery-synctest branch from a1ec486 to d8f25d8 Compare September 17, 2025 16:22

j0sh mentioned this pull request Sep 18, 2025

ai/live: Gateway native WHEP server #3691

Open

mjh1 approved these changes Sep 18, 2025

View reviewed changes

mjh1 reviewed Sep 18, 2025

View reviewed changes

victorges approved these changes Sep 19, 2025

View reviewed changes

j0sh mentioned this pull request Sep 22, 2025

ai/live: Update to golang 1.25 #3745

Merged

j0sh force-pushed the ja/discovery-synctest branch from f3310ea to 424ddbc Compare September 22, 2025 23:01

j0sh changed the title ~~ai/live: Update to golang 1.25, use synctest in discovery~~ ai/live: Use synctest in discovery Sep 22, 2025

j0sh changed the base branch from master to ja/golang-1.25 September 22, 2025 23:02

j0sh force-pushed the ja/discovery-synctest branch from 424ddbc to b6015f2 Compare September 22, 2025 23:04

Base automatically changed from ja/golang-1.25 to master September 25, 2025 20:30

j0sh added 2 commits September 25, 2025 13:35

Merge branch 'master' into ja/discovery-synctest

ca11c29

Convert timing sensitive transcoder test to synctest

4c11855

fix some wonkiness around non deterministic test runs

4099731

j0sh merged commit 7f04401 into master Sep 26, 2025
28 of 30 checks passed

j0sh deleted the ja/discovery-synctest branch September 26, 2025 00:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ai/live: Use synctest in discovery #3739

ai/live: Use synctest in discovery #3739

Uh oh!

j0sh commented Sep 16, 2025 •

edited

Loading

Uh oh!

j0sh commented Sep 17, 2025

Uh oh!

mjh1 left a comment

Uh oh!

mjh1 Sep 18, 2025

Uh oh!

j0sh Sep 18, 2025

Uh oh!

mjh1 Sep 18, 2025

Uh oh!

j0sh Sep 18, 2025

Uh oh!

victorges left a comment

Uh oh!

j0sh commented Sep 22, 2025

Uh oh!

codecov bot commented Sep 25, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

ai/live: Use synctest in discovery #3739

ai/live: Use synctest in discovery #3739

Uh oh!

Conversation

j0sh commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

j0sh commented Sep 17, 2025

Uh oh!

mjh1 left a comment

Choose a reason for hiding this comment

Uh oh!

mjh1 Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

j0sh Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

mjh1 Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

j0sh Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

victorges left a comment

Choose a reason for hiding this comment

Uh oh!

j0sh commented Sep 22, 2025

Uh oh!

codecov bot commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

j0sh commented Sep 16, 2025 •

edited

Loading

codecov bot commented Sep 25, 2025 •

edited

Loading