
Summary: Display thresholds values even when not in summaryTrendStats #4698


Merged
joanlopez merged 3 commits into master from fix-4695 on Apr 23, 2025

Conversation

joanlopez (Contributor)

What?

Makes it possible to display threshold values (since #4089) even when the aggregation used in the threshold expression (e.g. p(99.5)) isn't included in options.summaryTrendStats.

Why?

Currently, those values aren't displayed (instead, undefined is printed, which I would consider a bug), because we don't calculate values for aggregations other than those listed in options.summaryTrendStats.

Alternatively, we could just omit these lines from the summary, but I think they are valuable information for the user, and displaying them is aligned with the goal of the revamped summary.
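
For context, a minimal k6 script sketch of the situation described above (the metric, target URL, and threshold values are illustrative assumptions): the threshold uses p(99.5), which isn't part of summaryTrendStats.

import http from 'k6/http';

export const options = {
  // Only these trend stats are rendered for each trend metric in the end-of-test summary.
  summaryTrendStats: ['avg', 'min', 'max', 'p(90)', 'p(95)'],
  thresholds: {
    // p(99.5) isn't listed in summaryTrendStats, yet the thresholds section
    // should still display its computed value instead of `undefined`.
    http_req_duration: ['p(99.5)<1000'],
  },
};

export default function () {
  http.get('https://test.k6.io');
}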

Checklist

  • I have performed a self-review of my code.
  • I have commented on my code, particularly in hard-to-understand areas.
  • I have added tests for my changes.
  • I have run linter and tests locally (make check) and all pass.

Checklist: Documentation (only for k6 maintainers and if relevant)

Please do not merge this PR until the following items are filled out.

  • I have added the correct milestone and labels to the PR.
  • I have updated the release notes: link
  • I have updated or added an issue to the k6-documentation: grafana/k6-docs#NUMBER if applicable
  • I have updated or added an issue to the TypeScript definitions: grafana/k6-DefinitelyTyped#NUMBER if applicable

Related PR(s)/Issue(s)

Closes #4695

@joanlopez requested review from oleiade and codebien on April 16, 2025 12:01
@joanlopez self-assigned this on Apr 16, 2025
@joanlopez requested a review from a team as a code owner on April 16, 2025 12:01
oleiade previously approved these changes on Apr 16, 2025
@oleiade (Contributor) left a comment:

🚀 LGTM on my end 🚀

(I believe removing the logs 👇🏻 would fix the tests 🙇🏻 )

Comment on lines 947 to 949
console.log(info);
console.log(metric.values);

Contributor:

Suggested change (remove these lines):
console.log(info);
console.log(metric.values);

joanlopez (Contributor Author):

Yeah, sorry, I forgot one extra force-push before opening the PR haha 😅

Contributor:

Hehe, no worries 😄 Been there, done that, many times 🫶

@@ -839,7 +839,7 @@ function renderThresholdResults(
: formatter.decorate(failMark, 'red');

const sourceText = formatter.decorate(
`'${threshold.source}'`,
joanlopez (Contributor Author):

This isn't directly related, but it's so small that I prefer to add it as part of this PR. It's not a huge deal, but I think it's nice if we trim the threshold's source before displaying it in the summary.
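
As a plain JavaScript illustration of the trimming being discussed (the value below is hypothetical; the actual change is in renderThresholdResults, shown in the diff above):

// Hypothetical threshold source written by a user, with stray whitespace.
const source = ' p(99.5) < 1000 ';

// Rendered as-is, the surrounding quotes expose the accidental spaces:
console.log(`'${source}'`);        // ' p(99.5) < 1000 '

// Trimming before display keeps only the meaningful expression:
console.log(`'${source.trim()}'`); // 'p(99.5) < 1000'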

Contributor:

What is the reason for trimming here? If the source includes spaces, why don't we interpret them as intentional?

joanlopez (Contributor Author):

If the source includes spaces, why don't we interpret them as intentional?

I'm a bit confused now; is there any reason for doing so?

To me, it feels like mistakenly adding leading or trailing whitespace in an input field meant for your name or something similar.

Looking at the threshold expression syntax (ref), I can see users adding (or not adding) whitespace before/after the <operator> for readability, but I don't see why they would do so at the beginning/end of the threshold definition.

@codebien (Contributor) left a comment:

Just an observation: the user experience might be tricky here, because now we might have scenarios where disabled stats appear in the summary.

I know this isn't a new limitation introduced here; instead, it highlights the need to track/untrack specific metrics as described in #1321, where we should extend the same concept to stats as well.


@@ -1061,6 +1065,10 @@ function computeSummaryInfo(metrics, renderContext, options) {
const nonTrendExtras = {};
const trendCols = {};

// While "trendCols" contain the values for each "trendStats" aggregation (e.g. p(90) as a sorted array,
// "trendKeys" is used to store specific aggregation values that aren't part of "trendStats"; mainly for thresholds.
const trendKeys = {};
@codebien (Contributor), Apr 22, 2025:

// While "trendCols" contain the values for each "trendStats"

It doesn't read right without some in-depth thinking, because Trend is itself a stat. Wouldn't it be easier to just use the concept of Threshold?

joanlopez (Contributor Author):

I'm not sure 🤷🏻 The code/data structures are quite generic, and while it's true that this is now used for thresholds, it could technically be used for anything else, so I tried to keep it consistent with the existing "conventions" (if any): one data structure holds stats identified by "columns", and this one by "keys" 😅 (roughly as sketched below).
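
For illustration only, a rough sketch of how the two structures might differ (the names come from the diff above; the exact shapes and values are assumptions, not the actual implementation):

// "trendCols": one entry per configured summaryTrendStats aggregation,
// holding the values used to render the aligned columns of the summary.
const trendCols = {
  'avg':   ['120.5ms', '45.2ms'],
  'p(95)': ['310.7ms', '98.1ms'],
};

// "trendKeys": ad-hoc aggregation values that are NOT part of summaryTrendStats,
// kept only so thresholds such as p(99.5) can still display their value.
const trendKeys = {
  'p(99.5)': '512.3ms',
};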

Overall, that code has many aspects that could be improved (there are already some // TODOs), and @oleiade already did a great job fixing some of them as part of #4089, but I'd prefer to keep the focus on fixing the reported issue and re-evaluate naming and possible improvements to this code in another PR / as part of another issue.

@joanlopez (Contributor Author):

Just an observation: the user experience might be tricky here, because now we might have scenarios where disabled stats appear in the summary.

I know this isn't a new limitation introduced here; instead, it highlights the need to track/untrack specific metrics as described in #1321, where we should extend the same concept to stats as well.

I'm sorry @codebien, but I don't fully get your point here. As far as I can see, #1321 is all about metrics and sub-metrics, but here we only care about different "calculations" (percentiles, basically) of already existing metrics/sub-metrics, so I cannot see how it's related 🤔 Conceptually, I see those two as different, unrelated things.

@codebien (Contributor):

@joanlopez, my interpretation of the issue is that the user wants to add a threshold for a specific metric at a particular percentile without adding that percentile to all the other metrics.

If that's not the case, then I guess we could simply suggest using summaryTrendStats: [/* other stats */..., 'p(99.99)'] or on our side, skip visualizing the non-computed stats to avoid undefined values.

However, neither of these solutions addresses the user's specific requirement.

Instead, I think #1321 might address it, if we integrate the proposed API with an option for defining the stats. Something similar to the example below:

export const options = { 
  summaryTrendStats: ['max', 'p(95)'], 
};

track("http_req_duration", {percentile: 99.9})

Without the track line, the threshold would never fail because the stat for the metric doesn't exist. Adding this feature will lead to a breaking change, but that should be acceptable since I don't expect this to become real before v2.

@joanlopez (Contributor Author):

@joanlopez, my interpretation of the issue is that the user wants to add a threshold for a specific metric at a particular percentile without adding that percentile to all the other metrics.

Let me slightly correct you, because I think it's key here:

"...at a particular percentile without displaying that percentile in summary to all the other metrics."

What I mean here is basically two things (in my humble opinion):

  1. The summaryTrendStats option is, as its name hints, an option to configure the "end-of-test summary" that lets users choose what's displayed there (which percentile values are displayed in the summary for trend metrics). It is NOT an option to configure which metrics are tracked/untracked.
  2. Percentiles aren't a form of "tracked (sub)metrics", but just a calculation you do over a metric, or more precisely over a sink. It's like when we refer to "min" or "max" for a given "gauge sink" (see the sketch right after this list). What I think could make sense in the future, as you mentioned, is a mechanism to let the user choose which (sub)metrics are tracked, but less likely which calculations.
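
As a plain, self-contained illustration of point 2 (this is not k6 code; the nearest-rank formula below is just one simple way to compute a percentile, and the samples are made up):

// Hypothetical samples collected by a trend sink.
const samples = [120, 95, 310, 180, 250, 99, 140];

// A percentile is computed on demand from the samples, just like min or max;
// it is a calculation over the sink, not a separately tracked (sub)metric.
function percentile(values, p) {
  const sorted = [...values].sort((a, b) => a - b);
  const idx = Math.ceil((p / 100) * sorted.length) - 1;
  return sorted[Math.max(0, idx)];
}

console.log(Math.min(...samples));      // 95
console.log(Math.max(...samples));      // 310
console.log(percentile(samples, 99.9)); // 310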

If that's not the case, then I guess we could simply suggest using summaryTrendStats: [/* other stats */..., 'p(99.99)'] or on our side, skip visualizing the non-computed stats to avoid undefined values.

I disagree with both, because:

  • I still see value in having the option to define a threshold for a concrete percentile, even if you don't want to see the value of that percentile for ALL the metrics displayed in the end-of-test summary. In other words, why do I need to add the value of p(99.99) for all the trend metrics to the summary, if I only care about the p(99.99) of this particular metric X?
  • I think it's better to display the value in the thresholds section than hiding it, because:
    • It keeps the behavior consistent with the whole "thresholds" section, where the value of the aggregation the threshold is defined on is always displayed. It also keeps it consistent with the rest of the summary, as the value won't be displayed in the list below, where all the metrics are displayed according to summaryTrendStats.
    • Accompanying threshold results with the value of the particular aggregator used by the threshold was precisely one of the main ideas of the revamped end-of-test summary, because it makes it very easy for the user to understand why that threshold succeeded/failed.

However, neither of these solutions addresses the user's specific requirement.

Instead, I think #1321 might address it, if we integrate the proposed API with an option for defining the stats. Something similar to the example below:

export const options = { 
  summaryTrendStats: ['max', 'p(95)'], 
};

track("http_req_duration", {percentile: 99.9})

Without the track line, the threshold would never fail because the stat for the metric doesn't exist. Adding this feature will lead to a breaking change, but that should be acceptable since I don't expect this to become real before v2.

That's definitely out of the scope of this PR, so I'd postpone that discussion until we design that feature, but I'm not super convinced that having to specify which calculations we want to track is really a good idea, because calculations aren't that associated with noise/high memory consumption, which I think is the underlying goal of that feature, and doing so would make both the design and implementation of such a feature considerably more complex.

cc/ @oleiade any take here?

@joanlopez requested a review from codebien on April 23, 2025 07:05
@codebien (Contributor) left a comment:

I want to unblock the merge of this pull request, so it mostly looks good to me. I still want to check the context to provide an answer for #4698 (comment), but I don't have time today; I will do it at the end of the week. 🙇

@codebien (Contributor):

@joanlopez However, the CI seems to be red; can you check that it is not related, please?

@joanlopez (Contributor Author):

@joanlopez However, the CI seems to be red; can you check that it is not related, please?

It isn't; it's just one of the flaky Browser E2E tests that defines a threshold that isn't always satisfied.

@codebien added this to the v1.0.0 milestone on Apr 23, 2025
@oleiade (Contributor) left a comment:

🚀 🍾

@joanlopez merged commit 8d96e64 into master on Apr 23, 2025
28 checks passed
@joanlopez deleted the fix-4695 branch on April 23, 2025 12:58
Successfully merging this pull request may close these issues.

A percentile for not default values is undefined in the report
3 participants