Bucketize autoscaling metrics by timeframe not by pod name. #3289

markusthoemmes · 2019-02-20T15:57:00Z

Proposed Changes

Stats are now averaged within each timeframe (bucket) rather than over the whole window. See the linked issue for more in-depth information.

Release Note

TBD

knative-prow-robot

@markusthoemmes: 0 warnings.

Details

In response to this:

Fixes #2977



markusthoemmes · 2019-02-22T10:47:01Z

Unrelated failure

/test pull-knative-serving-integration-tests

markusthoemmes · 2019-02-22T12:31:34Z

/assign @yanweiguo
/assign @k4leung4

Please let me know what you think.

markusthoemmes · 2019-02-22T13:36:10Z

test/e2e/autoscale_test.go


 	go func() {
-		if err := generateTraffic(ctx, int(numPods*10), 30*time.Second, stopChan); err != nil {
+		if err := generateTraffic(ctx, int(numPods*10), 60*time.Second, stopChan); err != nil {


These changes stabilize the autoscaling tests. They have recently been adjusted to continue generating more traffic as soon as we hit the desired replica count. However, that's only been done on "Replicas", so we're in danger of overflowing if the pod takes a while to come up.

Likewise, the amount of traffic being sent in (30s) can be just about enough to cause us to scale up. After 60s it's guaranteed to (for the default window sizes).

vagababov

Mostly superficial comments. I need to re-read the PR for the logic part, though it mostly makes sense to me.

pkg/autoscaler/autoscaler.go

yanweiguo · 2019-02-22T18:54:00Z

pkg/autoscaler/autoscaler_test.go

 	kubeInformer.Core().V1().Endpoints().Informer().GetIndexer().Add(ep)
 }
+
+func roundedNow() time.Time {


Is the reason for using roundedNow that it prevents flakiness, because some stats could fall outside the scale window if now() is used directly?

Yes, it basically normalizes the instances of "now" so the test doesn't depend on exactly when it is executed. Especially when adding to "now" in the tests, we would otherwise risk jumping into other buckets in the calculation. It makes the test deterministic.

vagababov

/lgtm

knative-metrics-robot · 2019-02-22T19:26:10Z

The following is the coverage report on pkg/.
Say /test pull-knative-serving-go-coverage to re-run this coverage report

File	Old Coverage	New Coverage	Delta
pkg/autoscaler/autoscaler.go	97.2%	97.0%	-0.3

k4leung4 · 2019-02-22T19:34:33Z

/lgtm

yanweiguo · 2019-02-22T19:56:38Z

/lgtm

srinivashegde86 · 2019-02-22T20:15:58Z

/lgtm
/approve

knative-prow-robot · 2019-02-22T20:16:04Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: markusthoemmes, srinivashegde86

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~pkg/autoscaler/OWNERS~~ [markusthoemmes]
~~test/OWNERS~~ [srinivashegde86]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

knative-prow-robot added the do-not-merge/work-in-progress label Feb 20, 2019

knative-prow-robot requested review from josephburnett and mdemirhan February 20, 2019 15:57

knative-prow-robot added the size/L label Feb 20, 2019

knative-prow-robot reviewed Feb 20, 2019


knative-prow-robot added the approved label Feb 20, 2019

markusthoemmes mentioned this pull request Feb 20, 2019

Autoscaler pod calculation when transitioning off activator drops activator metrics #3281

Closed

markusthoemmes added 10 commits February 22, 2019 09:45

Initial implementation of a different aggregation strategy.

0fcfc01

More simplification.

15b3f3e

Some more stabilizing.

22995b6

Some more adjustments.

b8a2702

Throw away 'isActivator'.

4ffb1c2

Further simplification.

ab13afc

Externalize aggregation logic.

908bb23

Hardening the autoscale test.

102082e

Rolling back unnecessary test changes.

f77ee4b

Revive system totals.

a61fb92

markusthoemmes force-pushed the new-autoscaler branch from 0fde3c4 to a61fb92 February 22, 2019 09:58

knative-prow-robot removed the approved label Feb 22, 2019

markusthoemmes marked this pull request as ready for review February 22, 2019 10:13

markusthoemmes changed the title ~~[WIP] Bucketize autoscaling metrics by timeframe not by pod name.~~ Bucketize autoscaling metrics by timeframe not by pod name. Feb 22, 2019

knative-prow-robot removed the do-not-merge/work-in-progress label Feb 22, 2019

knative-prow-robot assigned k4leung4 and yanweiguo Feb 22, 2019

Self review: Add documentation, fix nits.

7a10e6d

markusthoemmes mentioned this pull request Feb 22, 2019

Don't double account for requests going through the activator. #3301

Closed

markusthoemmes commented Feb 22, 2019


Nit about variable grouping.

7bfb32e

vagababov reviewed Feb 22, 2019


markusthoemmes added 4 commits February 22, 2019 18:43

Pass Stat as a pointer.

61fb53c

Use type-alias instead of a struct type.

8dcbe3a

Typo.

6d439c3

Collapse ifs.

6e3d61f

yanweiguo reviewed Feb 22, 2019


vagababov reviewed Feb 22, 2019


knative-prow-robot assigned vagababov Feb 22, 2019

knative-prow-robot added the lgtm label Feb 22, 2019

More review comments.

c4f8174

knative-prow-robot removed the lgtm label Feb 22, 2019

Distinguish division issues better.

d05a769

knative-prow-robot added the lgtm label Feb 22, 2019

knative-prow-robot assigned srinivashegde86 Feb 22, 2019

knative-prow-robot added the approved label Feb 22, 2019

knative-prow-robot merged commit 366aa03 into knative:master Feb 22, 2019

yanweiguo mentioned this pull request Feb 23, 2019

Scrape queue-proxy metrics in autoscaler #3149

Merged

josephburnett mentioned this pull request Jun 5, 2019

Add markusthoemmes as Scaling Working Group Lead. knative/community#11

Merged
