Skip to content

feat: Add custom placeholder pages for scale-from-zero scenarios #1303

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 19 commits into
base: main
Choose a base branch
from

Conversation

malpou
Copy link

@malpou malpou commented May 30, 2025

Allows HTTPScaledObjects to serve configurable HTML pages while workloads scale up from zero, with support for templates,
custom headers, and automatic refresh.

Description

This PR implements customizable placeholder pages that are displayed to users while applications are scaling up from zero.
This significantly improves the user experience during cold starts by providing immediate feedback instead of connection
timeouts or errors.

Key features:

  • Adds placeholderConfig section to HTTPScaledObject CRD for configuring placeholder pages
  • Supports both inline HTML content and ConfigMap-based templates
  • Includes Go template support with variables like {{.ServiceName}} and {{.RefreshInterval}}
  • Automatic page refresh capability to check when the service becomes available
  • Custom HTTP headers and status codes
  • Configurable timeout for how long to show placeholder pages

Implementation details:

  • New placeholder handler in the interceptor that serves configured HTML when backends are unavailable
  • Graceful fallback to upstream requests once the service is ready
  • Full backward compatibility - disabled by default

Checklist

Fixes #874

@malpou malpou requested a review from a team as a code owner May 30, 2025 10:03
@malpou malpou closed this May 30, 2025
@malpou malpou reopened this May 30, 2025
@malpou malpou force-pushed the placeholder-pages branch from 92f8f69 to 02451ee Compare May 30, 2025 10:33
Copy link
Member

@JorTurFer JorTurFer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a great feature! Thanks for the contribution 🙇

<html>
<head>
<title>Service Starting</title>
<meta http-equiv="refresh" content="{{.RefreshInterval}}">
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we inject this value instead of expect that users provide it? I mean, this looks as something that we want to enforce. @wozniakjan ?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was thinking about injecting it but wanted some feedback first.

I think it also would be possible to add some JavaScript instead which calls the URL the browser is on and checks if the X-KEDA-HTTP-Placeholder-Served: true header is there. If it's not we should do a full refresh.
This avoids having the browser do an actual refresh every X seconds. This would mostly be an UX improvement.

Both approaches could be injected into whatever template the user provides, if that's the approach you would prefer.

Example:

<script>
(function() {
    const checkInterval = {{.RefreshInterval}} * 1000; // Convert seconds to milliseconds
    
    async function checkServiceStatus() {
        try {
            // Make a HEAD request to the current URL to check headers
            const response = await fetch(window.location.href, {
                method: 'HEAD',
                cache: 'no-cache'
            });
            
            // Check if the placeholder header is present
            const placeholderHeader = response.headers.get('X-KEDA-HTTP-Placeholder-Served');
            
            if (placeholderHeader !== 'true') {
                // Service is ready! Do a full page refresh
                window.location.reload();
            } else {
                // Still showing placeholder, check again later
                setTimeout(checkServiceStatus, checkInterval);
            }
        } catch (error) {
            // On error, continue checking
            console.error('Error checking service status:', error);
            setTimeout(checkServiceStatus, checkInterval);
        }
    }
    
    // Start checking after the interval
    setTimeout(checkServiceStatus, checkInterval);
})();
</script>

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like this approach! it's quite elegant IMHO, the only thing is that JS needs to be injected as well, doesn't

Copy link
Member

@JorTurFer JorTurFer Jun 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@JorTurFer I've tried to make a take on the injection with a <script> tag and some JS to figure out when to refresh the browser.

malpou added 7 commits June 2, 2025 21:19
Allows HTTPScaledObjects to serve configurable HTML pages while workloads
scale up from zero, with support for templates, custom headers, and
automatic refresh.

Signed-off-by: malpou <[email protected]>
Signed-off-by: malpou <[email protected]>
…should be comptatible in the GHA environment

Signed-off-by: malpou <[email protected]>
@malpou malpou force-pushed the placeholder-pages branch from 2e3f5f5 to a3d021f Compare June 2, 2025 19:19
malpou and others added 7 commits June 3, 2025 13:43
Signed-off-by: Malthe Poulsen <[email protected]>
Signed-off-by: malpou <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
@malpou malpou requested a review from JorTurFer June 3, 2025 16:53
Signed-off-by: Malthe Poulsen <[email protected]>
malpou and others added 3 commits June 4, 2025 08:54
Co-authored-by: Jorge Turrado Ferrero <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
Co-authored-by: Jorge Turrado Ferrero <[email protected]>
Signed-off-by: Malthe Poulsen <[email protected]>
Signed-off-by: malpou <[email protected]>
@malpou malpou requested a review from JorTurFer June 4, 2025 07:08
@wozniakjan wozniakjan requested a review from Copilot June 10, 2025 12:03
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces configurable placeholder pages for scale-from-zero scenarios, enhancing the user experience during cold starts. Key changes include:

  • Adding a new placeholder handler and supporting API changes in the HTTPScaledObject CRD.
  • Adjusting proxy handler logic and test cases to incorporate placeholder page responses.
  • Updating documentation, examples, and deepcopy functions to reflect the new placeholderConfig functionality.

Reviewed Changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
tests/checks/placeholder_pages/placeholder_pages_test.go Added E2E tests for placeholder page responses and script injection.
operator/apis/http/v1alpha1/zz_generated.deepcopy.go Added DeepCopy functions for the new PlaceholderConfig type.
operator/apis/http/v1alpha1/httpscaledobject_types.go Introduced the PlaceholderConfig type in the CRD schema.
interceptor/proxy_handlers_test.go, proxy_handlers_integration_test.go Updated tests to pass new parameters for placeholder handling.
interceptor/proxy_handlers.go Integrated the placeholder handler into the forwarding logic.
interceptor/main.go, main_test.go Updated server initialization to include the placeholder handler.
interceptor/handler/placeholder.go Implemented the placeholder page rendering and caching logic.
examples/vX.X.X/httpscaledobject.yaml Provided example configuration for placeholder pages.
docs/ref/vX.X.X/http_scaled_object.md Documented the placeholderConfig section details.
config/crd/bases/http.keda.sh_httpscaledobjects.yaml Updated the CRD with placeholderConfig properties.
CHANGELOG.md Added an entry for the custom placeholder pages feature.
Comments suppressed due to low confidence (1)

interceptor/handler/placeholder.go:206

  • [nitpick] Consider logging the error returned from getTemplate before falling back to the inline response to improve debuggability of template parsing issues.
tmpl, err := h.getTemplate(r.Context(), hso)

Copy link
Member

@JorTurFer JorTurFer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

awesome job! Only one small nit inline and it's okey to merge ❤️

@malpou malpou requested a review from JorTurFer June 21, 2025 10:17
@Nico385412
Copy link

Hello any news on this feature ? 🙏

@JorTurFer
Copy link
Member

Sorry, @wozniakjan told me that he wanted to review the feature and I just kept this to him.
Did you take a look @wozniakjan ?

Copy link
Member

@wozniakjan wozniakjan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this looks really great, thank you for contributing!

I do have a couple of discussion points before it gets merged

  • usage of ConfigMaps for storing content for placeholder pages - imho defaults would be better configured globally as a statically mounted ConfigMap or env variable on Deployment
  • simplifying content processing - the default as a nicely formatted HTML page makes great sense but then we should allow arbitrary payloads with arbitrary headers, not all usage of this tool is aimed at web browsers, sometimes it's microservices talking between each other

Comment on lines +20 to +102
const placeholderScript = `<script>
(function() {
const checkInterval = {{.RefreshInterval}} * 1000;

async function checkServiceStatus() {
try {
const response = await fetch(window.location.href, {
method: 'HEAD',
cache: 'no-cache'
});

const placeholderHeader = response.headers.get('X-KEDA-HTTP-Placeholder-Served');

if (placeholderHeader !== 'true') {
window.location.reload();
} else {
setTimeout(checkServiceStatus, checkInterval);
}
} catch (error) {
console.error('Error checking service status:', error);
setTimeout(checkServiceStatus, checkInterval);
}
}

setTimeout(checkServiceStatus, checkInterval);
})();
</script>`

const defaultPlaceholderTemplateWithoutScript = `<!DOCTYPE html>
<html>
<head>
<title>Service Starting</title>
<meta charset="utf-8">
<style>
body {
font-family: -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, sans-serif;
display: flex;
align-items: center;
justify-content: center;
min-height: 100vh;
margin: 0;
background: #f5f5f5;
}
.container {
text-align: center;
padding: 2rem;
background: white;
border-radius: 8px;
box-shadow: 0 2px 10px rgba(0,0,0,0.1);
max-width: 400px;
}
h1 {
color: #333;
margin-bottom: 1rem;
font-size: 1.5rem;
}
.spinner {
width: 40px;
height: 40px;
margin: 1.5rem auto;
border: 4px solid #f3f3f3;
border-top: 4px solid #3498db;
border-radius: 50%;
animation: spin 1s linear infinite;
}
@keyframes spin {
0% { transform: rotate(0deg); }
100% { transform: rotate(360deg); }
}
p {
color: #666;
margin: 0.5rem 0;
}
</style>
</head>
<body>
<div class="container">
<h1>{{.ServiceName}} is starting up...</h1>
<div class="spinner"></div>
<p>Please wait while we prepare your service.</p>
</div>
</body>
</html>`
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wdyt about moving these default templates to configuration instead of having them hardcoded in the binary? I can imagine setting it through environment variables in the Deployment or alternatively, which might be even better, mounted as a ConfigMap.

These default pages are pretty good examples, but I'd imagine people may want to tweak them a little bit or centrally replace with different defaults. As implemented, iiuc, users would have to copy the same ConfigMap into each namespace and then reference it in the HTTPScaledObject or inline the same string in each HTTPScaledObject.

}

// For non-HTML content, wrap it in a minimal HTML structure with the script
return fmt.Sprintf(`<!DOCTYPE html>
Copy link
Member

@wozniakjan wozniakjan Jul 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what if users don't want HTML content? The client doesn't have to be web browser, it could be two microservices talking to each other and they may have other L7 protocols (json, protobuf, etc) with their own mechanisms on how to handle retries.

Comment on lines +85 to +91
// Path to ConfigMap containing placeholder HTML content (in same namespace)
// +optional
ContentConfigMap string `json:"contentConfigMap,omitempty" description:"ConfigMap name containing placeholder HTML content"`
// Key in ConfigMap containing the HTML template
// +kubebuilder:default="template.html"
// +optional
ContentConfigMapKey string `json:"contentConfigMapKey,omitempty" description:"Key in ConfigMap containing the HTML template"`
Copy link
Member

@wozniakjan wozniakjan Jul 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mind the fact interceptor doesn't have RBAC for ConfigMaps. Is it worth adding clusterwide read access to ConfigMaps so users can configure content of a placeholder page per namespace? Imho it's a bit too much and I would rather just add defaults as mounted ConfigMaps to interceptor or env variables on the Deployment as mentioned above.

if config.ContentConfigMap != "" {
cacheKey := fmt.Sprintf("%s/%s/cm/%s", hso.Namespace, hso.Name, config.ContentConfigMap)

cm, err := h.k8sClient.CoreV1().ConfigMaps(hso.Namespace).Get(ctx, config.ContentConfigMap, metav1.GetOptions{})
Copy link
Member

@wozniakjan wozniakjan Jul 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would rather not use ConfigMaps at the HTTPScaledObject level but if we decide to go with them, please use cached client instead of the direct client. Fetching ConfigMap for each request on zero-scaled apps might be tricky with kube-client QPS and could easily cause a lot of load for kube-apiserver if users increase the burst config to "deal with errors".

// PlaceholderHandler handles serving placeholder pages during scale-from-zero
type PlaceholderHandler struct {
k8sClient kubernetes.Interface
routingTable routing.Table
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
routingTable routing.Table

the routing table seems to be unused, there are only three references and they don't seem to carry much value?

  1 interceptor/handler/placeholder.go|114 col 2| routingTable  routing.Table
  2 interceptor/handler/placeholder.go|140 col 3| routingTable:  routingTable,
  3 interceptor/handler/placeholder_test.go|32 col 27| assert.NotNil(t, handler.routingTable)

Comment on lines +76 to +86
endpoints, err := endpointsCache.Get(httpso.GetNamespace(), httpso.Spec.ScaleTargetRef.Service)
if err == nil && workloadActiveEndpoints(endpoints) == 0 {
if placeholderErr := placeholderHandler.ServePlaceholder(w, r, httpso); placeholderErr != nil {
lggr.Error(placeholderErr, "failed to serve placeholder page")
w.WriteHeader(http.StatusBadGateway)
if _, err := w.Write([]byte("error serving placeholder page")); err != nil {
lggr.Error(err, "could not write error response to client")
}
}
return
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

situation when placeholder pages are configured and the endpoints cache returns error is not handled, which means the code will fall through to waiting on cold-start. Is that desired, or should the interceptor instead return a 503 error indicating an internal problem?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

"Please hold" landing pages for slow scale from zero scenarios
4 participants