Bump up oximeter redundancy to 3 perhaps? #6900

Open
@askfongjojo

Oximeter is one of the few services that has no redundancy in the current provisioning policy. Metrics haven't been considered mission-critical so far because they weren't previously exposed to users, and they are still exposed only experimentally via OxQL. But as customers start to consume the data for monitoring purposes, service availability will become more important than before.

Besides providing redundancy, distributing metrics collection across different sleds will also help balance the network traffic load among them. The sled_data_link:bytes_sent|received metrics on rack2 show that oximeter is the heaviest consumer of network bandwidth among all the non-crucible control plane services.
