Automatic compression selection issues #7037

### Observed behavior

**TL;DR**
We've been facing CPU throttling, high memory consumption, and increasing latency in the hub clusters. 

After we turned compression off on the leafnode connections, all of these problems disappeared.
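
For reference, a minimal sketch of what disabling compression on the hub side could look like. The file name and the listen port `7422` are assumptions for illustration, not our actual configuration; `compression: off` inside the `leafnodes` block is the setting we understand controls this, so please check it against the server documentation for your version.

```shell
# Write a hypothetical hub-side config with leafnode compression disabled.
# File name and port are illustrative assumptions.
cat > hub-leaf-nocompress.conf <<'EOF'
leafnodes {
  port: 7422
  compression: off
}
EOF

# Optionally validate the file without starting the server:
# nats-server -c hub-leaf-nocompress.conf -t
```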

From our initial analysis, we think a snowball effect occurred: once memory usage reached a certain threshold, the garbage collector was triggered, which in turn increased CPU usage. As the CPU became throttled, RTT started to grow for some connections. This led to an automatic upgrade to a higher compression level, which further increased CPU load and RTT, perpetuating the cycle.
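
To make the suspected feedback loop concrete, here is a toy model of it. This is only a sketch: `select_mode`, `added_rtt`, and `simulate` are hypothetical helpers, the thresholds (10/50/100 ms) are assumed for illustration, and the per-level RTT penalties are invented numbers, not measurements from our clusters or the server's actual selection logic.

```shell
# Hypothetical helper: pick a compression level from the RTT in milliseconds.
# Thresholds of 10/50/100 ms are assumptions for illustration.
select_mode() {
  if   [ "$1" -le 10 ];  then echo "uncompressed"
  elif [ "$1" -le 50 ];  then echo "s2_fast"
  elif [ "$1" -le 100 ]; then echo "s2_better"
  else                        echo "s2_best"
  fi
}

# Invented cost model: extra RTT (ms) each level adds on a CPU-throttled node.
added_rtt() {
  case "$1" in
    uncompressed) echo 0 ;;
    s2_fast)      echo 20 ;;
    s2_better)    echo 70 ;;
    s2_best)      echo 80 ;;
  esac
}

# Re-evaluate the level for a number of rounds, starting from a base RTT.
# Each round prints the chosen level and the RTT that results from it.
simulate() {
  base=$1
  rounds=$2
  rtt=$base
  i=0
  while [ "$i" -lt "$rounds" ]; do
    mode=$(select_mode "$rtt")
    rtt=$(( base + $(added_rtt "$mode") ))
    echo "$mode $rtt"
    i=$(( i + 1 ))
  done
}

simulate 40 4
```

In this model, a base RTT that stays under the first threshold (for example `simulate 5 3`) never leaves the uncompressed level, while a base RTT that has already crossed it ratchets up one level per round and then stays at the highest one, because the cost of compressing keeps the RTT above the thresholds.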

We encountered this issue on our production servers, but were able to reproduce it in our development setup. 

All the data below refers to the development setup.

_Note_: In the development setup, tracing and debugging are enabled. We repeated the same test with `trace` and `debug` set to `false`, without noticing any changes in behavior.

**Observability**

CPU:

![Image](https://github.com/user-attachments/assets/242901a5-1ab9-40fb-bf92-9522ce7e06c1)

Memory:

_Observation_: We stopped the bench and started it again. During the period when the bench was stopped, memory usage never dropped, while CPU usage showed a noticeable decrease.

![Image](https://github.com/user-attachments/assets/8e58be9d-c47b-45bf-8323-8a99e8784055)

Latency:

![Image](https://github.com/user-attachments/assets/22fd8cf3-34b5-487e-b0d8-ab6004614ab9)

Throttling:

![Image](https://github.com/user-attachments/assets/54382f0a-deaa-4545-b0ee-e4801320aae0)

**Profiling**


_Development_: [profiles.zip](https://github.com/user-attachments/files/21040671/profiles.zip)

_Production_: [profiling_prod.zip](https://github.com/user-attachments/files/21041257/profiling_prod.zip)



### Expected behavior

The server should sustain this load with compression enabled, without CPU throttling, ever-growing memory usage, or increasing latency.

### Server and client version

```
~ $ nats --version
v0.1.2-0.20250310115758-f4eda5b1b7a3
```

```
~ $ nats-server --version
nats-server: v2.11.4
```



### Host environment

EKS at AWS, nodes running Bottlerocket.

![Image](https://github.com/user-attachments/assets/5c184bed-b3d9-4185-b443-796c9b5dc53e)

```
~ $ cat /etc/os-release
NAME="Alpine Linux"
ID=alpine
VERSION_ID=3.22.0
PRETTY_NAME="Alpine Linux v3.22"
HOME_URL="https://alpinelinux.org/"
BUG_REPORT_URL="https://gitlab.alpinelinux.org/alpine/aports/-/issues"
```

### Steps to reproduce

This is our setup.

![Image](https://github.com/user-attachments/assets/caa7defc-64df-4cba-ad93-ef949ecddccf)

We generated the load with:

```
nats bench pub -s nats://$T@localhost:4221 test --msgs 1000000000 --clients=10 --multisubject --multisubjectmax 100000
```

We consumed messages with:

```
nats sub -s nats://$T@localhost:4220 "test.*"
```
