description	Performance and data safety

Buffering

After Fluent Bit ingests data, it temporarily stores that data in the system memory (heap) before processing and routing that data to its destination. This process is known as buffering.

graph LR
    accTitle: Fluent Bit data pipeline
    accDescr: A diagram of the Fluent Bit data pipeline, which includes input, a parser, a filter, a buffer, routing, and various outputs.
    A[Input] --> B[Parser]
    B --> C[Filter]
    C --> D[Buffer]
    D --> E((Routing))
    E --> F[Output 1]
    E --> G[Output 2]
    E --> H[Output 3]
    style D stroke:darkred,stroke-width:2px;

{% hint style="info" %} Buffered data uses the Fluent Bit internal binary representation, which isn't raw text. This buffered data is immutable. {% endhint %}

Chunks

When an input plugin emits records, the engine groups records together into a chunk. Each chunk has an average size of 2 MB. The active buffering mode determines where these chunks are stored.

Chunks that are stored simultaneously in memory and in filesystem storage are known as up chunks. Chunks that are stored only in filesystem storage are known as down chunks. A down chunk becomes an up chunk when a copy of the down chunk is written to memory.

After an up chunk is processed and routed, the associated buffered data both in memory and in the filesystem is flushed.

Irrecoverable chunks

Fluent Bit marks a chunk as irrecoverable in the following scenarios:

When Fluent Bit encounters a bad layout in a chunk. A bad layout is a chunk that doesn't conform to the expected format.
When Fluent Bit encounters an incorrect or invalid chunk header size.

After marking a chunk as irrecoverable, Fluent Bit logs an error message and then discards the irrecoverable chunk.

Buffering modes

Fluent Bit offers three modes for storing buffered data. Memory-only and memory ring buffer modes store data exclusively in memory, while filesystem buffering is a hybrid method that stores an additional copy of buffered data in the filesystem.

You can set the buffering mode for each active input plugin.

Memory-only buffering

When memory-only buffering is enabled, Fluent Bit stores buffered data in memory until it's ready to process and route that data to its intended destinations. After Fluent Bit processes and routes the data, it flushes that data from memory.

This buffering method is faster than filesystem buffering, and uses less system overhead, but is more prone to data loss.

Filesystem buffering (hybrid)

When filesystem buffering is enabled, Fluent Bit stores each chunk of buffered data in the filesystem through mmap(2). If Fluent Bit has enough space in memory, an identical chunk of that buffered data is also written to memory. If Fluent Bit doesn't have enough space in memory, the chunk of buffered data remains only in the filesystem until there is enough space to write an identical chunk to memory. After Fluent Bit processes and routes the data, it flushes that data from memory and from the filesystem.

This buffering method is less efficient than memory-only buffering, and uses more system overhead, but is less prone to data loss.

Memory ring buffer buffering

When memory ring buffer (memrb) buffering is enabled, Fluent Bit stores buffered data in a fixed-size memory ring buffer. Unlike memory-only buffering, when the ring buffer is full, Fluent Bit automatically drops the oldest chunks to make room for new data instead of pausing the input plugin. Use this mode in scenarios where you want to keep the most recent data and can tolerate loss of older data.

When chunks are dropped, Fluent Bit tracks the number of dropped chunks and bytes through the memrb_dropped_chunks and memrb_dropped_bytes metrics. You can monitor these metrics to understand how much data is being discarded.

This buffering method prioritizes continuous data ingestion over data completeness, making it suitable for high-throughput scenarios where pausing the input isn't acceptable.

Configuration settings

{% hint style="info" %} For information about different strategies for managing backpressure, see Backpressure. {% endhint %}

Use the information in this section to configure buffering settings in Fluent Bit. Global settings configure the storage layer, per-input settings define which buffering mechanism to use, and per-output settings define limits for the logical filesystem queues.

Global settings

In the service section of Fluent Bit configuration files, several settings related to buffering are stored in the storage key. These are global settings that affect all input and output plugins.

To set a default buffering mode for all input plugins, configure storage.type and enable storage.inherit in the service section. Input plugins that don't explicitly set their own storage.type will inherit the global value. The following example sets filesystem buffering as the default for all inputs, while allowing individual inputs to override the default:

{% tabs %} {% tab title="fluent-bit.yaml" %}

service:
  flush: 1
  log_level: info
  storage.path: /var/log/flb-storage/
  storage.type: filesystem
  storage.inherit: on
  storage.sync: normal
  storage.max_chunks_up: 128

pipeline:
  inputs:
    - name: cpu
      # Inherits storage.type: filesystem from service

    - name: mem
      storage.type: memory  # Overrides the inherited default

{% endtab %} {% tab title="fluent-bit.conf" %}

[SERVICE]
  Flush                     1
  Log_Level                 info
  Storage.Path              /var/log/flb-storage/
  Storage.Type              filesystem
  Storage.Inherit           on
  Storage.Sync              normal
  Storage.Max_Chunks_Up     128

[INPUT]
  Name          cpu
  # Inherits storage.type: filesystem from SERVICE

[INPUT]
  Name          mem
  Storage.Type  memory  # Overrides the inherited default

{% endtab %} {% endtabs %}

Per-input settings

You can configure buffering settings for any input plugin by using these configuration parameters:

Key	Description	Default
`mem_buf_limit`	Sets a limit for how much buffered data the plugin can write to memory. With memory-only buffering, the plugin pauses until more memory becomes available after this limit is reached. With memory ring buffer (`memrb`) buffering, the oldest chunks are dropped to make room for new data instead of pausing. This value must follow unit size specifications. If unspecified, no limit is enforced.	`0`
`storage.pause_on_chunks_overlimit`	If filesystem buffering is enabled, specifies how the input plugin should behave after the global `storage.max_chunks_up` limit is reached. When set to `off`, the plugin will stop buffering data to memory but continue buffering data to the filesystem. When set to `on`, the plugin will stop both memory buffering and filesystem buffering until more memory becomes available. Possible values: `on`, `off`.	`off`
`storage.type`	Specifies the buffering mechanism to use for this input plugin. To enable filesystem buffering, a global `storage.path` value must be set in the `service` section of your configuration file. When set to `memrb`, Fluent Bit uses a memory ring buffer that automatically drops the oldest chunks when the buffer is full to make room for new data. Accepted values: `memory`, `filesystem`, `memrb`.	`memory`

The following configuration example sets global settings in service to support filesystem buffering, then configures input plugins with filesystem buffering, memory-only buffering, and memory ring buffer buffering:

{% tabs %} {% tab title="fluent-bit.yaml" %}

service:
  flush: 1
  log_level: info
  storage.path: /var/log/flb-storage/
  storage.sync: normal
  storage.checksum: off
  storage.max_chunks_up: 128
  storage.backlog.mem_limit: 5M

pipeline:
  inputs:
    - name: cpu
      storage.type: filesystem

    - name: mem
      storage.type: memory

    - name: dummy
      storage.type: memrb
      mem_buf_limit: 10M

{% endtab %} {% tab title="fluent-bit.conf" %}

[SERVICE]
  Flush                     1
  Log_Level                 info
  Storage.Path              /var/log/flb-storage/
  Storage.Sync              normal
  Storage.Checksum          off
  Storage.Max_Chunks_Up     128
  Storage.Backlog.Mem_Limit 5M

[INPUT]
  Name          cpu
  Storage.Type  filesystem

[INPUT]
  Name          mem
  Storage.Type  memory

[INPUT]
  Name           dummy
  Storage.Type   memrb
  Mem_Buf_Limit  10M

{% endtab %} {% endtabs %}

Per-output settings

If any active input plugins use filesystem buffering, you can limit how many chunks are buffered to the filesystem based on the output plugin where Fluent Bit intends to route that chunk. To do so, use this configuration parameter:

Key	Description	Default
`storage.total_limit_size`	Sets the size of the queue for this output plugin. This queue is the number of chunks buffered to the filesystem with this output as the intended destination. If the output plugin reaches its `storage.total_limit_size` capacity, the oldest chunk from its queue will be discarded to make room for new data. This value must follow unit size specifications.	none

The following configuration example creates records with CPU usage samples in the filesystem which are delivered to Google Stackdriver service while limiting the logical queue to 5M:

{% tabs %} {% tab title="fluent-bit.yaml" %}

service:
  flush: 1
  log_level: info
  storage.path: /var/log/flb-storage/
  storage.sync: normal
  storage.checksum: off
  storage.max_chunks_up: 128
  storage.backlog.mem_limit: 5M

pipeline:
  inputs:
    - name: cpu
      storage.type: filesystem

  outputs:
    - name: stackdriver
      match: '*'
      storage.total_limit_size: 5M

{% endtab %} {% tab title="fluent-bit.conf" %}

[SERVICE]
  Flush                     1
  Log_Level                 info
  Storage.Path              /var/log/flb-storage/
  Storage.Sync              normal
  Storage.Checksum          off
  Storage.Max_Chunks_Up     128
  Storage.Backlog.Mem_Limit 5M

[INPUT]
  Name                      cpu
  Storage.Type              filesystem

[OUTPUT]
  Name                      stackdriver
  Match                     *
  Storage.Total_Limit_Size  5M

{% endtab %} {% endtabs %}

In this example, if Fluent Bit is offline because of a network issue, it will continue buffering CPU samples, keeping a maximum of 5 MB of the newest data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Buffering

Chunks

Irrecoverable chunks

Buffering modes

Memory-only buffering

Filesystem buffering (hybrid)

Memory ring buffer buffering

Configuration settings

Global settings

Per-input settings

Per-output settings

FilesExpand file tree

buffering.md

Latest commit

History

buffering.md

File metadata and controls

Buffering

Chunks

Irrecoverable chunks

Buffering modes

Memory-only buffering

Filesystem buffering (hybrid)

Memory ring buffer buffering

Configuration settings

Global settings

Per-input settings

Per-output settings