On some of our long-running clusters we are noticing that the storage path configured under the [SERVICE] section in Fluent Bit has been filling up to in excess of 200G, causing nodes to be removed due to lack of storage. We attempted to override storage.total_limit_size to remedy this, but that does not appear to be a value that is passed through for that section.
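For context, the directory that grows is the one pointed at by storage.path in the [SERVICE] section (on our nodes that is /var/log/flb-storage, per the check further down), disk buffering is enabled per-input via storage.type filesystem, and storage.total_limit_size is only valid as a per-[OUTPUT] property. A rough sketch of the relevant pieces, assuming the chart exposes `service` and `inputs` the same way it exposes `outputs`; the exact contents in the package may differ:

```yaml
fluentbit:
  values:
    config:
      service: |
        [SERVICE]
            # On-disk buffer location that was growing past 200G on our nodes
            storage.path /var/log/flb-storage/
            # Note: storage.total_limit_size is NOT valid here; it is a per-[OUTPUT] property
      inputs: |
        [INPUT]
            Name tail
            Path /var/log/containers/*.log
            # Chunks for this input are buffered on disk under storage.path
            storage.type filesystem
```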
@justinguidry11
storage.total_limit_size looks to be the value needed to alleviate this issue. Could you test passing the following through to Fluent Bit and let us know if it works as intended:
That value needs to go in the output configuration, and as a result the entire outputs block needs to be included in the override or it will be replaced with just the single value.
```yaml
fluentbit:
  values:
    config:
      outputs: |
        [OUTPUT]
            Name es
            Match kube.*
            # -- Pointing to Elasticsearch service installed by ECK, based off EK name "logging-ek", update elasticsearch.name above to update.
            Host {{ .Values.elasticsearch.name }}-es-http
            HTTP_User elastic
            HTTP_Passwd ${FLUENT_ELASTICSEARCH_PASSWORD}
            Logstash_Format On
            Retry_Limit False
            Replace_Dots On
            tls On
            tls.verify On
            tls.ca_file /etc/elasticsearch/certs/ca.crt
            storage.total_limit_size 2G
        [OUTPUT]
            Name es
            Match host.*
            # -- Pointing to Elasticsearch service installed by ECK, based off EK name "logging-ek", update elasticsearch.name above to update.
            Host {{ .Values.elasticsearch.name }}-es-http
            HTTP_User elastic
            HTTP_Passwd ${FLUENT_ELASTICSEARCH_PASSWORD}
            Logstash_Format On
            Logstash_Prefix node
            Retry_Limit False
            tls On
            tls.verify On
            tls.ca_file /etc/elasticsearch/certs/ca.crt
            storage.total_limit_size 2G
```
We will also be testing and working towards implementing something similar for the package and any info you can report back will help a lot! Thanks.
The config @ryan.j.garcia gave worked just fine; the only thing I did on our side was lower the size limit to 50M, as shown here:
```yaml
fluentbit:
  values:
    config:
      outputs: |
        [OUTPUT]
            Name es
            Match kube.*
            # -- Pointing to Elasticsearch service installed by ECK, based off EK name "logging-ek", update elasticsearch.name above to update.
            Host {{ .Values.elasticsearch.name }}-es-http
            HTTP_User elastic
            HTTP_Passwd ${FLUENT_ELASTICSEARCH_PASSWORD}
            Logstash_Format On
            Retry_Limit False
            Replace_Dots On
            tls On
            tls.verify On
            tls.ca_file /etc/elasticsearch/certs/ca.crt
            storage.total_limit_size 50M
        [OUTPUT]
            Name es
            Match host.*
            # -- Pointing to Elasticsearch service installed by ECK, based off EK name "logging-ek", update elasticsearch.name above to update.
            Host {{ .Values.elasticsearch.name }}-es-http
            HTTP_User elastic
            HTTP_Passwd ${FLUENT_ELASTICSEARCH_PASSWORD}
            Logstash_Format On
            Logstash_Prefix node
            Retry_Limit False
            tls On
            tls.verify On
            tls.ca_file /etc/elasticsearch/certs/ca.crt
            storage.total_limit_size 50M
```
It did require running `watch "du -sh /var/log/flb-storage"` on nodes that had this storage location to confirm it was working as intended.
Any thoughts here on a sane default value (I'm thinking something like 10G)? We don't want to go too small, because that means log messages may get discarded/lost once the limit is hit. We don't want to go too big either, so that a smaller host node filesystem isn't filled up.
@michaelmartin I think 8-10G is fine. We should also call out in the main README that this value is present to prevent that directory from getting too big, and explain how to update it if needed (with a YAML example).
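Something along these lines could work for the README snippet; this is only a sketch using the proposed 10G default, and as noted above the override has to carry the entire outputs block, so the host.* [OUTPUT] would need the same treatment:

```yaml
fluentbit:
  values:
    config:
      outputs: |
        [OUTPUT]
            Name es
            Match kube.*
            Host {{ .Values.elasticsearch.name }}-es-http
            HTTP_User elastic
            HTTP_Passwd ${FLUENT_ELASTICSEARCH_PASSWORD}
            Logstash_Format On
            Retry_Limit False
            Replace_Dots On
            tls On
            tls.verify On
            tls.ca_file /etc/elasticsearch/certs/ca.crt
            # Caps the on-disk buffer for this output so storage.path cannot fill the node
            storage.total_limit_size 10G
        # ...repeat the host.* [OUTPUT] block from the examples above with the same storage.total_limit_size
```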