Fluentd

Fluentd is a data collector, which unifies the data collection and consumption. This integration allows you to use Fluentd to send logs to your Logz.io account.

note

Fluentd will fetch all existing logs, as it is not able to ignore older logs.

Configure Fluentd with Ruby Gems

Before you begin, you'll need: Ruby and ruby-dev 2.1 or higher

Install Fluentd and the Logz.io plugin

gem install fluentd fluent-plugin-logzio

Set up Fluentd

fluentd --setup ./fluent

Add an input plugin

Add a required input plugin to the configuration file. You can find the list of available input plugins here.

An example input plugin looks as follows:

<source>
  @type tail
  path /var/log/httpd-access.log
  pos_file /var/log/td-agent/httpd-access.log.pos
  tag apache.access
  <parse>
    @type apache2
  </parse>
</source>

Configure Fluentd with Logz.io output

Add this code block to your Fluent configuration file (fluent.conf by default).

See the configuration parameters below the code block.👇

<match **>
  @type logzio_buffered
  endpoint_url https://<<LISTENER-HOST>>:8071?token=<<LOG-SHIPPING-TOKEN>>&type=my_type
  output_include_time true
  output_include_tags true
  http_idle_timeout 10
  <buffer>
      @type memory
      flush_thread_count 4
      flush_interval 3s
      chunk_limit_size 16m
      queue_limit_length 4096
  </buffer>
</match>

Parameters

Parameter	Description
endpoint_url	A url composed of your Logz.io region's listener URL, account token, and log type. Replace `<<LISTENER-HOST>>` with the host for your region. For example, `listener.logz.io` if your account is hosted on AWS US East, or `listener-nl.logz.io` if hosted on Azure West Europe. The required port depends whether HTTP or HTTPS is used: HTTP = 8070, HTTPS = 8071.
output_include_time	To add a timestamp to your logs when they're processed, `true` (recommended). Otherwise, `false`.
output_include_tags	To add the `fluentd` tag to logs, `true`. Otherwise, `false`. If `true`, use in combination with `output_tags_fieldname`.
output_tags_fieldname	If `output_include_tags` is `true`, sets output tag's field name. The default is `fluentd_tag`
http_idle_timeout	Time, in seconds, that the HTTP connection will stay open without traffic before timing out.
retry_count	Counter of the times to resend failed bulks. The default is `4`.
retry_sleep	Interval in seconds to sleep initially between retries, exponential step-off. The default is `2s`.
bulk_limit	Limit to the size of the Logz.io upload bulk. Defaults to 1000000 bytes, leaving about 24kB for overhead.
bulk_limit_warning_limit	Limit to the size of the Logz.io warning message when a record exceeds bulk_limit to prevent a recursion when Fluent warnings are sent to the Logz.io output. The default is `nil` (no truncation).
proxy_uri	Your proxy uri. The default is `nil`. For example: "my.ip:12345".
proxy_cert	Your proxy cert. The default is `nil`.
gzip	Defines if the plugin needs to ship the logs in gzip compression. The default is `false`.

Run Fluentd

fluentd -c ./fluent/fluent.conf -vv

Check Logz.io for your logs

Give your logs some time to get from your system to ours, and then open Open Search Dashboards.

If you still don't see your logs, see log shipping troubleshooting.

Fluentd can receive and concatenate multiline logs. To do this, you need to add a parser and concatenation plugin to your Fluentd configuration.

Add multiline parser to your input plugin

note

Multiline parsing only works with in_tail plugins. Refer to the Fluentd documentation for more information.

Fluentd splits multiline logs by default. If your original logs span multiple lines, you may find that they arrive in your Logz.io account split into several partial logs.

The Logz.io Docker image comes with a pre-built Fluentd filter plug-in that can be used to concatenate multiline logs. The plug-in is named fluent-plugin-concat, and it concatenate multiline log separated in multiple events. You can review the full list of configuration options in the GitHub project.

The following is an example of a multiline log sent from a deployment on a k8s cluster:

2021-02-08 09:37:51,031 - errorLogger - ERROR - Traceback (most recent call last):
File "./code.py", line 25, in my_func
1/0
ZeroDivisionError: division by zero

Fluentd's default configuration will split the above log into 4 logs, 1 for each line of the original log. In other words, each line break (\n) causes a split.

To avoid this, you can use the fluent-plugin-concat and customize the configuration to meet your needs. The additional configuration is added to the values.yml file.

For the above example, we could use the following regex expressions to demarcate the start and end of our example log:

<filter **>
  @type concat
  key message # The key for part of multiline log
  multiline_start_regexp /^[0-9]{4}-[0-9]{2}-[0-9]{2}/ # This regex expression identifies line starts.
</filter>

To parse multiline, you'll need the in_tail input plugin that allows Fluentd to read events from the tail of text files.

Add the following code block to your in_tail plugin:

<parse>
  @type multiline
  format_firstline /^<<YOUR-REGEX-PATTERN>>/
</parse>

Replace <<YOUR-REGEX-PATTERN>> with the definition of your Regex pattern. You can use regex101 to define it.

The indentation of the parse plugin must be one level under the tail function as in the example below:

<source>
  @type tail
  path /var/log/httpd-access.log
  pos_file /var/log/td-agent/httpd-access.log.pos
  tag apache.access
    <parse>
      @type multiline
      format_firstline /\d{4}-\d{1,2}-\d{1,2}/
      format1 /^(?<time>\d{4}-\d{1,2}-\d{1,2} \d{1,2}:\d{1,2}:\d{1,2}) \[(?<thread>.*)\] (?<level>[^\s]+)(?<message>.*)/
    </parse>
</source>

note

Fluentd will fetch all existing logs, as it is not able to ignore older logs.

Configure Fluentd with td-agent for Windows

Before you begin, you'll need: Ruby and ruby-dev 2.1 or higher

Install Fluentd td-agent

Navigate to the downloads page of td-agent and download the latest version of the installer. After that, run the installer and follow the wizard instructions.

Install the Logz.io plugin

gem install fluentd fluent-plugin-logzio

Set up td-agent.conf

Open C:/opt/td-agent/etc/td-agent/td-agent.conf and replace its content with the following configuration:

<source>
  @type windows_eventlog2
  @id windows_eventlog2
  channels application,system,security
  read_existing_events false
  tag winevt.raw
  rate_limit 200
  <storage>
    @type local
    persistent true
    path C:\opt\td-agent\winlog.json
  </storage>
</source>

<match **>
  @type logzio_buffered
  endpoint_url https://<<LISTENER-HOST>>:8071?token=<<LOG-SHIPPING-TOKEN>>&type=<<LOG-TYPE>>
  output_include_time true
  output_include_tags true
  http_idle_timeout 10
  <buffer>
      @type memory
      flush_thread_count 4
      flush_interval 3s
      chunk_limit_size 16m
      queue_limit_length 4096
  </buffer>
</match>

Parameters

Parameter	Description
endpoint_url	A url composed of your Logz.io region's listener URL, account token, and log type. Replace `<<LISTENER-HOST>>` with the host for your region. For example, `listener.logz.io` if your account is hosted on AWS US East, or `listener-nl.logz.io` if hosted on Azure West Europe. The required port depends whether HTTP or HTTPS is used: HTTP = 8070, HTTPS = 8071. Replace `<<LOG-SHIPPING-TOKEN>>` with the token of the account you want to ship to.
type	Log type. If required, replace `<<LOG_TYPE>>` with the desired name for the log type, the default value is `fluentbit`
output_include_time	To add a timestamp to your logs when they're processed, `true` (recommended). Otherwise, `false`.
output_include_tags	To add the `fluentd` tag to logs, `true`. Otherwise, `false`. If `true`, use in combination with `output_tags_fieldname`.
output_tags_fieldname	If `output_include_tags` is `true`, sets output tag's field name. The default is `fluentd_tag`
http_idle_timeout	Time, in seconds, that the HTTP connection will stay open without traffic before timing out.

Run Fluentd td-agent

Open Td-agent Command Prompt from the Windows Start menu and run the following command:

C:\opt\td-agent> td-agent

Check Logz.io for your logs

Give your logs some time to get from your system to ours, and then open Open Search Dashboards.

If you still don't see your logs, see log shipping troubleshooting.

Fluentd can receive and concatenate multiline logs. To do this, you need to add a parser and concatenation plugin to your Fluentd configuration.

Add multiline parser to your input plugin

note

Multiline parsing only works with in_tail plugins. Refer to the Fluentd documentation for more on this.

Fluentd splits multiline logs by default. If your original logs span multiple lines, you may find that they arrive in your Logz.io account split into several partial logs.

The following is an example of a multiline log sent from a deployment on a k8s cluster:

2021-02-08 09:37:51,031 - errorLogger - ERROR - Traceback (most recent call last):
File "./code.py", line 25, in my_func
1/0
ZeroDivisionError: division by zero

Fluentd's default configuration will split the above log into 4 logs, 1 for each line of the original log. In other words, each line break (\n) causes a split.

To avoid this, you can use the fluent-plugin-concat and customize the configuration to meet your needs. The additional configuration is added to the values.yml file.

For the above example, we could use the following regex expressions to demarcate the start and end of our example log:

<filter **>
  @type concat
  key message # The key for part of multiline log
  multiline_start_regexp /^[0-9]{4}-[0-9]{2}-[0-9]{2}/ # This regex expression identifies line starts.
</filter>

To parse multiline, you'll need the in_tail input plugin that allows Fluentd to read events from the tail of text files.

Add the following code block to your in_tail plugin:

<parse>
  @type multiline
  format_firstline /^<<YOUR-REGEX-PATTERN>>/
</parse>

Replace <<YOUR-REGEX-PATTERN>> with the definition of your Regex pattern. You can use regex101 to define it.

The indentation of the parse plugin must be one level under the tail function as in the example below:

<source>
  @type tail
  path /var/log/httpd-access.log
  pos_file /var/log/td-agent/httpd-access.log.pos
  tag apache.access
    <parse>
      @type multiline
      format_firstline /\d{4}-\d{1,2}-\d{1,2}/
      format1 /^(?<time>\d{4}-\d{1,2}-\d{1,2} \d{1,2}:\d{1,2}:\d{1,2}) \[(?<thread>.*)\] (?<level>[^\s]+)(?<message>.*)/
    </parse>
</source>

Docker logs with Fluentd

Architecture overview

This integration includes:

Pulling a Docker image of containerized Fluentd
Configuring and running containerized Fluentd

Integration architecture Fluentd on Docker

Upon deployment, each container on your host system, including the Fluentd container, writes logs to a dedicated log file. Fluentd fetches the log data from this file and ships the data over HTTP or HTTPS to your Logz.io account, either via an optional proxy sever or directly.

note

Project's GitHub repo

Before you begin, you'll need: Docker installed on your host system

Pull the Docker image for containerized Fluentd

docker pull logzio/fluentd-docker-logs

Start the container

Run the following command:

docker run -it --rm \
--name fluentd-docker-logs \
-v $(pwd)/log:/fluentd/log \
-v /var/lib/docker/containers:/var/lib/docker/containers \
-v /var/run/docker.sock:/var/run/docker.sock:ro \
-p 5001:5001 \
-e LOGZIO_LOG_LISTENER="https://<<LISTENER-HOST>>:8071" \
-e LOGZIO_LOG_SHIPPING_TOKEN=<<LOG-SHIPPING-TOKEN>> \
-e LOGZIO_TYPE=docker-fluentd \
logzio/fluentd-docker-logs

Replace <<LISTENER-HOST>> with the host for your region. For example, listener.logz.io if your account is hosted on AWS US East, or listener-nl.logz.io if hosted on Azure West Europe. The required port depends whether HTTP or HTTPS is used: HTTP = 8070, HTTPS = 8071. Replace <<LOG-SHIPPING-TOKEN>> with the token of the account you want to ship to.

If you need to send the logs via a proxy server:

Add -e LOGZIO_PROXY_URI=<<YOUR-PROXY-URI>> to the above command and replace <<YOUR-PROXY-URI>> with your proxy URI.
Add -e LOGZIO_PROXY_CERT=<<YOUR-PROXY-CERTIFICATE>> to the above commad and replace <<YOUR-PROXY-CERTIFICATE>> with your proxy certificate value.

Check Logz.io for your logs

Give your logs some time to get from your system to ours, and then open Open Search Dashboards. You can filter for data of type docker-fluentd to see the incoming container logs.

If you still don’t see your data, see log shipping troubleshooting.

Advanced settings

If you need to customize the default settings of the configuration parameters, add any of the following lines to the command:

Parameter	Description	Default
`-e LOGZIO_INCLUDE_REGEX=<<LOGZIO_INCLUDE_REGEX>> \`	If a container name does not match the Regex, logs from this container will not be shipped.	`.+`
`-e LOGZIO_SLOW_FLUSH_LOG_THRESHOLD=<<LOGZIO_SLOW_FLUSH_LOG_THRESHOLD>> \`	Specifies the threshold for chunk flush performance check.	`20.0`
`-e LOGZIO_BUFFER_TYPE=<<LOGZIO_BUFFER_TYPE>> \`	Specifies which plugin to use as the backend.	`file`
`-e LOGZIO_BUFFER_PATH=<<LOGZIO_BUFFER_PATH>> \`	Specifies the path to the backend plugin.	`/var/log/Fluentd-buffers/stackdriver.buffer`
`-e LOGZIO_OVERFLOW_ACTION=<<LOGZIO_OVERFLOW_ACTION>> \`	Specifies the parameter that controls the behavior when the queue becomes full. Refer to documentation on Fluentd for more on this.	`block`
`-e LOGZIO_CHUNK_LIMIT_SIZE=<<LOGZIO_CHUNK_LIMIT_SIZE>> \`	Specifies the maximum size of a chunk allowed.	`2M`
`-e LOGZIO_QUEUE_LIMIT_LENGTH=<<LOGZIO_QUEUE_LIMIT_LENGTH>> \`	Specifies the maximum length of the output queue.	`6`
`-e LOGZIO_FLUSH_INTERVAL=<<LOGZIO_FLUSH_INTERVAL>> \`	Specifies the interval, in seconds, to wait before invoking the next buffer flush.	`5s`
`-e LOGZIO_RETRY_MAX_INTERVAL=<<LOGZIO_RETRY_MAX_INTERVAL>> \`	Specifies the maximum interval, in seconds, to wait between retries.	`30s`
`-e LOGZIO_FLUSH_THREAD_COUNT=<<LOGZIO_FLUSH_THREAD_COUNT>> \`	Specifies the number of threads to flush the buffer.	`2`
`-e LOGZIO_LOG_LEVEL=<<LOGZIO_LOG_LEVEL>> \`	Specifies the log level for this container.	`info`
`ADDITIONAL_FIELDS`	Include additional fields with every message sent, formatted as `fieldName1=fieldValue1,fieldName2=fieldValue2`	`nil`

Ship Kubernetes logs with Fluentd

See the full documentation for shipping your Kubernetes logs with Fluentd here.

Configure Fluentd with Ruby Gems​

Install Fluentd and the Logz.io plugin​

Set up Fluentd​

Add an input plugin​

Configure Fluentd with Logz.io output​

Parameters​

Run Fluentd​

Check Logz.io for your logs​

Add multiline parser to your input plugin​

Configure Fluentd with td-agent for Windows​

Install Fluentd td-agent​

Install the Logz.io plugin​

Set up td-agent.conf​

Parameters​

Run Fluentd td-agent​

Check Logz.io for your logs​

Add multiline parser to your input plugin​

Docker logs with Fluentd​

Architecture overview​

Pull the Docker image for containerized Fluentd​

Start the container​

Check Logz.io for your logs​

Advanced settings​

Ship Kubernetes logs with Fluentd​

Configure Fluentd with Ruby Gems

Install Fluentd and the Logz.io plugin

Set up Fluentd

Add an input plugin

Configure Fluentd with Logz.io output

Parameters

Run Fluentd

Check Logz.io for your logs

Add multiline parser to your input plugin

Configure Fluentd with td-agent for Windows

Install Fluentd td-agent

Install the Logz.io plugin

Set up td-agent.conf

Parameters

Run Fluentd td-agent

Check Logz.io for your logs

Add multiline parser to your input plugin

Docker logs with Fluentd

Architecture overview

Pull the Docker image for containerized Fluentd

Start the container

Check Logz.io for your logs

Advanced settings

Ship Kubernetes logs with Fluentd