
Fluentd Integration with Grafana Loki

Introduction

Fluentd is an open-source data collector that lets you unify data collection and consumption for better use and understanding of data. In this guide, we'll explore how to integrate Fluentd with Grafana Loki to create a powerful logging pipeline.

Fluentd acts as a unified logging layer, collecting logs from various sources, processing them, and then forwarding them to Loki for storage and querying. This integration is particularly useful in microservices and containerized environments where log management can become complex.

Prerequisites

Before we begin, make sure you have:

  • Grafana Loki set up and running
  • Basic understanding of logging concepts
  • Fluentd installed (we'll cover this in the next section)

Installing Fluentd

Let's start by installing Fluentd on your system:

bash
# For Debian/Ubuntu
sudo apt-get install -y build-essential ruby-dev
sudo gem install fluentd --no-doc

# For RHEL/CentOS
sudo yum install -y ruby-devel gcc
sudo gem install fluentd --no-doc

# Alternatively, use the td-agent package (recommended for production)
# See: https://docs.fluentd.org/installation

Once installed, verify your installation:

bash
fluentd --version

Expected output (your version number may differ):

fluentd 1.15.3

Installing the Loki Plugin for Fluentd

To send logs from Fluentd to Loki, we need to install the Fluentd Loki plugin:

bash
gem install fluent-plugin-grafana-loki

Or if you're using td-agent:

bash
td-agent-gem install fluent-plugin-grafana-loki

Basic Configuration

Now let's create a basic Fluentd configuration file that collects logs and forwards them to Loki:

xml
<source>
  @type tail
  path /var/log/syslog
  pos_file /var/log/td-agent/syslog.pos
  tag system.syslog
  <parse>
    @type syslog
  </parse>
</source>

<match system.**>
  @type loki
  url "http://loki:3100"
  extra_labels {"job": "fluentd"}
  <buffer>
    flush_interval 10s
    flush_at_shutdown true
    chunk_limit_size 1m
  </buffer>
</match>

This configuration:

  1. Collects logs from /var/log/syslog
  2. Parses them using the syslog format
  3. Forwards logs with the tag system.** to Loki running at http://loki:3100
  4. Adds an extra label job: fluentd to all logs
  5. Sets buffer and flush parameters for performance

Save this configuration to a file (e.g., fluentd-loki.conf).
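Under the hood, the plugin batches buffered records and POSTs them to Loki's push API on each flush. A minimal Python sketch of that payload shape (field names follow Loki's /loki/api/v1/push JSON format; note timestamps are nanosecond strings):

```python
import json
import time

def build_loki_payload(labels, lines):
    """Build a Loki push-API payload: one stream per label set,
    each value a [nanosecond-timestamp, line] pair."""
    now_ns = str(time.time_ns())
    return {
        "streams": [
            {
                "stream": labels,
                "values": [[now_ns, line] for line in lines],
            }
        ]
    }

payload = build_loki_payload({"job": "fluentd"}, ["Oct  1 12:00:00 host app: started"])
print(json.dumps(payload, indent=2))
```

Sending this body as a POST to http://loki:3100/loki/api/v1/push with Content-Type: application/json is essentially what the plugin does for you on every flush interval.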

Advanced Configuration

Adding Labels from Log Content

One of the powerful features of Loki is its label system. We can extract fields from our logs to create labels:

xml
<match system.**>
  @type loki
  url "http://loki:3100"
  extra_labels {"job": "fluentd"}

  # Keys name the labels; values are record accessors
  # (these assume your parsed records contain such fields)
  <label>
    hostname $.hostname
    severity $.level
    application $.application
  </label>

  <buffer>
    flush_interval 10s
    chunk_limit_size 1m
    flush_at_shutdown true
  </buffer>
</match>

Processing Logs with Filters

Fluentd allows you to process logs before sending them to Loki:

xml
<filter system.**>
  @type record_transformer
  <record>
    hostname "#{Socket.gethostname}"
    environment "production"
  </record>
</filter>

<filter system.**>
  @type grep
  <exclude>
    key message
    pattern /health check/
  </exclude>
</filter>

This configuration:

  1. Adds hostname and environment fields to each log record
  2. Filters out any logs containing "health check" in the message
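The grep exclude rule behaves like a simple regex drop: any record whose message field matches the pattern never reaches the output. A small Python sketch of that behavior (hypothetical records, not the plugin's actual code):

```python
import re

HEALTH_CHECK = re.compile(r"health check")

def exclude_health_checks(records):
    """Drop records whose 'message' field matches the pattern,
    mirroring the grep filter's <exclude> rule."""
    return [r for r in records if not HEALTH_CHECK.search(r.get("message", ""))]

records = [
    {"message": "GET /status health check ok"},
    {"message": "user login succeeded"},
]
print(exclude_health_checks(records))  # → [{'message': 'user login succeeded'}]
```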

Collecting Docker Logs

In containerized environments, you'll likely want to collect logs from Docker containers:

xml
<source>
  @type forward
  port 24224
  bind 0.0.0.0
</source>

<match docker.**>
  @type loki
  url "http://loki:3100"
  extra_labels {"job": "docker"}

  # container_name and container_id come from Docker's fluentd log driver
  # (the driver does not send an image name by default)
  <label>
    container_name $.container_name
    container_id $.container_id
  </label>

  <buffer>
    flush_interval 10s
    chunk_limit_size 1m
    flush_at_shutdown true
  </buffer>
</match>

To use this setup, configure your Docker containers to use the Fluentd logging driver:

bash
docker run --log-driver=fluentd \
  --log-opt fluentd-address=localhost:24224 \
  --log-opt tag=docker.{{.Name}} \
  your-image

Note the tag option: without it, the driver tags events with the container ID, which would not match the docker.** pattern above.
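With the fluentd log driver, each log line arrives at the forward input as a record carrying the container metadata. The field names below follow Docker's documented driver output; the values are illustrative:

```python
# Shape of a record emitted by Docker's fluentd logging driver
# (log, container_id, container_name, source), per Docker's docs.
sample_record = {
    "log": "listening on port 3000",
    "container_id": "9f2c0a1b3d4e",   # full 64-char ID in practice
    "container_name": "/web-app",
    "source": "stdout",
}

def record_labels(record, keys=("container_name", "container_id")):
    """Pick the metadata fields that can be promoted to Loki labels."""
    return {k: record[k] for k in keys if k in record}

print(record_labels(sample_record))
```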

Visual Representation of the Fluentd-Loki Integration

Conceptually, the pipeline looks like this:

Log sources (files, syslog, Docker) → Fluentd (collect, parse, filter) → Loki (store, index by labels) → Grafana (query with LogQL)

Complete Example Configuration

Let's put everything together in a complete configuration file:

xml
# Input plugins - collect logs from different sources
<source>
  @type tail
  path /var/log/syslog
  pos_file /var/log/td-agent/syslog.pos
  tag system.syslog
  <parse>
    @type syslog
  </parse>
</source>

<source>
  @type tail
  path /var/log/app/*.log
  pos_file /var/log/td-agent/app.pos
  tag application.logs
  <parse>
    @type json
  </parse>
</source>

# Tags for forwarded events are set by the sender
# (e.g. Docker's --log-opt tag=docker.{{.Name}})
<source>
  @type forward
  port 24224
  bind 0.0.0.0
</source>

# Processing filters
<filter **>
  @type record_transformer
  <record>
    hostname "#{Socket.gethostname}"
    environment "production"
    tag ${tag}
  </record>
</filter>

<filter application.logs>
  @type grep
  <exclude>
    key log
    pattern /DEBUG/
  </exclude>
</filter>

# Output to Loki
<match **>
  @type loki
  url "http://loki:3100"

  # Keys name the labels; values are record accessors.
  # Optional: extract labels from nested JSON log fields, too.
  <label>
    job fluentd
    hostname $.hostname
    environment $.environment
    tag $.tag
    container_name $.container.name
    severity $.level
  </label>

  <buffer>
    @type memory
    flush_interval 10s
    chunk_limit_size 1m
    flush_at_shutdown true
    retry_max_interval 30
    retry_forever true
  </buffer>
</match>

Troubleshooting

If you're having issues with your Fluentd-Loki integration, here are some troubleshooting tips:

  1. Check Fluentd logs:

    bash
    tail -f /var/log/td-agent/td-agent.log
  2. Verify connectivity to Loki:

    bash
    curl -v http://loki:3100/ready
  3. Enable debug logging in Fluentd: Add this to your configuration:

    xml
    <system>
      log_level debug
    </system>
  4. Check Loki logs for ingestion errors:

    bash
    kubectl logs -l app=loki -c loki
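If curl isn't available where Fluentd runs, the same readiness check can be scripted. A small Python sketch (assumes Loki's /ready endpoint, which answers HTTP 200 once the instance is up):

```python
from urllib.request import urlopen
from urllib.error import URLError

def loki_is_ready(base_url="http://loki:3100", timeout=5):
    """Return True if Loki's /ready endpoint answers with HTTP 200."""
    try:
        with urlopen(f"{base_url}/ready", timeout=timeout) as resp:
            return resp.status == 200
    except (URLError, OSError):
        return False
```

This can be dropped into a startup script or cron job to alert when the ingestion endpoint disappears.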

Real-World Use Case: Monitoring Web Application Logs

Let's walk through a practical example of monitoring a web application using Fluentd and Loki.

Scenario

You have a Node.js web application running in Docker, and you want to:

  1. Collect application logs
  2. Parse JSON log output
  3. Add metadata
  4. Send to Loki for monitoring and alerting

Step 1: Configure Your Application Logging

First, ensure your application is logging in JSON format:

javascript
// Node.js example using Winston
const winston = require('winston');

const logger = winston.createLogger({
  level: 'info',
  format: winston.format.json(),
  defaultMeta: { service: 'web-service' },
  transports: [
    new winston.transports.Console(),
    new winston.transports.File({ filename: '/var/log/app/combined.log' })
  ]
});

// Example log
logger.info('User logged in', { userId: '123', action: 'login' });
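Each line Winston writes to combined.log is a single JSON object, which is exactly what the json parser in the next step expects. A quick Python sketch of that parse (the sample line mirrors the logger call above; exact field order is illustrative):

```python
import json

# One log line as Winston's json format would emit it.
line = '{"level":"info","message":"User logged in","service":"web-service","userId":"123","action":"login"}'

record = json.loads(line)
print(record["service"], record["level"])  # → web-service info
```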

Step 2: Create Fluentd Configuration

xml
# Input: Tail application logs
<source>
  @type tail
  path /var/log/app/combined.log
  pos_file /var/log/td-agent/app.pos
  tag app.nodejs
  <parse>
    @type json
  </parse>
</source>

# Process: Add metadata
<filter app.nodejs>
  @type record_transformer
  <record>
    environment "production"
    hostname "#{Socket.gethostname}"
    app_version "1.0.0"
  </record>
</filter>

# Output: Send to Loki
<match app.nodejs>
  @type loki
  url "http://loki:3100"

  # Keys name the labels; values are record accessors
  <label>
    job web_application
    environment $.environment
    hostname $.hostname
    service $.service
  </label>

  <buffer>
    flush_interval 5s
    chunk_limit_size 1m
    flush_at_shutdown true
  </buffer>
</match>

Step 3: Set Up Docker Compose

yaml
version: '3'

services:
  web-app:
    image: your-nodejs-app
    volumes:
      - /var/log/app:/var/log/app
    # other configuration...

  fluentd:
    # Use an image with fluent-plugin-grafana-loki installed
    # (e.g. a custom build on top of fluent/fluentd:v1.15)
    image: fluent/fluentd:v1.15
    volumes:
      - ./fluentd/conf:/fluentd/etc
      - /var/log:/var/log
    environment:
      - FLUENTD_CONF=fluentd-loki.conf
    ports:
      - "24224:24224"
    depends_on:
      - loki

  loki:
    image: grafana/loki:2.7.0
    ports:
      - "3100:3100"
    # other configuration...

  grafana:
    image: grafana/grafana:9.3.0
    ports:
      - "3000:3000"
    depends_on:
      - loki
    # other configuration...

Step 4: Create LogQL Queries in Grafana

Once your logs are flowing into Loki, you can create queries in Grafana:

  1. Count of errors over time:

    sum(count_over_time({job="web_application"} |= "error" [5m])) by (service)
  2. Find all login attempts:

    {job="web_application"} |= "login" | json | line_format "{{.userId}} logged in at {{.timestamp}}"
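The same LogQL can also be run outside Grafana against Loki's HTTP query API. A sketch of building such a request in Python (endpoint and parameter names follow Loki's /loki/api/v1/query_range API):

```python
from urllib.parse import urlencode

def query_range_url(base_url, logql, limit=100):
    """Build a Loki query_range URL for a LogQL expression."""
    params = urlencode({"query": logql, "limit": limit})
    return f"{base_url}/loki/api/v1/query_range?{params}"

url = query_range_url("http://loki:3100", '{job="web_application"} |= "error"')
print(url)
```

Fetching that URL returns a JSON body with the matching streams, which is handy for scripted checks alongside the Grafana panels.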

Summary

In this guide, we've covered:

  • How to install and configure Fluentd for log collection
  • Installing the Grafana Loki plugin for Fluentd
  • Basic and advanced configuration options for sending logs to Loki
  • Filtering and processing logs before shipment
  • Collecting logs from various sources including Docker containers
  • Troubleshooting common issues
  • A real-world example of monitoring a web application

Fluentd provides a flexible and powerful way to collect and process logs before sending them to Loki for storage and analysis. By leveraging Fluentd's extensive plugin ecosystem and Loki's efficient log storage and querying capabilities, you can build a comprehensive logging solution for any application or infrastructure.

Exercises

  1. Set up Fluentd to collect logs from a web server (Apache or Nginx) and forward them to Loki.
  2. Create a Fluentd configuration that parses multiline logs from a Java application and sends them to Loki.
  3. Configure Fluentd to add custom labels based on the content of your logs.
  4. Set up a Grafana dashboard that displays error rates and response times from your application logs.
  5. Implement log rotation and clean-up policies to manage your log storage efficiently.

