Welcome to Day 74 of the #90DaysOfDevOps Challenge. Today, we will explore how to connect Linux and Windows EC2 instances with Grafana, enabling us to monitor various components of the servers. We’ll use Grafana, Prometheus, and Node Exporter to achieve this. So, let’s dive in!
Grafana: A Powerful Monitoring Tool
Grafana is not just an ordinary monitoring system; it is a feature-rich and highly versatile platform that empowers DevOps engineers to gain deep insights into the performance and health of their systems. With its intuitive and user-friendly interface, Grafana allows you to collect and visualize metrics from various sources effortlessly. Whether you need to monitor server metrics, application performance, or business analytics, Grafana has got you covered.
Prometheus: The Data Scraping Wizard
Prometheus is an exceptional open-source monitoring and alerting toolkit that specializes in handling time-series data like no other. With its unique pull model architecture, Prometheus can seamlessly scrape metrics from a wide range of targets, including the Node Exporter running on our EC2 instances. Its ability to scale and handle massive amounts of data while maintaining exceptional performance sets it apart as a true data scraping wizard.
Node Exporter: Metrics for Our EC2 Instances
Node Exporter acts as a dedicated Prometheus exporter, gathering crucial system-level metrics from our Linux EC2 instances. This powerful tool enables us to capture essential performance indicators such as CPU usage, memory consumption, disk space utilization, and network statistics. By harnessing Node Exporter’s capabilities, we can comprehensively monitor the health and resource utilization of our EC2 instances, ensuring they operate at peak efficiency and performance levels.
Task: Connecting EC2 Instances with Grafana
Now that we have a brief overview of the tools we’ll be using, let’s go ahead and connect our Linux EC2 instances with Grafana. We’ll follow these steps:
Install Grafana and Prometheus on an EC2 Instance:
Launch a new EC2 instance with Ubuntu or any other supported Linux distribution.
- Use the below shell script to install docker and ensure you can follow the next steps. This applies to all the instances we’ll use today.
#!/bin/bash
# Update package lists
sudo apt update
# Install required packages to use repositories over HTTPS
sudo apt install -y apt-transport-https ca-certificates curl software-properties-common
# Add Docker repository GPG key
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo gpg --dearmor -o /usr/share/keyrings/docker-archive-keyring.gpg
# Add Docker repository
echo "deb [arch=amd64 signed-by=/usr/share/keyrings/docker-archive-keyring.gpg] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" | sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
# Update package lists again with the Docker repository
sudo apt update
# Install Docker
sudo apt install -y docker-ce docker-ce-cli containerd.io
# Start Docker
sudo systemctl start docker
# Enable Docker to start on boot
sudo systemctl enable docker
# Add $USER to the docker group
sudo usermod -aG docker $USER
# Restart system to apply the changes
sudo reboot
- SSH into the EC2 instance and install Grafana using the Official Documentation for your OS. In my case, I will use Docker Compose. I’ll also follow the Prometheus Official Documentation to instal
---
version: '3'
volumes:
prometheus-data:
driver: local
grafana-data:
driver: local
services:
grafana:
image: grafana/grafana-oss:latest
container_name: grafana
ports:
- "3000:3000"
volumes:
- grafana-data:/var/lib/grafana
restart: unless-stopped
prometheus:
image: prom/prometheus:latest
container_name: prometheus
ports:
- 9090:9090
volumes:
- /etc/prometheus:/etc/prometheus
- prometheus-data:/prometheus
command: "--config.file=/etc/prometheus/prometheus.yml"
restart: unless-stopped
Run docker compose up -d
to start the services
Install Node Exporter:
On the same Linux EC2 instance (Instance-A), install Node Exporter to collect system-level metrics. For that, I’ll amend the previous docker-compose.yml file with the below changes to include the node exporter container:
---
version: '3'
volumes:
prometheus-data:
driver: local
grafana-data:
driver: local
services:
grafana:
image: grafana/grafana-oss:latest
container_name: grafana
ports:
- "3000:3000"
volumes:
- grafana-data:/var/lib/grafana
restart: unless-stopped
prometheus:
image: prom/prometheus:latest
container_name: prometheus
ports:
- 9090:9090
volumes:
- /etc/prometheus:/etc/prometheus
- prometheus-data:/prometheus
command: "--config.file=/etc/prometheus/prometheus.yml"
restart: unless-stopped
node_exporter:
image: quay.io/prometheus/node-exporter:latest
container_name: node_exporter
command:
- '--path.rootfs=/host'
network_mode: host
pid: host
restart: unless-stopped
volumes:
- '/:/host:ro,rslave'
- Launch another Linux EC2 instance (Instance-B) and use the below docker-compose.yml file to run node exporter.
---
version: '3'
services:
node_exporter:
image: quay.io/prometheus/node-exporter:latest
container_name: node_exporter
command:
- '--path.rootfs=/host'
network_mode: host
pid: host
restart: unless-stopped
volumes:
- '/:/host:ro,rslave'
- Run
docker-compose up -d
anddocker ps
to ensure the Docker container is running
Configure Prometheus to Scrape Metrics:
On Instance-A, edit the Prometheus configuration file
prometheus.yml
to include the scraping targets for both Node Exporter instances (Instance-A and Instance-B) and save them in/etc/prometheus
(create the directory if necessary).
global:
scrape_interval: 15s # By default, scrape targets every 15 seconds.
# Attach these labels to any time series or alerts when communicating with
# external systems (federation, remote storage, Alertmanager).
# external_labels:
# monitor: 'codelab-monitor'
# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
# The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
- job_name: 'prometheus'
# Override the global default and scrape targets from this job every 5 seconds.
scrape_interval: 5s
static_configs:
- targets: ['172.31.0.36:9090']
# Example job for node_exporter
- job_name: 'node_exporter'
static_configs:
- targets: ['172.31.0.58:9100', '172.31.0.36:9100']
# Example job for cadvisor
# - job_name: 'cadvisor'
# static_configs:
# - targets: ['cadvisor:8080']
- Restart the Prometheus service to apply the changes.
- Copy the public IP address of the instance where Prometheus is installed and paste it into the browser using port 9090 to verify Prometheus can connect to the target nodes.
Create a Grafana Dashboard:
Access the Grafana web interface by opening your browser and entering the public IP or DNS of the Grafana EC2 instance and port 3000
- Log in to Grafana using the default credentials (admin:admin) and change the password.
- Add Prometheus as a data source in Grafana, using the Prometheus EC2 instance’s IP address.
- Create a new dashboard in Grafana and add panels to visualize different metrics collected by Node Exporter.
Congratulations! You’ve successfully connected Linux EC2 instances with Grafana and created a dashboard to monitor their performance metrics. Monitoring is a crucial aspect of any DevOps practice, and Grafana along with Prometheus and Node Exporter provides a powerful combination to gain insights into the health of your infrastructure.