Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Setup Prometheus Blackbox Exporter for endpoint monitoring with SSL and alerting

Configure Prometheus Blackbox Exporter to monitor HTTP endpoints, SSL certificates, and DNS resolution with automated alerting rules and Grafana dashboards.

25 min 4 distros 41 views
monitoring Intermediate

Configure Prometheus alerting with AlertManager notifications and webhook integration

Set up AlertManager to handle Prometheus alerts with email, Slack, and webhook notifications. Configure alerting rules, notification channels, and test the complete alerting pipeline.

25 min 4 distros 15 views
monitoring Intermediate

Set up Prometheus and Grafana monitoring stack with Docker Compose

Deploy a complete Prometheus and Grafana monitoring solution using Docker Compose with persistent storage, custom dashboards, and Alertmanager integration for production-ready observability.

25 min 4 distros 36 views
monitoring Advanced

Configure advanced Grafana dashboards and alerting with Prometheus integration

Build production-ready Grafana dashboards with dynamic variables, custom panels, and sophisticated alert rules. Integrate Prometheus metrics for comprehensive monitoring with multi-condition alerting and notification channels.

45 min 4 distros 204 views
monitoring Intermediate

Set up Alertmanager with email and Slack notifications for monitoring alerts

Configure Prometheus Alertmanager with email SMTP and Slack webhook integrations for comprehensive monitoring notifications. Set up alert routing rules and test notification delivery.

25 min 4 distros 45 views
monitoring Intermediate

Monitor cron jobs and systemd timers with Prometheus and Grafana alerting

Set up comprehensive monitoring for scheduled tasks using Prometheus node_exporter and custom metrics collection. Configure Grafana dashboards and alerting rules to track job success, failures, and missed executions across your infrastructure.

45 min 4 distros 129 views
monitoring Intermediate

Monitor Node.js applications with Prometheus and Grafana for comprehensive performance metrics

Set up comprehensive Node.js application monitoring with Prometheus metrics collection and Grafana dashboards. Configure alerting rules for performance issues, memory usage, and error tracking in production environments.

45 min 4 distros 158 views
monitoring Intermediate

Configure backup monitoring with Prometheus and Grafana for automated infrastructure oversight

Set up comprehensive backup monitoring using Prometheus metrics collection and Grafana dashboards. This tutorial covers backup exporter configuration, custom metrics creation, and automated alerting for backup failures and performance issues.

45 min 4 distros 149 views
monitoring Intermediate

Set up Linux storage monitoring with smartmontools and automated health alerts

Monitor disk health and prevent storage failures with S.M.A.R.T monitoring, automated email alerts, and custom dashboards. Covers smartd daemon configuration, health checks, and integration with monitoring systems.

25 min 4 distros 101 views
devops Intermediate

Implement Consul backup and disaster recovery with automated snapshots and restoration

Set up automated Consul snapshots with GPG encryption, systemd timers, and complete disaster recovery procedures. Includes monitoring integration with Prometheus and automated restoration workflows for production environments.

45 min 4 distros 173 views
monitoring Intermediate

Set up Docker Compose monitoring stack with Prometheus and Grafana for AI model performance tracking

Deploy a complete monitoring stack using Docker Compose with Prometheus for metrics collection and Grafana for visualization, specifically configured to track AI model performance metrics like inference latency, throughput, and resource utilization.

45 min 4 distros 217 views
monitoring Advanced

Configure Thanos Receiver clustering for high availability and load distribution

Set up Thanos Receiver clustering with hashring configuration to distribute Prometheus remote write traffic across multiple replicas for high availability and scalability.

45 min 4 distros 218 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer