Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Integrate Elasticsearch 8 with Prometheus monitoring and Grafana dashboards

Set up comprehensive monitoring for Elasticsearch 8 using Prometheus metrics collection and Grafana visualization. This tutorial covers exporter installation, metric configuration, dashboard setup, and alerting rules for production environments.

45 min 4 distros 65 views
monitoring Advanced

Implement OpenTelemetry distributed context propagation across microservices with automatic instrumentation

Set up comprehensive distributed tracing across microservices using OpenTelemetry with automatic context propagation, trace correlation headers, and framework-specific auto-instrumentation for Python, Java, and Node.js applications.

45 min 4 distros 97 views
devops Advanced

Set up MinIO gateway for multi-cloud storage federation with AWS S3 and Azure integration

Learn to deploy MinIO as a unified storage gateway that seamlessly federates multiple cloud storage backends including AWS S3 and Azure Blob Storage. This advanced configuration provides intelligent failover, cost optimization, and vendor-neutral object storage management.

45 min 4 distros 36 views
monitoring Advanced

Configure Thanos Ruler for distributed alerting across multiple Prometheus clusters

Set up Thanos Ruler to create a unified alerting layer across distributed Prometheus instances. This tutorial covers installation, global rule configuration, and cross-cluster alert federation for enterprise monitoring.

45 min 4 distros 59 views
databases Intermediate

Implement automated ClickHouse backups with S3 storage and monitoring

Set up automated backup solutions for ClickHouse databases with S3 storage, retention policies, and monitoring alerts. This tutorial covers backup tool installation, S3 configuration, scheduling with systemd timers, and health monitoring.

45 min 4 distros 33 views
security Intermediate

Implement encryption for Consul backups using GPG with automated snapshots and secure storage

Learn to create encrypted Consul backups using GPG encryption with automated snapshot scripts, secure key management, and remote storage integration for production disaster recovery.

45 min 4 distros 81 views
databases Advanced

Backup and restore Redis Sentinel cluster data with automated strategies and monitoring

Learn to implement production-grade backup and restore procedures for Redis Sentinel clusters with automated scheduling, point-in-time recovery, and comprehensive monitoring to ensure data durability and business continuity.

45 min 4 distros 66 views
security Advanced

Set up intrusion detection with OSSEC HIDS and nftables integration for automated threat response

Configure OSSEC Host-based Intrusion Detection System with nftables firewall integration for real-time threat detection and automated response. Monitor system activity, detect intrusions, and automatically block malicious IPs using active response mechanisms.

45 min 4 distros 66 views
networking Intermediate

Implement SNMP device auto-discovery with network scanning and automated inventory management

Set up automated SNMP device discovery using nmap network scanning and Python scripts for dynamic inventory management. Integrate with Zabbix and Grafana for comprehensive network monitoring and visualization.

45 min 4 distros 75 views
monitoring Intermediate

Implement custom Prometheus exporters for application metrics collection and monitoring

Build production-grade custom Prometheus exporters in Python and Go to collect application-specific metrics. Learn exporter architecture, metric types, systemd deployment, and Prometheus integration for comprehensive application monitoring.

45 min 4 distros 66 views
devops Advanced

Set up Apache Airflow high availability with CeleryExecutor and Redis clustering

Deploy Apache Airflow in high availability mode using CeleryExecutor with Redis clustering for task distribution, PostgreSQL connection pooling, and load-balanced webservers for production-grade workflow orchestration.

45 min 4 distros 33 views
devops Advanced

Implement Apache Spark 3.5 cluster with YARN and HDFS for distributed computing

Set up a production-grade Apache Spark 3.5 cluster with YARN resource management and HDFS distributed storage for scalable big data processing. This tutorial covers multi-node Hadoop cluster configuration, YARN integration, and monitoring setup.

45 min 4 distros 58 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer