Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

security Advanced

Configure Keycloak high availability clustering for production with load balancing and failover

Set up a production-ready Keycloak cluster with PostgreSQL backend, HAProxy load balancing, and automatic failover to ensure identity services remain available during node failures and high traffic.

90 min 4 distros 94 views

devops Intermediate

Configure Kubernetes OpenTelemetry auto-instrumentation for microservices observability

Set up the OpenTelemetry Operator in Kubernetes to automatically instrument microservices with distributed tracing, adding observability across your application stack without modifying application code.

45 min 4 distros 20 views

devops Intermediate

Configure Traefik with Consul service discovery for dynamic load balancing

Set up the Traefik 3.1 reverse proxy with Consul service discovery to route traffic to services automatically, without manual configuration. Learn dynamic routing, SSL automation, health checks, and monitoring setup for production environments.

25 min 4 distros 37 views

monitoring Intermediate

Set up Docker Compose monitoring stack with Prometheus and Grafana for AI model performance tracking

Deploy a complete monitoring stack using Docker Compose with Prometheus for metrics collection and Grafana for visualization, specifically configured to track AI model performance metrics like inference latency, throughput, and resource utilization.

45 min 4 distros 40 views

monitoring Intermediate

Configure backup monitoring with Prometheus and Grafana for automated infrastructure oversight

Set up comprehensive backup monitoring using Prometheus exporters and Grafana dashboards. Configure automated alerts for backup failures, track success rates, and visualize backup infrastructure health across multiple systems.

45 min 4 distros 28 views

databases Advanced

Optimize ClickHouse performance for large datasets with advanced tuning and memory management

Optimize ClickHouse for high-performance analytics on multi-terabyte datasets with advanced memory tuning, storage engine configuration, query optimization strategies, and distributed clustering for production workloads.

45 min 4 distros 51 views

hosting Advanced

Configure Apache HTTP Server clustering with mod_cluster and JBoss EAP for enterprise load balancing

Set up Apache HTTP Server with mod_cluster module to create an intelligent load balancing cluster with JBoss EAP application servers, featuring automatic node discovery, session clustering, and health monitoring.

45 min 6 distros 102 views

monitoring Intermediate

Set up ELK stack alerting with Watcher and email notifications for monitoring and incident response

Configure Elasticsearch Watcher to monitor log data and automatically send email alerts when critical system events occur. Create sophisticated alert conditions, manage email notification templates, and set up automated incident response workflows.

45 min 4 distros 41 views

devops Advanced

Integrate Jaeger with Kubernetes service mesh for comprehensive distributed tracing

Set up production-grade distributed tracing in Kubernetes using Jaeger with Istio service mesh integration. Configure OpenTelemetry instrumentation, Elasticsearch storage backend, and comprehensive observability for microservices.

45 min 4 distros 50 views

monitoring Advanced

Install and configure Graylog 5 for centralized log management with MongoDB and Elasticsearch

Set up Graylog 5 with MongoDB 8.0 and Elasticsearch 8 for centralized log collection, analysis, and alerting. Learn to configure secure inputs, data streams, and monitoring for production environments.

45 min 4 distros 69 views

networking Intermediate

Configure Tailscale with Kubernetes cluster networking integration

Set up Tailscale mesh VPN with Kubernetes cluster integration for secure pod-to-pod communication, subnet routing, and service discovery across distributed nodes.

25 min 4 distros 47 views

monitoring Advanced

Configure Thanos Receiver clustering for high availability and load distribution

Set up Thanos Receiver clustering with hashring configuration to distribute Prometheus remote write traffic across multiple replicas for high availability and scalability.

45 min 4 distros 96 views
