Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Advanced

Monitor MariaDB Galera cluster with Prometheus and Grafana for high availability insights

Configure comprehensive monitoring for MariaDB Galera clusters using Prometheus exporters and Grafana dashboards to track cluster health, replication status, and performance metrics with automated alerting for production environments.

45 min 4 distros 185 views
monitoring Advanced

Implement Thanos multi-cluster federation for global Prometheus metrics aggregation

Set up Thanos components across multiple Kubernetes clusters to enable global metrics federation, long-term storage, and unified querying of Prometheus data with high availability and unlimited retention.

45 min 4 distros 214 views
monitoring Intermediate

Set up OpenTelemetry metrics collection with Prometheus integration for distributed system monitoring

Configure OpenTelemetry Collector to gather metrics from distributed services and export them to Prometheus for comprehensive observability monitoring. This integration provides unified metrics collection across your microservices architecture.

25 min 4 distros 219 views
devops Intermediate

Integrate Redis 7 with microservices architecture for caching and session management

Set up Redis 7 as a centralized caching layer and session store for microservices, with service discovery integration and clustering for high availability. Configure distributed session management patterns and implement Redis clustering for horizontal scalability.

45 min 4 distros 293 views
databases Intermediate

Implement Redis backup automation with RDB and AOF persistence

Set up automated Redis backups using RDB snapshots and AOF logging with verification scripts and monitoring alerts. Configure persistent data storage, automated backup scheduling, and comprehensive backup validation for production Redis deployments.

45 min 4 distros 175 views
monitoring Advanced

Set up keepalived cluster monitoring with Prometheus alerts and Grafana dashboards

Configure comprehensive monitoring for keepalived VRRP clusters using Prometheus metrics collection, alerting rules for failover events, and Grafana dashboards for high availability visualization.

45 min 4 distros 194 views
monitoring Advanced

Implement Prometheus federation for multi-cluster monitoring with centralized metrics aggregation

Set up hierarchical Prometheus federation to monitor multiple Kubernetes clusters with a central aggregation layer. Configure global and local Prometheus instances with federated scrape jobs, service discovery, and unified dashboards for enterprise-scale observability.

45 min 4 distros 231 views
databases Advanced

Configure MinIO high availability clustering for production with multi-node setup and automatic failover

Set up a production-ready MinIO cluster with multiple nodes, erasure coding, and automatic failover. This tutorial covers distributed mode configuration, load balancing with SSL termination, and health monitoring for enterprise object storage.

45 min 4 distros 306 views
databases Advanced

Optimize CockroachDB 24.3 performance with advanced tuning and configuration

Master advanced CockroachDB 24.3 performance optimization through memory, CPU, storage, and query tuning. Configure cluster parameters, connection pooling, and indexing strategies for production-grade distributed SQL workloads.

45 min 4 distros 220 views
linux Intermediate

Configure system backup automation with BorgBackup and systemd timers

Set up automated, deduplicated backups with BorgBackup and systemd timers for reliable system protection. This tutorial covers installation, repository setup with encryption, automated scheduling, and monitoring for production environments.

45 min 4 distros 197 views
devops Advanced

Advanced Nomad job templates and deployment strategies with rolling updates and canary deployments

Master production-grade Nomad job templates with HCL syntax, implement rolling updates with health checks, and deploy advanced blue-green and canary deployment patterns for resilient containerized workloads.

45 min 4 distros 299 views
monitoring Intermediate

Set up ClickHouse monitoring with Prometheus and Grafana dashboards

Configure comprehensive ClickHouse monitoring using Prometheus for metrics collection and Grafana for visualization. Set up system metrics, query performance monitoring, and alerting rules for production ClickHouse deployments.

45 min 4 distros 344 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer