Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance. Every guide has copy-paste commands and multi-distro support, and is written by engineers who run these systems in production.

monitoring Intermediate

Configure NTP monitoring with Grafana dashboards and Prometheus alerting

Monitor time synchronization with chrony, the Prometheus Node Exporter, and custom Grafana dashboards, with alerts for time drift and NTP service failures.

25 min 4 distros 12 views

devops Advanced

Configure Spark on Kubernetes with cluster autoscaling for dynamic workloads

Deploy Apache Spark 3.5 on Kubernetes with automatic cluster scaling, dynamic resource allocation, and comprehensive monitoring for production data processing workloads.

45 min 4 distros 113 views

monitoring Intermediate

Configure Jaeger alerting with Prometheus and Grafana for distributed tracing observability

Monitor Jaeger distributed tracing with Prometheus metrics collection and Grafana dashboards, with automated alerts for trace performance issues.

45 min 4 distros 53 views

devops Advanced

Set up Spark Streaming with Kafka and Delta Lake for real-time analytics

Configure Apache Spark 3.5 with Kafka integration and Delta Lake support to build production-grade real-time analytics pipelines with ACID transactions.

45 min 4 distros 91 views

performance Advanced

Implement Spark SQL performance optimization with Catalyst optimizer and advanced tuning

Optimize Apache Spark 3.5 SQL performance using Catalyst optimizer with advanced query tuning, adaptive query execution, and production-grade configuration for high-throughput analytics workloads.

45 min 4 distros 64 views

monitoring Intermediate

Configure MariaDB 11.6 performance monitoring with Prometheus and Grafana dashboards

Set up comprehensive MariaDB monitoring with mysqld_exporter, Prometheus metrics collection, and custom Grafana dashboards for database performance insights and alerting.

35 min 4 distros 74 views

monitoring Intermediate

Configure PM2 log rotation and advanced monitoring with Prometheus integration

Set up comprehensive PM2 monitoring with automated log rotation, Prometheus metrics, cluster mode management, and production-ready health checks for Node.js applications.

45 min 4 distros 55 views

monitoring Intermediate

Monitor Caddy and Consul integration with Prometheus and Grafana dashboards

Set up comprehensive monitoring for Caddy reverse proxy and Consul service discovery with Prometheus metrics collection and Grafana dashboards for performance insights and alerting.

45 min 4 distros 43 views

databases Intermediate

Configure Elasticsearch 8 snapshot and restore policies with automated backup strategies

Set up comprehensive Elasticsearch 8 backup strategies with snapshot lifecycle management (SLM), filesystem and S3 repository backends, automated scheduling, and recovery procedures for production environments.

45 min 4 distros 61 views

monitoring Intermediate

Configure MySQL backup monitoring with Prometheus alerts and Grafana dashboards

Set up automated monitoring for MySQL backups with Prometheus metrics collection, alerting rules for backup failures, and comprehensive Grafana dashboards for backup status visualization.

45 min 4 distros 58 views

databases Intermediate

Set up MySQL point-in-time recovery with binary logs for database backup and restore

Configure MySQL binary logging and implement automated, monitored point-in-time recovery procedures for database backup and restore.

45 min 4 distros 64 views

monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows.

45 min 4 distros 181 views
