Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Implement custom Prometheus exporters for application metrics collection and monitoring

Build production-grade custom Prometheus exporters in Python and Go to collect application-specific metrics. Learn exporter architecture, metric types, systemd deployment, and Prometheus integration for comprehensive application monitoring.

45 min 4 distros 85 views
devops Advanced

Set up Apache Airflow high availability with CeleryExecutor and Redis clustering

Deploy Apache Airflow in high availability mode using CeleryExecutor with Redis clustering for task distribution, PostgreSQL connection pooling, and load-balanced webservers for production-grade workflow orchestration.

45 min 4 distros 33 views
monitoring Intermediate

Integrate Nagios Core 4.5 with Grafana dashboards for advanced monitoring visualization

Connect Nagios Core 4.5 with Grafana through NDOUtils and MySQL to create powerful monitoring dashboards. This integration provides advanced visualization capabilities, real-time alerting, and comprehensive monitoring insights for your infrastructure.

45 min 4 distros 107 views
devops Advanced

Implement Apache Spark 3.5 cluster with YARN and HDFS for distributed computing

Set up a production-grade Apache Spark 3.5 cluster with YARN resource management and HDFS distributed storage for scalable big data processing. This tutorial covers multi-node Hadoop cluster configuration, YARN integration, and monitoring setup.

45 min 4 distros 70 views
databases Advanced

Set up MariaDB Galera Cluster 11.6 for multi-master replication with automatic failover

Configure a production-ready MariaDB Galera cluster with three nodes for multi-master replication, automatic failover, and SSL encryption. Includes HAProxy load balancing and comprehensive monitoring for high availability database operations.

45 min 4 distros 156 views
performance Advanced

Configure Varnish cache invalidation with automated purging strategies for high-performance web acceleration

Learn to configure advanced Varnish VCL for cache invalidation, implement PURGE and BAN strategies, and set up automated cache tagging for optimal performance. Master selective invalidation techniques and monitoring for production-grade web acceleration.

45 min 4 distros 120 views
databases Advanced

Optimize MariaDB 11.6 performance with query analysis and indexing for high-traffic applications

Learn advanced MariaDB performance optimization techniques including query profiling, slow query analysis, and strategic indexing for high-traffic applications. This tutorial covers query execution plan analysis, covering indexes, and automated monitoring setup.

45 min 4 distros 91 views
databases Advanced

Implement TimescaleDB backup strategies and point-in-time recovery with automated failover

Configure comprehensive TimescaleDB backup solutions with continuous WAL archiving, automated point-in-time recovery procedures, and monitoring systems to ensure data protection and business continuity.

45 min 4 distros 57 views
monitoring Intermediate

Set up Kibana alerting and monitoring with Elasticsearch Watcher for automated threat detection

Configure Kibana alerting rules and Elasticsearch Watcher to automatically detect security threats and send notifications via email or Slack. Set up monitoring dashboards with automated responses for production security monitoring.

45 min 4 distros 93 views
devops Advanced

Setup Apache Airflow cluster with Kubernetes Executor for auto-scaling workflows

Deploy production-grade Apache Airflow with Kubernetes Executor for dynamic workflow scaling. Configure PostgreSQL backend, RBAC authentication, and auto-scaling policies with Prometheus monitoring integration.

45 min 4 distros 76 views
devops Advanced

Set up Istio multi-cluster service mesh with cross-cluster communication

Deploy and configure Istio across multiple Kubernetes clusters with secure cross-cluster communication, shared service discovery, and unified traffic management for distributed microservices architecture.

45 min 4 distros 95 views
monitoring Advanced

Set up Thanos Receiver for remote write scalability with Prometheus integration

Configure Thanos Receiver to handle high-volume remote write traffic from multiple Prometheus instances. This tutorial covers installation, multi-tenancy setup, and performance optimization for large-scale metrics ingestion.

45 min 4 distros 132 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer