Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

security Advanced

Configure Airflow DAG security and isolation with RBAC policies

Set up comprehensive security for Apache Airflow with role-based access control, DAG-level permissions, and resource isolation. Configure user authentication, implement fine-grained security policies, and establish monitoring for production-grade workflow orchestration.

45 min 4 distros 574 views
monitoring Advanced

Integrate Zabbix 7 with network automation and orchestration tools using Ansible and Python APIs

Build comprehensive network automation workflows by integrating Zabbix 7 monitoring with Ansible playbooks and custom Python scripts. This tutorial covers API authentication, automated host provisioning, and dynamic monitoring configuration for enterprise infrastructure management.

45 min 4 distros 400 views
performance Advanced

Configure Apache Airflow performance optimization with connection pooling and resource tuning

Optimize Apache Airflow performance through advanced connection pooling, resource tuning, and Celery executor configuration. Learn to scale workers, configure database pools, and implement comprehensive monitoring for production workflows.

45 min 4 distros 831 views
monitoring Advanced

Implement Thanos multi-cluster federation for global Prometheus metrics aggregation

Set up Thanos components across multiple Kubernetes clusters to enable global metrics federation, long-term storage, and unified querying of Prometheus data with high availability and unlimited retention.

45 min 4 distros 414 views
monitoring Advanced

Implement Prometheus federation for multi-cluster monitoring with centralized metrics aggregation

Set up hierarchical Prometheus federation to monitor multiple Kubernetes clusters with a central aggregation layer. Configure global and local Prometheus instances with federated scrape jobs, service discovery, and unified dashboards for enterprise-scale observability.

45 min 4 distros 443 views
databases Advanced

Configure MinIO high availability clustering for production with multi-node setup and automatic failover

Set up a production-ready MinIO cluster with multiple nodes, erasure coding, and automatic failover. This tutorial covers distributed mode configuration, load balancing with SSL termination, and health monitoring for enterprise object storage.

45 min 4 distros 588 views
devops Advanced

Advanced Nomad job templates and deployment strategies with rolling updates and canary deployments

Master production-grade Nomad job templates with HCL syntax, implement rolling updates with health checks, and deploy advanced blue-green and canary deployment patterns for resilient containerized workloads.

45 min 4 distros 646 views
monitoring Intermediate

Set up log alerting with Fluentd and Prometheus Alertmanager for centralized monitoring

Configure Fluentd to collect and parse logs, integrate with Prometheus metrics collection, and set up Alertmanager for intelligent routing of log-based alerts to multiple notification channels.

45 min 4 distros 555 views
devops Intermediate

Integrate Consul with Kubernetes service discovery and automatic configuration

Set up Consul for dynamic service discovery in Kubernetes clusters with automatic service registration, health checks, and configuration management for microservices orchestration.

45 min 4 distros 392 views
devops Intermediate

Setup Kubernetes Ingress NGINX with cert-manager for automated SSL certificates

Learn to deploy NGINX Ingress Controller with cert-manager for automatic SSL certificate provisioning and renewal using Let's Encrypt in production Kubernetes clusters.

35 min 4 distros 540 views
monitoring Intermediate

Set up distributed tracing for Node.js and Python microservices with OpenTelemetry and Jaeger

Configure end-to-end distributed tracing across Node.js and Python microservices using OpenTelemetry instrumentation and Jaeger backend with Elasticsearch for centralized trace collection and analysis.

45 min 4 distros 417 views
devops Intermediate

Configure Kubernetes horizontal pod autoscaler for dynamic scaling based on resource metrics

Set up HPA with CPU and memory targets for automatic pod scaling. Configure metrics server and Prometheus adapter for custom metrics monitoring. Enable dynamic workload scaling based on resource utilization.

45 min 4 distros 507 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer