Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Configure Prometheus Alertmanager with email notifications for production monitoring

Set up Prometheus Alertmanager to send email notifications when your systems trigger alerts. This tutorial covers SMTP configuration, alert routing rules, and email template customization for production monitoring workflows.

35 min 4 distros 197 views
linux Advanced

Configure Linux transparent huge pages and memory optimization for database workloads

Learn to configure transparent huge pages (THP) and optimize Linux memory management for database workloads. Covers THP disable/enable strategies, performance monitoring, and automation with systemd.

25 min 6 distros 280 views
linux Beginner

Configure Linux swap space with memory management and performance optimization

Set up and optimize Linux swap space for better system performance. Learn to create swap files, configure swappiness parameters, and monitor memory usage across different Linux distributions.

20 min 6 distros 289 views
linux Beginner

Configure Linux system monitoring with iostat and disk performance analysis

Learn to monitor disk I/O performance with iostat from the sysstat package. Set up real-time monitoring, identify bottlenecks, and automate disk performance analysis with systemd timers.

25 min 6 distros 300 views
hosting Beginner

Fix systemd service keeps restarting in a loop

Stop systemd services from continuously restarting by identifying restart loop causes, debugging configuration issues, and implementing proper service constraints to ensure stable operation.

25 min 6 distros 448 views
linux Advanced

Configure Linux container memory limits and monitoring with systemd and cgroups v2

Set up memory limits and monitoring for containers using systemd services and cgroups v2 to prevent OOM kills and track resource usage.

25 min 6 distros 381 views
hosting Beginner

Fix Apache web server high memory usage and optimize performance

Diagnose Apache memory issues and optimize MPM modules, worker processes, and configuration to reduce RAM usage and improve web server performance.

25 min 6 distros 304 views
hosting Beginner

Fix high load average with low CPU usage on web servers

Diagnose and resolve high load average issues when CPU usage remains low on Linux web servers. Learn to identify I/O bottlenecks, blocked processes, and system resource contention affecting performance.

25 min 6 distros 396 views
linux Beginner

Configure Linux process monitoring with top, htop, and btop for system performance analysis

Master Linux process monitoring using top, htop, and btop to analyze system performance, identify resource bottlenecks, and troubleshoot performance issues in production environments.

25 min 6 distros 290 views
monitoring Intermediate

Set up ELK stack alerting with Watcher and email notifications for monitoring and incident response

Configure Elasticsearch Watcher to monitor log data and automatically send email alerts when critical system events occur. Create sophisticated alert conditions, manage email notification templates, and set up automated incident response workflows.

45 min 4 distros 178 views
devops Advanced

Integrate Jaeger with Kubernetes service mesh for comprehensive distributed tracing

Set up production-grade distributed tracing in Kubernetes using Jaeger with Istio service mesh integration. Configure OpenTelemetry instrumentation, Elasticsearch storage backend, and comprehensive observability for microservices.

45 min 4 distros 214 views
linux Intermediate

Implement Linux resource quotas with systemd and automated enforcement

Configure systemd resource control and cgroups v2 to implement CPU, memory, and I/O quotas with automated enforcement. Set up monitoring and alerts for resource violations across production workloads.

45 min 4 distros 401 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer