Comprehensive guide to all 22 health checks performed by ElasticDoctor
ElasticDoctor performs 22 comprehensive health checks across four critical categories. Each check is designed to identify potential issues before they impact your cluster's performance or availability.
Severity Levels
Each check is classified by severity: Critical issues require immediate attention, Warnings should be addressed for optimal performance, and Informational items provide insights.
Monitors overall cluster health (green/yellow/red)
Checks if all nodes are online and reachable
Verifies proper shard distribution across nodes
Identifies shards that cannot be allocated
Reviews cluster-level configuration settings
Analyzes query response times and throughput
Monitors indexing speed and efficiency
Checks heap memory utilization across nodes
Monitors disk space usage and growth trends
Tracks CPU usage patterns and bottlenecks
Analyzes JVM metrics and garbage collection
Verifies authentication mechanisms are enabled
Checks role-based access control configuration
Validates encryption in transit settings
Ensures security events are being logged
Reviews network binding and firewall settings
Validates snapshot and backup settings
Reviews index lifecycle policies and settings
Checks if proper monitoring is configured
Verifies logging levels and output settings
Identifies version-specific issues and recommendations
Reviews installed plugins and their configurations
Issues that require immediate attention to prevent data loss, service disruption, or security breaches.
Examples:
Issues that should be addressed to maintain optimal performance and prevent future problems.
Examples:
General insights and recommendations for best practices and optimization opportunities.
Examples:
Start with critical issues as they pose the highest risk to your cluster's stability and data integrity.
Focus on issues with high impact that affect multiple nodes or the entire cluster.
Each check includes specific remediation steps and best practices for resolution.
Re-run diagnostics after making changes to verify improvements and track your cluster's health over time.
Our team can help you interpret health check results and provide guidance on remediation strategies.