ElasticDoctor - Elasticsearch Health Diagnostics

Your Data Layer Dashboard

The cat indices check is your first look at the data layer health. It provides a quick overview of all indices, their health status, document counts, and storage usage - essential for understanding your cluster's data landscape.

Among all Elasticsearch APIs, cat indices is probably the most frequently used by administrators. It gives you an instant snapshot of your cluster's data: which indices exist, how healthy they are, how much space they use, and how many documents they contain. This check forms the foundation of data layer diagnostics.

What You'll Learn

Index Assessment

• Reading index health indicators
• Understanding size and document metrics
• Identifying index naming patterns
• Spotting storage inefficiencies

Problem Detection

• Finding oversized or undersized indices
• Detecting index proliferation issues
• Identifying stale or unused indices
• Monitoring storage growth patterns

Cat Indices API Deep Dive

GET RequestAll ES Versions (5.x - 9.x)

GET /_cat/indices?v&s=store.size:desc

Simple English Explanation

Think of this API as getting a "table of contents" for your data. It's like asking: "What books (indices) do I have on my shelf, how thick are they (size), how many pages (documents), and are they in good condition (health)?"

It's the first thing most administrators check when investigating data-related issues.

Sample Output

health status index                    pri rep docs.count docs.deleted store.size pri.store.size
green  open   logs-2024.12.04          1   1    1234567           123      2.1gb        1.0gb
yellow open   metrics-2024.12.04       1   1     987654            45      1.8gb        900mb
green  open   users                    1   1       5432             0      12.4mb       6.2mb
red    open   corrupted-index          1   1          0             0         0b           0b
green  open   .kibana_7.15.0_001       1   1        156             3      1.2mb        600kb

📊 Key Columns

• health: green/yellow/red status
• status: open/close state
• index: Index name
• pri/rep: Primary/replica shard counts
• docs.count: Total documents
• store.size: Total storage used

🔧 Useful Parameters

• ?v: Show column headers
• ?s=store.size:desc: Sort by size
• ?h=health,index,docs.count,store.size: Custom columns
• ?bytes=b: Show exact byte values
• logs-*: Filter by pattern

Common Index Issues and Solutions

🚨 Critical: Red Index Status

Immediate Impact

• Data unavailable or partially accessible
• Search results incomplete
• Indexing may fail for affected shards
• Potential data loss risk

Emergency Actions

1. Check allocation explain for specific shard
2. Verify node availability and disk space
3. Review cluster logs for errors
4. Consider restoring from backup
5. Use allocation reroute if safe

⚠️ Warning: Index Proliferation

Symptoms

• Hundreds or thousands of indices
• Many small indices (<1GB each)
• Poor search performance
• High cluster state overhead

Solutions

• Implement data streams for time-series data
• Use rollover policies to control index size
• Delete old indices with ILM
• Consider index templates for consistency

ℹ️ Info: Storage Optimization Opportunities

Optimization Targets

• Large indices that could be split
• Old indices suitable for cold storage
• Indices with high deleted document ratio
• Unused or forgotten indices

Optimization Actions

• Force merge indices with high deletions
• Move old data to warm/cold tiers
• Delete unused indices after verification
• Implement automated lifecycle policies

ElasticDoctor Analysis Implementation

🔍 How ElasticDoctor Analyzes Index Health

Health Status Assessment

ElasticDoctor automatically categorizes indices by health status (red/yellow/green) and provides immediate alerts for any indices with availability issues or reduced redundancy.

Size and Storage Analysis

Identifies oversized indices that may impact performance, calculates storage efficiency metrics, and detects indices with high document deletion ratios that waste disk space.

Index Proliferation Detection

Monitors index creation patterns to identify proliferation issues, recommends consolidation strategies, and suggests lifecycle management improvements.

Optimization Recommendations

Provides actionable recommendations for force merging, data tier migration, index lifecycle policies, and capacity planning based on usage patterns.

Index Management Best Practices

✅ Healthy Index Patterns

• Use consistent naming conventions
• Implement time-based indices for logs
• Keep shard sizes between 10-50GB
• Monitor and maintain green health status
• Use index templates for consistency

💡 Monitoring Tips

• Check indices health daily
• Monitor storage growth trends
• Set up alerts for red/yellow status
• Track index creation patterns
• Review document deletion ratios

❌ Index Anti-Patterns

• Ignoring red index status
• Creating too many small indices
• Not implementing lifecycle management
• Keeping unused indices indefinitely
• Allowing unlimited index growth

⚠️ Performance Impact

• Red indices affect search completeness
• Too many indices slow cluster operations
• Large indices can impact query performance
• High deletion ratios waste storage

Index Health Mastery

Essential Insights

• Health First: Green status should be the norm
• Size Matters: Monitor and control index growth
• Pattern Recognition: Identify optimization opportunities
• Proactive Management: Prevent issues before they impact users

Action Plan

• Implement daily index health monitoring
• Set up automated lifecycle management
• Create index optimization procedures
• Establish capacity planning processes

Previous: Hot Threads Check Next: Index Settings Check

Cat Indices Check: Index Health Overview and Quick Assessment