Health Checks
HTTP Ping Endpoint
The simplest health check is the HTTP ping endpoint. Use it for:
- Load balancer health checks
- Docker/Kubernetes liveness probes
- Uptime monitoring
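As a minimal sketch of an automated check against the ping endpoint, the Python helper below treats an HTTP 200 response as healthy and anything else (including timeouts and refused connections) as unhealthy. The function name `ping` and its `opener` parameter are illustrative and not part of Proton; the URL must be supplied by the caller, since the host and HTTP port depend on the deployment.

```python
import urllib.error
import urllib.request


def ping(url: str, timeout: float = 2.0, opener=urllib.request.urlopen) -> bool:
    """Return True if `url` answers with HTTP 200 within `timeout` seconds.

    `opener` is injectable so the check can be exercised without a server;
    by default it is the standard library's urlopen.
    """
    try:
        with opener(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Covers HTTP error statuses, refused connections, DNS failures
        # and timeouts: all count as "not healthy" for a liveness check.
        return False
```

A caller would pass the full ping URL of its own server, for example `ping("http://<host>:<http-port>/ping")`, and map the boolean onto whatever the load balancer or uptime monitor expects.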
Docker Health Check
Add to your Dockerfile or docker-compose.yml:
Kubernetes Probes
TCP Connection Test
Test the native protocol port:
System Tables
Proton exposes extensive runtime information through system tables in the system database.
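One way to read a system table from a script is to send the SQL statement over HTTP and parse the rows. The sketch below rests on an assumption: that the server exposes a ClickHouse-style HTTP interface which accepts a statement as the POST body and answers in tab-separated text. Verify this against your deployment before relying on it. The helper names `build_request`, `parse_tsv`, and `run_query` are hypothetical.

```python
import urllib.request


def build_request(sql: str, base_url: str) -> urllib.request.Request:
    """Wrap a SQL statement in an HTTP POST request (statement as the body).

    Assumption: the endpoint at `base_url` accepts SQL this way.
    """
    return urllib.request.Request(base_url, data=sql.encode("utf-8"), method="POST")


def parse_tsv(text: str) -> list:
    """Split a tab-separated response into rows of string fields."""
    return [line.split("\t") for line in text.splitlines() if line]


def run_query(sql: str, base_url: str, timeout: float = 5.0) -> list:
    """Send `sql` to `base_url` and return the parsed rows."""
    request = build_request(sql, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_tsv(response.read().decode("utf-8"))
```

With such a helper, a statement like `SELECT * FROM system.errors` could be polled on a schedule; the base URL (host and port) is deployment-specific and is deliberately left to the caller.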
Query Monitoring
Current Queries
View currently running queries:
Query Log
Analyze query performance history:
Performance Metrics
Current Metrics
Real-time server metrics:
Asynchronous Metrics
Periodically updated metrics:
Event Counters
Cumulative event statistics:
Resource Usage
Memory Usage
Current memory consumption:
Disk Usage
Table storage statistics:
Stream and Table Information
List All Streams
Stream Statistics
Error Monitoring
Track errors by type:
Log Files
Log Locations
Default log file paths:
- Server log: /var/log/proton-server/proton-server.log
- Error log: /var/log/proton-server/proton-server.err.log
Log Levels
Configure in config.yaml:
View Logs in Docker
Parse Logs for Errors
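A rough sketch of log parsing in Python: the helper below counts lines by severity. It assumes each log line carries a level tag in angle brackets such as `<Error>` or `<Warning>`, the convention used by ClickHouse-derived servers. Check a few lines of your own log and adjust the pattern if the format differs. The function names are illustrative.

```python
import re
from collections import Counter

# Assumed level-tag convention; adjust if your log lines look different.
LEVEL_TAG = re.compile(r"<(Fatal|Critical|Error|Warning)>")


def count_problem_lines(lines) -> Counter:
    """Count lines per severity tag, ignoring lines without a matching tag."""
    counts = Counter()
    for line in lines:
        match = LEVEL_TAG.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def scan_log(path: str) -> Counter:
    """Stream a log file from disk and count its problem lines."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return count_problem_lines(handle)
```

Calling `scan_log` on the error log path of your installation returns a `Counter` that can be compared between runs to spot a rising error count.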
Performance Monitoring
Query Performance Dashboard
Create a monitoring query:
Throughput Monitoring
Resource Utilization Over Time
Grafana Integration
Use the Proton Grafana data source to build dashboards.
Example Dashboard Queries
Active Queries:
Alerting
Key Metrics to Alert On
- Server Availability: /ping endpoint down
- High Error Rate: Errors in system.errors increasing
- Memory Usage: jemalloc.allocated > 80% of RAM
- Slow Queries: p95 latency > threshold
- Failed Queries: High count in system.query_log
- Disk Space: Storage > 90% full
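The memory, latency, and disk thresholds in the list above can be sketched as a small evaluation step, assuming the raw numbers have already been collected elsewhere. The `Snapshot` type, its field names, and the alert labels are hypothetical; the p95 uses the nearest-rank method, which is one common definition and may differ slightly from what a dashboard computes.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Snapshot:
    """One set of readings; all fields are illustrative names."""
    memory_allocated_bytes: int
    total_ram_bytes: int
    disk_used_bytes: int
    disk_total_bytes: int
    query_durations_ms: Sequence[float]


def p95(values: Sequence[float]) -> float:
    """Nearest-rank 95th percentile; returns 0.0 for an empty input."""
    if not values:
        return 0.0
    ordered = sorted(values)
    rank = (95 * len(ordered) + 99) // 100  # ceil(0.95 * n) in integers
    return ordered[rank - 1]


def evaluate(snapshot: Snapshot, latency_threshold_ms: float) -> List[str]:
    """Return the labels of every threshold the snapshot exceeds."""
    alerts = []
    if snapshot.memory_allocated_bytes * 100 > 80 * snapshot.total_ram_bytes:
        alerts.append("memory")        # allocated memory above 80% of RAM
    if snapshot.disk_used_bytes * 100 > 90 * snapshot.disk_total_bytes:
        alerts.append("disk")          # storage above 90% full
    if p95(snapshot.query_durations_ms) > latency_threshold_ms:
        alerts.append("slow_queries")  # p95 latency above the threshold
    return alerts
```

Keeping the evaluation separate from data collection makes the thresholds easy to unit-test and to tune without touching the code that talks to the server.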
Example Alert Queries
High Error Rate:
Monitoring Best Practices
- Set up automated health checks for uptime monitoring
- Monitor query performance regularly via system.query_log
- Track resource usage (CPU, memory, disk) trends
- Configure log rotation to prevent disk space issues
- Set up alerts for critical metrics (errors, latency, memory)
- Use Grafana dashboards for visualization
- Review slow queries weekly and optimize
- Monitor streaming query health for long-running queries
- Track checkpoint sizes for stateful queries
- Keep historical metrics for capacity planning
Troubleshooting Common Issues
High Memory Usage
Slow Queries
Connection Issues
Next Steps
- Optimize performance with Performance Tuning
- Configure alerts and logging in Configuration
- Review Deployment best practices