I came across this list in a blog post and am adding it here as a checklist. It's worth keeping around so you can understand how your Java app behaves in production.
- Response times and throughput
- Load average
- Error rates (and how to solve them)
- GC rate and pause duration
- Business metrics
- Uptime and service health
- Log size
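
A few of these items (load average, GC activity, uptime) can be read from inside the JVM through the standard `java.lang.management` beans. The following is a minimal sketch, not a monitoring setup: the class name `JvmMetricsSnapshot` and the map keys are made up for illustration, and cumulative GC time is only a rough proxy for pause duration. Response times, error rates, business metrics, and log size need application-level or OS-level instrumentation that is not shown here.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: collects a handful of checklist metrics via JMX beans.
public class JvmMetricsSnapshot {

    public static Map<String, Number> snapshot() {
        Map<String, Number> metrics = new LinkedHashMap<>();

        // System load average over the last minute; negative if the platform
        // does not expose it (for example on Windows).
        metrics.put("loadAverage",
                ManagementFactory.getOperatingSystemMXBean().getSystemLoadAverage());

        // Sum collection counts and cumulative collection time across all collectors.
        // Either value is -1 when a collector does not report it, so skip negatives.
        long gcCount = 0;
        long gcTimeMs = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount();
            long timeMs = gc.getCollectionTime();
            if (count > 0) gcCount += count;
            if (timeMs > 0) gcTimeMs += timeMs;
        }
        metrics.put("gcCount", gcCount);
        metrics.put("gcTimeMs", gcTimeMs);

        // Milliseconds since the JVM started.
        metrics.put("uptimeMs", ManagementFactory.getRuntimeMXBean().getUptime());

        return metrics;
    }

    public static void main(String[] args) {
        snapshot().forEach((name, value) -> System.out.println(name + " = " + value));
    }
}
```

Calling `snapshot()` at a fixed interval and diffing `gcCount` and `gcTimeMs` between calls gives an approximate GC rate. In practice you would probably export these values to whatever monitoring system you already use rather than print them.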