Service Level Indicator (SLI)
A specific number that measures what users actually experience from a service, like success rate, response time, or error rate.
Monitoring and Logging
DevOps glossary terms in Monitoring and Logging.
A specific number that measures what users actually experience from a service, like success rate, response time, or error rate.
Monitoring and Logging
Automated rotation, compression, and retention of log files to prevent unbounded growth and simplify troubleshooting and storage.
Monitoring and Logging
Observability uses logs, metrics, and traces to infer system state and quickly troubleshoot performance and reliability issues.
Monitoring and Logging
Open source distributed tracing backend from Grafana Labs for ingesting, storing, and querying traces to troubleshoot microservices performance.
Monitoring and Logging
Distributed search and analytics engine for indexing, querying, and aggregating large datasets in near real time.
Monitoring and Logging
Live view of system operations.
Monitoring and Logging
Linux kernel tech for running tiny safe programs in the kernel to trace, measure, and sometimes control system and network behavior.
Monitoring and Logging
Prometheus is an open-source monitoring and alerting toolkit that scrapes time-series metrics from services and evaluates queries and alert rules.
Monitoring and Logging
OpenTelemetry is an open-source observability framework that standardizes traces, metrics, and logs from app services.
Monitoring and Logging
Prometheus Recording Rule is a Prometheus rule that precomputes PromQL into time series for faster alerts.
Monitoring and Logging