Using the Prometheus query log
Prometheus has the ability to log all the queries run by the engine to a log file, as of 2.16.0. This guide demonstrates how to use that log file, which fields it contains, and provides advanced tips about how to operate the log file.
Enable the query log
The query log can be toggled at runtime. It can therefore be activated when you want to investigate slownesses or high load on your Prometheus instance.
To enable or disable the query log, two steps are needed:
- Adapt the configuration to add or remove the query log configuration.
- Reload the Prometheus server configuration.
Logging all the queries to a file
This example demonstrates how to log all the queries to a file called /prometheus/query.log
. We will assume that /prometheus
is the data directory and that Prometheus has write access to it.
First, adapt the prometheus.yml
configuration file:
global:
scrape_interval: 15s
evaluation_interval: 15s
query_log_file: /prometheus/query.log
scrape_configs:
- job_name: 'prometheus'
static_configs:
- targets: ['localhost:9090']
Then, reload the Prometheus configuration:
$ curl -X POST http://127.0.0.1:9090/-/reload
Or, if Prometheus is not launched with --web.enable-lifecycle
, and you’re not running on Windows, you can trigger the reload by sending a SIGHUP to the Prometheus process.
The file /prometheus/query.log
should now exist and all the queries will be logged to that file.
To disable the query log, repeat the operation but remove query_log_file
from the configuration.
Verifying if the query log is enabled
Prometheus conveniently exposes metrics that indicates if the query log is enabled and working:
# HELP prometheus_engine_query_log_enabled State of the query log.
# TYPE prometheus_engine_query_log_enabled gauge
prometheus_engine_query_log_enabled 0
# HELP prometheus_engine_query_log_failures_total The number of query log failures.
# TYPE prometheus_engine_query_log_failures_total counter
prometheus_engine_query_log_failures_total 0
The first metric, prometheus_engine_query_log_enabled
is set to 1 of the query log is enabled, and 0 otherwise. The second one, prometheus_engine_query_log_failures_total
, indicates the number of queries that could not be logged.
Format of the query log
The query log is a JSON-formatted log. Here is an overview of the fields present for a query:
{
"params": {
"end": "2020-02-08T14:59:50.368Z",
"query": "up == 0",
"start": "2020-02-08T13:59:50.368Z",
"step": 5
},
"stats": {
"timings": {
"evalTotalTime": 0.000447452,
"execQueueTime": 7.599e-06,
"execTotalTime": 0.000461232,
"innerEvalTime": 0.000427033,
"queryPreparationTime": 1.4177e-05,
"resultSortTime": 6.48e-07
}
},
"ts": "2020-02-08T14:59:50.387Z"
}
params
: The query. The start and end timestamp, the step and the actual query statement.stats
: Statistics. Currently, it contains internal engine timers.ts
: The timestamp when the query ended.
Additionally, depending on what triggered the request, you will have additional fields in the JSON lines.
API Queries and consoles
HTTP requests contain the client IP, the method, and the path:
{
"httpRequest": {
"clientIP": "127.0.0.1",
"method": "GET",
"path": "/api/v1/query_range"
}
}
The path will contain the web prefix if it is set, and can also point to a console.
The client IP is the network IP address and does not take into consideration the headers like X-Forwarded-For
. If you wish to log the original caller behind a proxy, you need to do so in the proxy itself.
Recording rules and alerts
Recording rules and alerts contain a ruleGroup element which contains the path of the file and the name of the group:
{
"ruleGroup": {
"file": "rules.yml",
"name": "partners"
}
}
Rotating the query log
Prometheus will not rotate the query log itself. Instead, you can use external tools to do so.
One of those tools is logrotate. It is enabled by default on most Linux distributions.
Here is an example of file you can add as /etc/logrotate.d/prometheus
:
/prometheus/query.log {
daily
rotate 7
compress
delaycompress
postrotate
killall -HUP prometheus
endscript
}
That will rotate your file daily and keep one week of history.