/api/query¶
Probably the most useful endpoint in the API, /api/query enables extracting data from the storage system in various formats determined by the serializer selected. Queries can be submitted via the 1.0 query string format or body content.
Query API Endpoints¶
The /query endpoint is documented below. As of 2.2, data matching a query can be deleted by using the DELETE verb. The configuration parameter tsd.http.query.allow_delete must be enabled to allow deletions. Data that is deleted will be returned in the query results. Executing the query a second time should return empty results.
Warning
Deleting data is permanent. Also beware that when deleting, some data outside the boundaries of the start and end times may be deleted as data is stored on an hourly basis.
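As a sketch, assuming tsd.http.query.allow_delete has been enabled and using a hypothetical metric and tag, a deletion can be issued either by sending a normal query URI with the DELETE verb or by adding the delete flag to a POST body:
DELETE http://localhost:4242/api/query?start=2013/01/01-00:00:00&end=2013/01/01-01:00:00&m=sum:sys.cpu.user{host=web01}
{
    "start": 1356998400,
    "end": 1357002000,
    "delete": true,
    "queries": [
        {
            "aggregator": "sum",
            "metric": "sys.cpu.user",
            "tags": {
                "host": "web01"
            }
        }
    ]
}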
Verbs¶
GET
POST
DELETE
Requests¶
Request parameters include:
Name | Data Type | Required | Description | Default | QS | RW | Example
---|---|---|---|---|---|---|---
start | String, Integer | Required | The start time for the query. This can be a relative or absolute timestamp. See Querying or Reading Data for details. | | start | | 1h-ago
end | String, Integer | Optional | An end time for the query. If not supplied, the TSD will assume the local system time on the server. This may be a relative or absolute timestamp. See Querying or Reading Data for details. | current time | end | | 1s-ago
queries | Array | Required | One or more sub queries used to select the time series to return. These may be metric `m` or TSUID `tsuids` queries. | | m or tsuids | | See below
noAnnotations | Boolean | Optional | Whether or not to return annotations with a query. The default is to return annotations for the requested timespan but this flag can disable the return. This affects both local and global notes and overrides `globalAnnotations`. | false | no_annotations | | false
globalAnnotations | Boolean | Optional | Whether or not the query should retrieve global annotations for the requested timespan. | false | global_annotations | | true
msResolution (or ms) | Boolean | Optional | Whether or not to output data point timestamps in milliseconds or seconds. The msResolution flag is recommended. If this flag is not provided and there are multiple data points within a second, those data points will be downsampled using the query's aggregation function. | false | ms | | true
showTSUIDs | Boolean | Optional | Whether or not to output the TSUIDs associated with time series in the results. If multiple time series were aggregated into one set, multiple TSUIDs will be returned in a sorted manner. | false | show_tsuids | | true
showSummary (2.2) | Boolean | Optional | Whether or not to show a summary of timings surrounding the query in the results. This creates another object in the map that is unlike the data point objects. See Query Details and Stats. | false | show_summary | | true
showStats (2.2) | Boolean | Optional | Whether or not to show detailed timings surrounding the query in the results. This creates another object in the map that is unlike the data point objects. See Query Details and Stats. | false | show_stats | | true
showQuery (2.2) | Boolean | Optional | Whether or not to return the original sub query with the query results. If the request contains many sub queries then this is a good way to determine which results belong to which sub query. Note that in the case of a `*` or wildcard query, this can produce a lot of duplicate output. | false | show_query | | true
delete | Boolean | Optional | Can be passed to the JSON with a POST to delete any data points that match the given query. | false | | W | true
timezone (2.3) | String | Optional | An optional timezone for calendar-based downsampling. Must be a valid timezone database name supported by the JRE installed on the TSD server. | UTC | timezone | | Asia/Kabul
useCalendar (2.3) | Boolean | Optional | Whether or not to use the calendar based on the given timezone for downsampling intervals. | false | | | true
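For illustration, a GET request combining several of these parameters (the metric and tag are hypothetical) might look like:
http://localhost:4242/api/query?start=1h-ago&end=1s-ago&ms=true&show_summary=true&show_query=true&m=sum:sys.cpu.user{host=web01}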
Sub Queries¶
An OpenTSDB query requires at least one sub query, a means of selecting which time series should be included in the result set. There are two types:
Metric Query - The full name of a metric is supplied along with an optional list of tags. This is optimized for aggregating multiple time series into one result.
TSUID Query - A list of one or more TSUIDs that share a common metric. This is optimized for fetching individual time series where aggregation is not required.
A query can include more than one sub query and any mixture of the two types. When submitting a query via content body, if a list of TSUIDs is supplied, the metric and tags for that particular sub query will be ignored.
Each sub query can retrieve individual or groups of timeseries data, performing aggregation or grouping calculations on each set. Fields for each sub query include:
Name | Data Type | Required | Description | Default | Example
---|---|---|---|---|---
aggregator | String | Required | The name of an aggregation function to use. See /api/aggregators. | | sum
metric | String | Required | The name of a metric stored in the system. | | sys.cpu.0
rate | Boolean | Optional | Whether or not the data should be converted into deltas before returning. This is useful if the metric is a continuously incrementing counter and you want to view the rate of change between data points. | false | true
rateOptions | Map | Optional | Monotonically increasing counter handling options. | See below | See below
downsample | String | Optional | An optional downsampling function to reduce the amount of data returned. | See below | 5m-avg
tags | Map | Optional | To drill down to specific time series or group results by tag, supply one or more map values in the same format as the query string. Tags are converted to filters in 2.2; see the notes below about conversions. Note that if no tags are specified, all metrics in the system will be aggregated into the results. Deprecated in 2.2. | | See below
filters (2.2) | List | Optional | Filters the time series emitted in the results. Note that if no filters are specified, all time series for the given metric will be aggregated into the results. | | See below
explicitTags (2.3) | Boolean | Optional | Returns the series that include only the tag keys provided in the filters. | false | true
percentiles (2.4) | List | Optional | Fetches histogram data for the metric and computes the given list of percentiles on the data. Percentiles are floating point values from 0 to 100. More details below. | | [99.9, 95.0, 75.0]
rollupUsage (2.4) | String | Optional | An optional fallback mode when fetching rollup data. | ROLLUP_NOFALLBACK | ROLLUP_RAW
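As a minimal sketch, a single metric sub query using these fields (the metric and filter values are hypothetical) could look like:
{
    "aggregator": "sum",
    "metric": "sys.cpu.user",
    "rate": true,
    "downsample": "5m-avg",
    "filters": [
        {
            "type": "literal_or",
            "tagk": "host",
            "filter": "web01",
            "groupBy": true
        }
    ]
}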
Rate Options¶
When passing rate options in a query string, the options must be enclosed in curly braces. For example: m=sum:rate{counter,,1000}:if.octets.in. If you wish to use the default counterMax but do want to supply a resetValue, you must add two commas as in the previous example. Additional fields in the rateOptions object include the following:
Name | Data Type | Required | Description | Default | Example
---|---|---|---|---|---
counter | Boolean | Optional | Whether or not the underlying data is a monotonically increasing counter that may roll over. | false | true
counterMax | Integer | Optional | A positive integer representing the maximum value for the counter. | Java Long.MaxValue | 65535
resetValue | Integer | Optional | An optional value that, when exceeded, will cause the aggregator to return a 0 instead of the calculated rate. Useful when data sources are reset so as to avoid spurious spikes. | 0 | 65000
dropResets | Boolean | Optional | Whether or not to simply drop rolled-over or reset data points. | false | true
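For example, the query string rate options rate{counter,,1000}:if.octets.in shown above correspond roughly to the following POST sub query fields (counterMax is left at its default):
{
    "aggregator": "sum",
    "metric": "if.octets.in",
    "rate": true,
    "rateOptions": {
        "counter": true,
        "resetValue": 1000
    }
}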
Downsampling¶
Downsample specifications consist of an interval, a unit of time, the c flag for calendar-based downsampling (as of 2.3), an aggregator and (as of 2.2) an optional fill policy. The format of a downsample spec is:
<interval><units>[c]-<aggregator>[-<fill policy>]
For example:
1h-sum
30m-avg-nan
24h-max-zero
1dc-sum
0all-sum
See Downsampling for details on downsampling, a list of supported fill policies and how calendar based downsampling operates.
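For example, assuming a hypothetical metric sys.cpu.user, an hourly average with NaN fill and a calendar-based daily sum aligned to a timezone would be embedded in a query string as:
http://localhost:4242/api/query?start=1d-ago&m=sum:1h-avg-nan:sys.cpu.user{host=web01}
http://localhost:4242/api/query?start=1w-ago&timezone=Asia/Kabul&m=sum:1dc-sum:sys.cpu.user{host=web01}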
Filters¶
New for 2.2, OpenTSDB includes expanded and pluggable filters across tag key and value combinations. For a list of filters loaded in the TSD, see /api/config/filters. For descriptions of the built-in filters see Query Filters. Filters can be used in both query string and POST formatted queries. Multiple filters on the same tag key are allowed and, when processed, they are ANDed together; e.g. if we have two filters host=literal_or(web01) and host=literal_or(web02), the query will always return an empty result set. If two or more filters are included for the same tag key and one has group by enabled but another does not, then group by will effectively be true for all filters on that tag key. Fields for POST queries pertaining to filters include:
Name | Data Type | Required | Description | Default | Example
---|---|---|---|---|---
type | String | Required | The name of the filter to invoke. See /api/config/filters. | | regexp
tagk | String | Required | The tag key to invoke the filter on. | | host
filter | String | Required | The filter expression to evaluate; its format depends on the filter being used. | | web.*.mysite.com
groupBy | Boolean | Optional | Whether or not to group the results by each value matched by the filter. By default all values matching the filter will be aggregated into a single series. | false | true
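For example, a single wildcard filter that groups results by host (the tag value is hypothetical) would be expressed in a POST sub query as:
"filters": [
    {
        "type": "wildcard",
        "tagk": "host",
        "filter": "web*",
        "groupBy": true
    }
]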
For URI queries, the type precedes the filter expression in parentheses. The format is <tagk>=<type>(<filter_expression>). Whether or not results are grouped depends on which set of curly braces the filter is in. Two sets of curly braces are now supported per metric query: the first set is the group by filter and the second is a non group by filter, e.g. {host=wildcard(web*)}{colo=regexp(sjc.*)}. This specifies any metrics where the colo matches the regex expression “sjc.*” and the host tag value starts with “web”, and the results are grouped by host. If you only want to filter without grouping then the first curly set must be empty, e.g. {}{host=wildcard(web*),colo=regexp(sjc.*)}. This specifies any metrics where colo matches the regex expression “sjc.*” and the host tag value starts with “web”, and the results are not grouped.
Note
Regular expression filters, wildcard filters with a pre/post/in-fix, or literal ORs with many values can cause queries to return more slowly, as each row of data must be resolved to its string values and then processed.
Note
When submitting a JSON query to OpenTSDB 2.2 or later, use either tags OR filters. Only one will take effect and the order is indeterminate as the JSON parser may deserialize one before the other. We recommend using filters for all future queries.
Filter Conversions¶
Values in the POST query tags map and the group by curly brace of URI queries are automatically converted to filters to provide backwards compatibility with existing systems. The auto conversions include:
Example | Description
---|---
<tag_name>=* | Wildcard filter, effectively makes sure the tag key is present in the series
<tag_name>=value | Case sensitive literal OR filter
<tag_name>=value1\|value2\|valueN | Case sensitive literal OR filter
<tag_name>=*partial* | Case insensitive wildcard filter. An asterisk (star) with any other strings now becomes a wildcard filter shortcut
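For example, a 2.1-style tags map such as the following (the tag values are hypothetical) is converted automatically into a group-by wildcard filter on host and a group-by literal_or filter on dc:
"tags": {
    "host": "*",
    "dc": "lga|lga1"
}
is treated as:
"filters": [
    {
        "type": "wildcard",
        "tagk": "host",
        "filter": "*",
        "groupBy": true
    },
    {
        "type": "literal_or",
        "tagk": "dc",
        "filter": "lga|lga1",
        "groupBy": true
    }
]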
Percentiles¶
With OpenTSDB 2.4, the database can store and query histogram or digest data for accurate percentile calculations (as opposed to the built-in percentile aggregators). If one or more percentiles are requested in a query, the TSD will scan storage explicitly for histograms (of any codec type) and regular numeric data will be ignored. More than one percentile can be computed at the same time; for example, it may be common to fetch the 99.999th, 99.9th, 99.0th and 95th percentiles in one query via percentiles[99.999, 99.9, 99.0, 95.0]. NOTE: For some plugin implementations (such as the Yahoo Data Sketches implementation) the percentile list must be given in descending sorted order.
Results are serialized in the same way as regular data point time series for compatibility with existing graph systems. However, the percentile will be appended to the metric name and a time series for each group-by and percentile will be returned. For example, if the user asks for percentiles[99.9,75.0] over the sys.cpu.nice metric, the results will have time series sys.cpu.nice_pct_99.9 and sys.cpu.nice_pct_75.0.
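Putting this together with the metric query string format below, a URI query requesting those two percentiles over sys.cpu.nice might look like:
http://localhost:4242/api/query?start=1h-ago&m=sum:percentiles[99.9,75.0]:sys.cpu.nice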
Note
Currently the only supported downsampling and aggregation operator over histogram data is SUM. This is the most common use case, in that you may want to group all of the hosts for a server by colo, in which case we'll sum all of the histograms and then compute the percentiles based on the sums. Likewise, if you want to downsample to every hour, the grouped histograms are again summed over time and finally the percentiles are extracted.
Metric Query String Format¶
The full specification for a metric query string sub query is as follows:
m=<aggregator>:[rate[{counter[,<counter_max>[,<reset_value>]]}]:][<down_sampler>:][percentiles\[<p1>, <pn>\]:][explicit_tags:]<metric_name>[{<tag_name1>=<grouping filter>[,...<tag_nameN>=<grouping_filter>]}][{<tag_name1>=<non grouping filter>[,...<tag_nameN>=<non_grouping_filter>]}]
It can be a little daunting at first but you can break it down into components. If you’re ever confused, try using the built-in GUI to plot a graph the way you want it, then look at the URL to see how the query is formatted. Changes to any of the form fields will update the URL (which you can actually copy and paste to share with other users). For examples, please see Query Examples.
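As an illustration (the metric and tag values are hypothetical), the following query string exercises most of the optional components in the order given above: a sum aggregator, a rate with a reset value of 1000, five minute average downsampling, explicit tags, a group-by filter on host and a non-grouping filter on colo:
m=sum:rate{counter,,1000}:5m-avg:explicit_tags:sys.if.bytes.out{host=wildcard(web*)}{colo=literal_or(lga)}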
TSUID Query String Format¶
TSUID queries are simpler than Metric queries. Simply pass a list of one or more hexadecimal encoded TSUIDs separated by commas:
tsuid=<aggregator>:<tsuid1>[,...<tsuidN>]
Example Query String Requests¶
http://localhost:4242/api/query?start=1h-ago&m=sum:rate:proc.stat.cpu{host=foo,type=idle}
http://localhost:4242/api/query?start=1h-ago&tsuid=sum:000001000002000042,000001000002000043
Example Content Request¶
Please see the serializer documentation for request information: HTTP Serializers. The following examples pertain to the default JSON serializer.
{
    "start": 1356998400,
    "end": 1356998460,
    "queries": [
        {
            "aggregator": "sum",
            "metric": "sys.cpu.0",
            "rate": true,
            "tags": {
                "host": "*",
                "dc": "lga"
            }
        },
        {
            "aggregator": "sum",
            "tsuids": [
                "000001000002000042",
                "000001000002000043"
            ]
        }
    ]
}
2.2 query with filters
{
    "start": 1356998400,
    "end": 1356998460,
    "queries": [
        {
            "aggregator": "sum",
            "metric": "sys.cpu.0",
            "rate": true,
            "filters": [
                {
                    "type": "wildcard",
                    "tagk": "host",
                    "filter": "*",
                    "groupBy": true
                },
                {
                    "type": "literal_or",
                    "tagk": "dc",
                    "filter": "lga|lga1|lga2",
                    "groupBy": false
                }
            ]
        },
        {
            "aggregator": "sum",
            "tsuids": [
                "000001000002000042",
                "000001000002000043"
            ]
        }
    ]
}
Response¶
The output generated for a query depends heavily on the chosen serializer (see HTTP Serializers). A request may result in multiple sets of data returned, particularly if the request included multiple queries or grouping was requested. Some common fields included with each data set in the response will be:
Name | Description
---|---
metric | Name of the metric retrieved for the time series
tags | A list of tags only returned when the results are for a single time series. If results are aggregated, this value may be null or an empty map
aggregatedTags | If more than one time series were included in the result set, i.e. they were aggregated, this will display a list of tag names that were found in common across all time series
dps | Retrieved data points after being processed by the aggregators. Each data point consists of a timestamp and a value, the format determined by the serializer
annotations | If the query retrieved annotations for time series over the requested timespan, they will be returned in this group. Annotations for every time series will be merged into one set and sorted by start time
globalAnnotations | If requested by the user, the query will scan for global annotations during the timespan and the results returned in this group
Unless there was an error with the query, you will generally receive a 200 status with content. However, if your query couldn't find any data, it will return an empty result set. In the case of the JSON serializer, the result will be an empty array:
[]
For the JSON serializer, the timestamp will always be a Unix epoch style integer followed by the value as an integer or a floating point. For example, the default output is "dps":{"<timestamp>":<value>}. By default the timestamps will be in seconds. If the msResolution flag is set, then the timestamps will be in milliseconds.
Example Aggregated Default Response¶
[
    {
        "metric": "tsd.hbase.puts",
        "tags": {},
        "aggregatedTags": [
            "host"
        ],
        "annotations": [
            {
                "tsuid": "00001C0000FB0000FB",
                "description": "Testing Annotations",
                "notes": "These would be details about the event, the description is just a summary",
                "custom": {
                    "owner": "jdoe",
                    "dept": "ops"
                },
                "endTime": 0,
                "startTime": 1365966062
            }
        ],
        "globalAnnotations": [
            {
                "description": "Notice",
                "notes": "DAL was down during this period",
                "custom": null,
                "endTime": 1365966164,
                "startTime": 1365966064
            }
        ],
        "tsuids": [
            "0023E3000002000008000006000001"
        ],
        "dps": {
            "1365966001": 25595461080,
            "1365966061": 25595542522,
            "1365966062": 25595543979,
            ...
            "1365973801": 25717417859
        }
    }
]
Example Aggregated Array Response¶
[
    {
        "metric": "tsd.hbase.puts",
        "tags": {},
        "aggregatedTags": [
            "host"
        ],
        "dps": [
            [
                1365966001,
                25595461080
            ],
            [
                1365966061,
                25595542522
            ],
            ...
            [
                1365974221,
                25722266376
            ]
        ]
    }
]
Example Multi-Set Response¶
For the following example, two TSDs were running and the query was: http://localhost:4242/api/query?start=1h-ago&m=sum:tsd.hbase.puts{host=*}. This returns two explicit time series.
[
    {
        "metric": "tsd.hbase.puts",
        "tags": {
            "host": "tsdb-1.mysite.com"
        },
        "aggregatedTags": [],
        "dps": {
            "1365966001": 3758788892,
            "1365966061": 3758804070,
            ...
            "1365974281": 3778141673
        }
    },
    {
        "metric": "tsd.hbase.puts",
        "tags": {
            "host": "tsdb-2.mysite.com"
        },
        "aggregatedTags": [],
        "dps": {
            "1365966001": 3902179270,
            "1365966062": 3902197769,
            ...
            "1365974281": 3922266478
        }
    }
]
Example With Show Summary and Query¶
[
    {
        "metric": "tsd.hbase.puts",
        "tags": {},
        "aggregatedTags": [
            "host"
        ],
        "query": {
            "aggregator": "sum",
            "metric": "tsd.hbase.puts",
            "tsuids": null,
            "downsample": null,
            "rate": true,
            "explicitTags": false,
            "filters": [
                {
                    "tagk": "host",
                    "filter": "*",
                    "group_by": true,
                    "type": "wildcard"
                }
            ],
            "rateOptions": null,
            "tags": {}
        },
        "dps": {
            "1365966001": 25595461080,
            "1365966061": 25595542522,
            "1365966062": 25595543979,
            ...
            "1365973801": 25717417859
        }
    },
    {
        "statsSummary": {
            "datapoints": 0,
            "rawDatapoints": 56,
            "aggregationTime": 0,
            "serializationTime": 20,
            "storageTime": 6,
            "timeTotal": 26
        }
    }
]