histogram

The histogram aggregation in APL allows you to create a histogram that groups numeric values into intervals or “bins.” This is useful for visualizing the distribution of data, such as the frequency of response times, request durations, or other continuous numerical fields. You can use it to analyze patterns and trends in datasets like logs, traces, or metrics. It is especially helpful when you need to summarize a large volume of data into a digestible form, providing insights on the distribution of values.

The histogram aggregation is ideal for identifying peaks, valleys, and outliers in your data. For example, you can analyze the distribution of request durations in web server logs or span durations in OpenTelemetry traces to understand performance bottlenecks.

The histogram aggregation in APL is a statistical aggregation that returns estimated results. The estimation comes with the benefit of speed at the expense of accuracy. This means that histogram is fast and light on resources even on a large or high-cardinality dataset, but it doesn’t provide precise results.

For users of other query languages

If you come from other query languages, this section explains how to adjust your existing queries to achieve the same results in APL.

Splunk SPL users

In Splunk SPL, a similar operation to APL’s histogram is the timechart or histogram command, which groups events into time buckets. However, in APL, the histogram function focuses on numeric values, allowing you to control the number of bins precisely.

| stats count by duration | timechart span=10 count

ANSI SQL users

In ANSI SQL, you can use the GROUP BY clause combined with range calculations to achieve a similar result to APL’s histogram. However, APL’s histogram function simplifies the process by automatically calculating bin intervals.

SELECT COUNT(*), FLOOR(req_duration_ms/10)*10 as duration_bin
FROM sample_http_logs
GROUP BY duration_bin

Usage

Syntax

histogram(numeric_field, number_of_bins)

Parameters

numeric_field: The numeric field to create a histogram for. For example, request duration or span duration.
number_of_bins: The number of bins (intervals) to use for grouping the numeric values.

Returns

The histogram aggregation returns a table where each row represents a bin, along with the number of occurrences (counts) that fall within each bin.

Use case examples

You can use the histogram aggregation to analyze the distribution of request durations in web server logs.

Query

['sample-http-logs']
| summarize histogram(req_duration_ms, 100) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	50
100	200
200	120

This query creates a histogram that groups request durations into bins of 100 milliseconds and shows the count of requests in each bin. It helps you visualize how frequently requests fall within certain duration ranges.

You can use the histogram aggregation to analyze the distribution of request durations in web server logs.

Query

['sample-http-logs']
| summarize histogram(req_duration_ms, 100) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	50
100	200
200	120

In OpenTelemetry traces, you can use the histogram aggregation to analyze the distribution of span durations.

Query

['otel-demo-traces']
| summarize histogram(duration, 100) by bin_auto(_time)

Run in Playground

Output

duration_bin	count
0.1s	30
0.2s	120
0.3s	50

This query groups the span durations into 100ms intervals, making it easier to spot latency issues in your traces.

In security logs, the histogram aggregation helps you understand the frequency distribution of request durations to detect anomalies or attacks.

Query

['sample-http-logs']
| where status == '200'
| summarize histogram(req_duration_ms, 50) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	150
50	400
100	100

This query analyzes the request durations for HTTP 200 (Success) responses, helping you identify patterns in security-related events.

percentile: Use percentile when you need to find the specific value below which a percentage of observations fall, which can provide more precise distribution analysis.
avg: Use avg for calculating the average value of a numeric field, useful when you are more interested in the central tendency rather than distribution.
sum: The sum function adds up the total values in a numeric field, helpful for determining overall totals.
count: Use count when you need a simple tally of rows or events, often in conjunction with histogram for more basic summarization.

dcountif make_list

On this page

For users of other query languages
Usage
Syntax
Parameters
Returns
Use case examples
List of related aggregations

For users of other query languages

If you come from other query languages, this section explains how to adjust your existing queries to achieve the same results in APL.

Splunk SPL users

| stats count by duration | timechart span=10 count

ANSI SQL users

SELECT COUNT(*), FLOOR(req_duration_ms/10)*10 as duration_bin
FROM sample_http_logs
GROUP BY duration_bin

Usage

Syntax

histogram(numeric_field, number_of_bins)

Parameters

numeric_field: The numeric field to create a histogram for. For example, request duration or span duration.
number_of_bins: The number of bins (intervals) to use for grouping the numeric values.

Returns

The histogram aggregation returns a table where each row represents a bin, along with the number of occurrences (counts) that fall within each bin.

Use case examples

You can use the histogram aggregation to analyze the distribution of request durations in web server logs.

Query

['sample-http-logs']
| summarize histogram(req_duration_ms, 100) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	50
100	200
200	120

You can use the histogram aggregation to analyze the distribution of request durations in web server logs.

Query

['sample-http-logs']
| summarize histogram(req_duration_ms, 100) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	50
100	200
200	120

In OpenTelemetry traces, you can use the histogram aggregation to analyze the distribution of span durations.

Query

['otel-demo-traces']
| summarize histogram(duration, 100) by bin_auto(_time)

Run in Playground

Output

duration_bin	count
0.1s	30
0.2s	120
0.3s	50

This query groups the span durations into 100ms intervals, making it easier to spot latency issues in your traces.

In security logs, the histogram aggregation helps you understand the frequency distribution of request durations to detect anomalies or attacks.

Query

['sample-http-logs']
| where status == '200'
| summarize histogram(req_duration_ms, 50) by bin_auto(_time)

Run in Playground

Output

req_duration_ms_bin	count
0	150
50	400
100	100

This query analyzes the request durations for HTTP 200 (Success) responses, helping you identify patterns in security-related events.

percentile: Use percentile when you need to find the specific value below which a percentage of observations fall, which can provide more precise distribution analysis.
avg: Use avg for calculating the average value of a numeric field, useful when you are more interested in the central tendency rather than distribution.
sum: The sum function adds up the total values in a numeric field, helpful for determining overall totals.
count: Use count when you need a simple tally of rows or events, often in conjunction with histogram for more basic summarization.

dcountif make_list

On this page

For users of other query languages
Usage
Syntax
Parameters
Returns
Use case examples
List of related aggregations

For users of other query languages

Usage

Syntax

Parameters

Returns

Use case examples

Get started

Functions

Operators

Reference

Migration

histogram

For users of other query languages

Usage

Syntax

Parameters

Returns

Use case examples

​For users of other query languages

​Usage

​Syntax

​Parameters

​Returns

​Use case examples

​List of related aggregations

Get started

Functions

Operators

Reference

Migration

​For users of other query languages

​Usage

​Syntax

​Parameters

​Returns

​Use case examples

​List of related aggregations

For users of other query languages

Usage

Syntax

Parameters

Returns

Use case examples

List of related aggregations

For users of other query languages

Usage

Syntax

Parameters

Returns

Use case examples

List of related aggregations