Search commands > stats, chart, and timechart | Splunk (2024)

Thestats,chart, andtimechartcommands are great commands to know (especiallystats). When I first started learning about the Splunk search commands, I found it challenging to understand the benefits of each command, especially how the BY clause impacts the output of a search. It wasn't until I did a comparison of the output (with some trial and a whole lotta error) that I was able to understand the differences between the commands.

These three commands are transforming commands. A transforming command takes your event data and converts it into an organized results table. You can use these three commands to calculate statistics, such as count, sum, and average.

Note:The BY keyword is shown in these examples and in the Splunk documentation in uppercase for readability. You can use uppercase or lowercase in your searches when you specify the BY keyword.

The Stats Command Results Table

Let's start with thestatscommand. We are going to count the number of events for each HTTP status code.

... | stats count BY status

The count of the events for each unique status code is listed in separate rows in a table on the Statistics tab:

statuscount
20034282
400701
403228
404690


Basically the field values (200, 400, 403, 404) become row labels in the results table.

For thestatscommand, fields that you specify in the BY clause group the results based on those fields. For example, we receive events from three different hosts: www1, www2, and www3. If we add thehostfield to our BY clause, the results are broken out into more distinct groups.

... | stats count BY status, host

Each unique combination of status and host is listed on aseparaterowin the results table.

statushostcount
200www111835
200www211186
200www311261
400www1233
400www2257
400www3211
403www2228
404www1244
404www2209
404www3

237

Each field you specify in the BY clause becomes a separate column in the results table. You're splitting the rows first on status, then on host. The fields that you specify in the BY clause of thestatscommand are referred to as <row-split> fields.

In this example, there are fiveactionsthat customers can take on our website: addtocart, changequantity, purchase, remove, and view.

Let's add action to the search.

... | stats count BY status, host, action

You are splitting the rows first on status, then on host, and then on action. Below is a partial list of the results table that is produced when we add theactionfield to the BYclause:

statushostactioncount
200www1addtocart1837
200www1changequantity428
200www1purchase1860
200www1remove432
200www1view1523
200www2addtocart1743
200www2changequantity365
200www2purchase1742

One big advantage of using thestatscommand is that you can specify more than two fields in the BY clause and create results tables that show very granular statistical calculations.

Chart Command Results Table

Using the same basic search, let's compare the results produced by thechartcommand with the results produced by thestatscommand.

If you specify only one BY field, the results from thestatsandchartcommands are identical. Using thechartcommand in the search with two BY fields is where you really see differences.

Remember the results returned when we used the stats command with two BY fields are:

statushostcount
200www111835
200www211186
200www311261
400www1233
400www2257
400www3211
403www2228
404www1244
404www2209
404www3

237

Now let's substitute thechartcommand for thestatscommand in the search.

... | chart count BY status, host

The search returns the following results:

statuswww1www2www3
200118351118611261
400233257211
40302880
404244209237

Thechartcommand uses the first BY field,status, to group the results. For each unique value in thestatusfield, the results appear on a separate row. This first BY field is referred to as the <row-split>field. Thechartcommand uses the second BY field,host, to split the results into separate columns. This second BY field is referred to as the <column-split> field. The values for thehostfield become the column labels.

Notice the results for the 403 status code in both results tables. With thestatscommand, there are no results forthe 403 status code andthe www1 and www3 hosts. With thechartcommand, when there are no events for the <column-split> field that contain the value for the <row-split> field, a 0 is returned.

One important difference between thestatsandchartcommands is how many fields you can specify in the BY clause.

With thestatscommand, you can specify a list of fields in the BY clause, all of which are <row-split> fields. The syntax for thestatscommand BY clause is:

BY <field-list>

For thechartcommand, you can specify at most two fields. One <row-split> field and one <column-split> field.

Thechartcommand provides two alternative ways to specify these fields in the BY clause. For example:

... | chart count BY status, host

... | chart count OVER status BY host

The syntax for thechartcommand BY clause is:

[ BY <row-split> <column-split> ] | [ OVER <row-split> ] [BY <column-split>] ]

The advantage of using thechartcommand is that it creates a consolidated results table that is better for creating charts. Let me show you what I mean.

Stats and Chart Command Visualizations

When you run the stats and chart commands, the event data is transformed into results tables that appear on the Statistics tab. Click theVisualizationtab to generate a graph from the results. Here is the visualization for thestatscommand results table:

Search commands > stats, chart, and timechart | Splunk (1)

Thestatusfield forms the X-axis, and thehostandcountfields form the data series. The range of count values form the Y-axis.

There are several problems with this chart:

  1. There are multiple values for the same status code on the X-axis.
  2. Thehostvalues (www1, www2, and www3) are string values and cannot be measured in the chart. Thehostshows up in the legend, but there are no blue columns in the chart.

Because of these issues, the chart is confusing and does not convey the information that is in the results table.

While youcancreate a usable visualization from thestatscommand results table, the visualization is useful only when you specify one BY clause field.

It's better to use thechartcommand when you want to create a visualization using two BY clause fields:

Search commands > stats, chart, and timechart | Splunk (2)

Thestatusfield forms the X-axis and thehostvalues form the data series. The range of count values form the Y-axis.

What About the Timechart Command?

When you use thetimechartcommand, the results table is always grouped by the event timestamp (the_timefield). The time value is the <row-split> for the results table. So in the BY clause, you specify only one field, the <column-split> field. For example, this search generates a count and specifies the status field as the<column-split> field:

... | timechart count BY status

This search produces this results table:

_time200400403404
2018-07-05103827719
2018-07-0649811113598
2018-07-0751239945105
2018-07-08501611222105
2018-07-094732863484
2018-07-10479110223107
2018-07-114783853998
2018-07-123818792374

If you search by thehostfield instead, this results table is produced:

_time

www1

www2

www3

2018-07-05372429419
2018-07-06211118371836
2018-07-07188720461935
2018-07-08192718692005
2018-07-09193716541792
2018-07-10198018321733
2018-07-11185518471836
2018-07-12155913981436

The time increments that you see in the_timecolumn are based on the search time range or the arguments that you specify with thetimechartcommand. In the previous examples the time range was set toAll timeand there are only a few weeks of data. Because we didn't specify aspan, a default time span is used. In this situation, the default span is 1 day.

If you specify a time range likeLast 24 hours, the default time span is 30 minutes. TheUsagesection in the timechart documentation specifies the default time spans for the most common time ranges. This results table shows the default time span of 30 minutes:

_timewww1www2www3
2018-07-12 15:00:00442273
2018-07-12 15:30:00345331
2018-07-12 16:00:00143336
2018-07-12 16:30:00462154
2018-07-12 17:00:00752638
2018-07-12 17:30:00385114
2018-07-12 18:00:00622415


Thetimechartcommand includes several options that are not available with the stats and chart commands. For example, you can specify a time span like we have in this search:

... | timechart span=12h count BY host

_timewww1www2www3
2018-07-04 17:00801783819
2018-07-05 05:00795847723
2018-07-05 17:00192616611642
2018-07-06 05:00150117741542
2018-07-06 17:00203319091857
2018-07-07 05:00148216711594
2018-07-07 17:00202718182036

In this example, the 12-hour increments in the results table are based on when you run the search (local time) and how that aligns that with UNIX time (sometimes referred to as epoch time).

Note: There are other options you can specify with thetimechartcommand, which we'll explore in a separate blog.

So how do these results appear in a chart? On theVisualizationtab, you see that_timeforms the X-axis. The axis marks the Midnight and Noon values for each date. However, the columns that represent the data start at 1700 each day and end at 0500 the next day.

The field specified in the BY clause forms the data series. The range of count values forms the Y-axis.

Search commands > stats, chart, and timechart | Splunk (3)

In Summary

The stats, chart, and timechart commands have some similarities, but you’ve got to pay attention to the BY clauses that you use with them.

  • Usethe stats command when you want to create results tables that show granular statistical calculations.
  • Use the stats command when you want to specify 3 or more fields in the BY clause.
  • Use the chart command when you want to create results tables that show consolidated and summarized calculations.
  • Use the chart command to create visualizations from the results table data.
  • Use the timechart command to create results tables and charts that are based on time.

SPL it like you mean it - Laura

References

Other blogs:

  • Search Command> stats, eventstats and streamstats

Splunk documentation:

  • Stats command:https://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Stats
  • Chart command:https://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Chart
  • Timechart command:https://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Timechart

----------------------------------------------------
Thanks!
Laura Stewart

Search commands > stats, chart, and timechart | Splunk (4)

Splunk

The world’s leading organizations trustSplunkto help keep their digital systems secure and reliable. Our software solutions and services help to prevent major issues, absorb shocks and accelerate transformation. Learnwhat Splunk doesandwhy customers choose Splunk.

Search commands > stats, chart, and timechart | Splunk (2024)
Top Articles
Latest Posts
Article information

Author: Dean Jakubowski Ret

Last Updated:

Views: 5790

Rating: 5 / 5 (50 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Dean Jakubowski Ret

Birthday: 1996-05-10

Address: Apt. 425 4346 Santiago Islands, Shariside, AK 38830-1874

Phone: +96313309894162

Job: Legacy Sales Designer

Hobby: Baseball, Wood carving, Candle making, Jigsaw puzzles, Lacemaking, Parkour, Drawing

Introduction: My name is Dean Jakubowski Ret, I am a enthusiastic, friendly, homely, handsome, zealous, brainy, elegant person who loves writing and wants to share my knowledge and understanding with you.