Use blending to reaggregate data

Learn how to work with previously aggregated data.

Reaggregation is a common need in data visualization. This article will help you understand the concept of reaggregation and how to achieve it in Insights using data blending.

One example of reaggregation is calculating the average of averages. For example, say you have a table of stock price changes:

Sector

Ticker

Price Change

Tech

GOOG

+6

Tech

AAPL

+5

Tech

MSFT

-3

Tech

NFLX

-1

Energy

E1

+2

Energy

E2

+10

Energy

E3

-3

Finance

F1

-6

The average price change for this data is a simple aggregation.

Average of Price Change

1.25

To calculate the average price change for every sector, you'd group this table by the Sector dimension.

Sector

Average of Price Change

Tech

1.75

Energy

3

Finance

-6

To reaggregate this data, you'd apply another aggregation function, for example, applying average again:

Average of Average of Price Change

-0.42

Reaggregation in Insights

To reaggregate metrics in Insights, use data blending. Blending lets you work around the fact that previously aggregated fields are set to the AUTO field type. You can't change this field type, nor can you apply another aggregation function to such fields.

For example, to find the average change of stock prices per sector in Insights, you'd create a blend configuration that joins the same data source with itself. Use Sector as the join key, and include the Average Price Change metric in both the left and right data sources, as shown below:

Blending disaggregates data

Blending data creates a new table from the columns that you select in the blend configuration. Metrics in the new table are treated as unaggregated numbers.Because Price Change is no longer an aggregated metric, you can now apply a new aggregation function on it. The table below shows the results of creating a new metric AVG(Price Change) with the previously aggregated numbers:

This new metric reaggregates the numbers 1.75, 3 and -6 and displays their average: -0.42 .

Create a ratio column using blending

Another use for blending is to create ratio metrics with already aggregated numbers. Say you want to create a ratio column that divides one metric by another.

In this example, we'll use two fields; Clicks and Impressions, coming from two different data sources.

Website

Clicks

google.com

300

facebook.com

400

twitter.com

200

Website

Impressions

google.com

2000

facebook.com

2500

twitter.com

2000

You can create a ratio column with a calculated field Clicks/Impressions by blending these two data sources.

Website

Clicks

Impressions

Clicks / Impressions

google.com

300

2000

0.15

facebook.com

400

2500

0.16

twitter.com

200

2000

0.1

Grand Total

900

6500

0.41

All the rows of Clicks/Impressions have correct information except the summary row which shows the sum of the ratio column SUM(Clicks / Impressions). This happens because Clicks/Impressions is calculated for each row [0.15, 0.16, 0.1] and then the SUM function is applied to it. [0.15 + 0.16 + 0.1 = 0.41].

The correct result is 900/6500 = 0.14 .You can do this by calculating the ratio column values using the formula SUM(Clicks) / SUM(Impressions).

Website

Clicks

Impressions

SUM(Clicks) / SUM(Impressions)

google.com

300

2000

0.15

facebook.com

400

2500

0.16

twitter.com

200

2000

0.1

Grand Total

900

6500

0.14

In this case, the summary row shows SUM( SUM(Clicks) / SUM(Impressions) ). SUM(Clicks) [900] is divided by SUM(Impressions)[6500] to give 0.14. The SUM function is then applied to it again. The result is still 0.14.

Last updated