Summary data is time-series data that is not tied to an individual person ID. Summary data represents aggregated data at a different level of aggregation, for example campaigns. You can use this data in Customer Journey Analytics to support various use cases. For example, data containing a date and a single metric value, or data containing multiple dimensions and metrics.
This summary data can then be used to present high-level performance indicators or perform analysis. Examples of summary data can be advertising impressions, email opens, advertising spend, cost of good sold, S&P indices, and more. You can also use summary data to upload targets or goals on an hourly or daily basis.
Summary data is time-series data from a summary dataset. That summary dataset is based on a schema that uses the XDM Summary Metrics class as its base class.
Only hourly or daily based time-series data are supported.
You can configure a connection, data view and subsequently report on only summary data. It is not required to have additional event, profile or lookup data as part of your configuration.
An example of using summary data is combining summarized advertising campaign data with on-site clickstream data for reporting.
Your summary data contains the following advertising campaign data.
Campaign Code | Impressions | Cost |
---|---|---|
abc123 | 1,250 | $1,500 |
def456 | 775 | $650 |
ghi789 | 500 | $500 |
Your on-site clickstream data contains the following events.
Tracking Code | Click-throughs | Revenue |
---|---|---|
abc123 | 1,250 | $7,200 |
def456 | 775 | $1,250 |
ghi789 | 500 | $750 |
As explained in Combined event dataset, when defining a connection, Customer Journey Analytics creates an overall combined event dataset. When you configure your data view for dimensions originating from a summary dataset, options are available to group and hide dimensions as a preparation for reporting in Workspace. Specifically for summary data, the summary data is combined with event data, based on the summary data group component configuration.
Tracking Code | Campaign Code | Impressions | Cost | Click-throughs | Revenue |
---|---|---|---|---|---|
abc123 | abc123 | 1,250 | $1,500 | 1,250 | $7,200 |
def456 | def123 | 775 | $650 | 775 | $1,250 |
ghi789 | ghi789 | 500 | $500 | 500 | $750 |
Combining the summarized event data and on-site clickstream data enables you to report in Workspace on return on ad spend (ROAS).
Tracking Code | Impressions | Cost | Click-throughs | Revenue | ROAS |
---|---|---|---|---|---|
abc123 | 1,250 | $1,500 | 1,250 | $7,200 | 4.80 |
def456 | 775 | $650 | 775 | $1,250 | 1.92 |
ghi789 | 500 | $500 | 500 | $750 | 1.50 |
If you want to report using a dimension defined in an additional lookup dataset (for example, campaign name), you have to follow these additional steps:
See the Ingest and report on summary data use case for a detailed article on how to make use of, report on, and analyze summary data in Customer Journey Analytics.
To use summary data properly in your reports and analysis, a number of prerequisites apply. The following sections detail these prerequisites.
When configuring the dataset containing the summary data in Customer Journey Analytics, you notice that the granularity is automatically derived from the data. The selections for Timestamp and Timezone dropdown are disabled as both are derived from the schema definition.
You cannot mix and match the hourly and daily granularity of your summary data in one dataset (or with different datasets), using one summary data schema. For example, if you do have daily and hourly granular summary data for advertising campaign data, you need two schemas: one for the daily and one for the hourly summary data. And then upload the relevant granular data into datasets associated with the proper schema (for example, upload hourly data into a dataset associated with the hourly summary data schema).
The timezone of your summary data is defined at the summary schema level in Experience Platform. The timezone applies to hourly granular data only.
For daily granularity, Experience Platform assumes UTC, unless a timezone offset is included in the timestamp. When adding the summary dataset containing the daily summary data, Customer Journey Analytics ignores the timezone definition set on the schema and respects the day associated with the timestamp from the data in the dataset.
For hourly granularity, Customer Journey Analytics respects the timezone configured on the summary data schema in Experience Platform when interpreting the timestamp. The table below provides some examples of this interpretation.
Timestamp source data |
Timezone schema |
Timestamp Experience Platform |
Timezone data view |
Timestamp Customer Journey Analytics |
---|---|---|---|---|
2024-07-29T01:00:00 | default GMT | 2024-07-29T01:00:00 | GMT | 2024-07-29T01:00:00 |
2024-07-29T01:00:00 | default GMT | 2024-07-29T01:00:00 | PST | 2024-07-28T18:00:00 |
2024-07-30T01:00:00-05:00 | default GMT | 2024-07-30T06:00:00 | GMT | 2024-07-30T06:00:00 |
2024-07-30T01:00:00-05:00 | default GMT | 2024-07-30T06:00:00 | PST | 2024-07-29T23:00:00 |
2024-08-02 | default GMT | 2024-08-02T00:00:00 | IST | 2024-08-02T05:00:00 |
2024-07-29T01:00:00 | America/ Los_Angeles |
2024-07-28T18:00:00 | PST | 2024-07-28T18:00:00 |
2024-07-30T01:00:00-05:00 | Australia/ Sydney |
2024-07-30T17:00:00 | CET | 2024-07-30T08:00:00 |
For timezones with a 30 minutes offset (for example IST, India Standard Time), the 30 minute offset is dropped when reporting on summary data. For example: 12:30 is reported as 12:00.
To ensure, the proper timezone is used for your hourly granular summary data, you must ensure the schema used for summary data has the proper timezone configured.
To configure the granularity and timezone for your summary data schema, you have to use the following API call as there is no equivalent user interface available.
curl -X POST \
https://platform.adobe.io/data/foundation/schemaregistry/tenant/descriptors \
-H "Authorization: Bearer {$ACCESS_TOKEN}" \
-H 'Content-Type: application/json' \
-H 'x-api-key: {$API_KEY}' \
-H 'x-gw-ims-org-id: {$ORG_ID}' \
-H 'x-sandbox-name: {$SANDBOX_NAME}' \
-d '{
"@type": "xdm:descriptorTimeSeriesGranularity",
"xdm:sourceSchema": "{$SCHEMA_ID}",
"xdm:sourceVersion": 1,
"xdm:granularity": "{$GRANULARITY}",
"xdm:ianaTimezone": "{$TIMEZONE}"
}'
Variable | Value |
---|---|
$ACCESS_TOKEN $API_KEY $ORG_ID $SANDBOX_NAME |
See Authenticate and access Experience Platform APIs for more information on how to specify values for these variables. |
$SCHEMA_ID |
You can find the id of your schema in the Experience Platform UI. Select your summary schema from the list of schemas, and find the API Usage > Schema ID in the right panel. Use that id as the value. |
$GRANULARITY |
Specify hour or day as the value. |
$TIMEZONE |
Specify the proper timezone identifier value from the TZ identifier column in the List of tz database time zones. For example: America/Los_Angeles . |
Ensure the component settings for a summary data group are the same. See Summary data group component settings for more details.