When you create a connection, Customer Journey Analytics combines all event datasets into a single dataset. This combined event dataset is what Customer Journey Analytics uses for reporting (together with profile and lookup datasets). When you include multiple event datasets in a connection:
Consider the following example. You have two event datasets, each with different fields containing different data.
Adobe Experience Platform typically stores a timestamp in UNIX® milliseconds. For readability in this example, date and time are used.
example_id | timestamp | string_color | string_animal | metric_a |
---|---|---|---|---|
user_310 | 1 Jan 7:02 AM | Red | Fox | |
user_310 | 1 Jan 7:04 AM | 2 | ||
user_310 | 1 Jan 7:08 AM | Blue | 3 | |
user_847 | 2 Jan 12:31 PM | Turtle | 4 | |
user_847 | 2 Jan 12:44 PM | 2 |
different_id | timestamp | string_color | string_shape | metric_b |
---|---|---|---|---|
user_847 | 2 Jan 12:26 PM | Yellow | Circle | 8.5 |
user_847 | 2 Jan 1:01 PM | Red | ||
alternateid_656 | 2 Jan 8:58 PM | Red | Square | 4.2 |
alternateid_656 | 2 Jan 9:03 PM | Triangle | 3.1 |
When you create a connection using these two event datasets, and have identified
example_id
as the Person ID for the first dataset, anddifferent_id
as the Person ID for the second dataset,the following combined dataset is used for reporting.
id | timestamp | string_color | string_animal | string_shape | metric_a | metric_b |
---|---|---|---|---|---|---|
user_310 | 1 Jan 7:02 AM | Red | Fox | |||
user_310 | 1 Jan 7:04 AM | 2 | ||||
user_310 | 1 Jan 7:08 AM | Blue | 3 | |||
user_847 | 2 Jan 12:26 PM | Yellow | Circle | 8.5 | ||
user_847 | 2 Jan 12:31 PM | Turtle | 4 | |||
user_847 | 2 Jan 12:44 PM | 2 | ||||
user_847 | 2 Jan 1:01 PM | Red | ||||
alternateid_656 | 2 Jan 8:58 PM | Red | Square | 4.2 | ||
alternateid_656 | 2 Jan 9:03 PM | Triangle | 3.1 |
To illustrate the importance of schema paths, consider this scenario. In the first dataset, string_color
is based on schema path _experience.whatever.string_color
and in the second dataset on schema path _experience.somethingelse.string_color
. In this scenario, the data is not merged into one column in the resulting combined dataset. Instead, the result is two string_color
columns in the combined dataset:
id | timestamp | _experience. whatever. string_color |
_experience. somethingelse. string_color |
string_animal | string_shape | metric_a | metric_b |
---|---|---|---|---|---|---|---|
user_310 | 1 Jan 7:02 AM | Red | Fox | ||||
user_310 | 1 Jan 7:04 AM | 2 | |||||
user_310 | 1 Jan 7:08 AM | Blue | 3 | ||||
user_847 | 2 Jan 12:26 PM | Yellow | Circle | 8.5 | |||
user_847 | 2 Jan 12:31 PM | Turtle | 4 | ||||
user_847 | 2 Jan 12:44 PM | 2 | |||||
user_847 | 2 Jan 1:01 PM | Red | |||||
alternateid_656 | 2 Jan 8:58 PM | Red | Square | 4.2 | |||
alternateid_656 | 2 Jan 9:03 PM | Triangle | 3.1 |
This combined event dataset is what is used in reporting. It does not matter which dataset a row comes from. Customer Journey Analytics treats all data as if it is in the same dataset. If a matching Person ID appears in both datasets, they are considered the same unique person. If a matching Person ID appears in both datasets with a timestamp within 30 minutes, they are considered part of the same session. Fields with identical schema paths are merged.
This concept also applies to attribution. It does not matter which dataset a row comes from; attribution works exactly as if all events came from a single dataset. Using the above tables as an example:
If your connection only included the first table and not the second, pulling a report using the string_color
dimension and metric_a
metric using last touch attribution would show:
string_color | metric_a |
---|---|
Unspecified | 6 |
Blue | 3 |
Red | 2 |
However, if you included both tables in your connection, attribution changes since user_847
is in both datasets. A row from the second dataset attributes metric_a
to ‘Yellow’ where they were previously unspecified:
string_color | metric_a |
---|---|
Yellow | 6 |
Blue | 3 |
Red | 2 |
If a merged field is a lookup key for one event dataset in the connection, the associated lookup dataset will enrich all values of that field. It does not matter which event dataset a row comes from, as the lookup relationship is associated with the shared schema path.
The next level of combining datasets is cross-channel analysis, where datasets from different channels are combined, based on a common identifier (Person ID). Cross-channel analysis can benefit from stitching functionality, allowing you to rekey a dataset’s Person ID so the dataset is properly updated to enable a seamless combination of multiple datasets. Stitching looks at user data from both authenticated and unauthenticated sessions to generate a stitched ID.
Cross-channel analysis allows you to answer questions such as:
For more information on cross-channel analysis, refer to the specific use case:
For a more in-depth discussion of stitching functionality, go to: