In this lesson, you will create datasets to receive your data. You will be excited to know that this is the shortest lesson in the tutorial!
All data that is successfully ingested into Adobe Experience Platform is persisted in the data lake as datasets. A dataset is a storage and management construct for a collection of data, typically a table, that contains a schema (columns) and fields (rows). Datasets also contain metadata that describes various aspects of the data they store.
Data Architects will need to create datasets outside of this tutorial.
In the Configure Permissions lesson, you set up all the access controls required to complete this lesson.
In this exercise, we will create datasets in the UI. Let’s start with the loyalty data:
Go to Datasets in the Platform user interface’s left navigation
Select the Create dataset button
On the next screen, select Create dataset from schema
On the next screen, select your Luma Loyalty Schema and then select the Next button
Name the dataset Luma Loyalty Dataset and select the Finish button
When the dataset has saved, you will be taken to a screen like this:
That’s it! I told you this was going to be quick. Create these other datasets using the same steps:
Luma Offline Purchase Events Dataset for your Luma Offline Purchase Events Schema
Luma Web Events Dataset for your Luma Web Events Schema
Luma Product Catalog Dataset for your Luma Product Catalog Schema
Now create the Luma CRM Dataset using the API.
If you want to skip the API exercise and create the Luma CRM Dataset in the user interface, that’s fine. Name it Luma CRM Dataset and use the Luma CRM Schema.
First we need to get the $id of the Luma CRM Schema:
Find the Luma CRM Schema item and copy the $id value
Now you can create the dataset:
Download Catalog Service API.postman_collection.json to your Luma Tutorial Assets folder.
Import the collection into Postman
Select the request Catalog Service API > Datasets > Create a new dataset.
Paste the following as the Body of the request, replacing the id value with your own:
{
    "name": "Luma CRM Dataset",
    "schemaRef": {
        "id": "REPLACE_WITH_YOUR_OWN_ID",
        "contentType": "application/vnd.adobe.xed-full+json;version=1"
    },
    "fileDescription": {
        "persisted": true,
        "containerFormat": "parquet",
        "format": "parquet"
    }
}
Select the Send button
You should get a 201 Created response containing the id of your new dataset!
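If you would rather script this call than use Postman, the request above could be sketched in Python roughly as follows. This is a minimal sketch under stated assumptions, not Adobe's reference client: the endpoint URL, the header names, and the helper names (`find_schema_id`, `build_dataset_body`, `build_create_request`) are illustrative and should be checked against the Postman collection and your own environment. The `results`/`title` shape assumed for the schema list response is also an assumption.

```python
import json
import urllib.request

# Assumed Catalog Service endpoint; confirm it against the Postman collection.
CATALOG_DATASETS_URL = "https://platform.adobe.io/data/foundation/catalog/dataSets"


def find_schema_id(schema_list, title):
    """Return the $id of the schema whose title matches, or None.

    Assumes the schema list response holds a "results" array whose
    items carry "title" and "$id" keys.
    """
    for item in schema_list.get("results", []):
        if item.get("title") == title:
            return item.get("$id")
    return None


def build_dataset_body(name, schema_id):
    """Build the JSON body shown in the lesson for the given schema $id."""
    return {
        "name": name,
        "schemaRef": {
            "id": schema_id,
            "contentType": "application/vnd.adobe.xed-full+json;version=1",
        },
        "fileDescription": {
            "persisted": True,
            "containerFormat": "parquet",
            "format": "parquet",
        },
    }


def build_create_request(body, access_token, api_key, org_id, sandbox_name):
    """Prepare (but do not send) the POST request.

    The header names below are assumptions; use the ones from your
    Postman environment if they differ.
    """
    headers = {
        "Authorization": "Bearer " + access_token,
        "x-api-key": api_key,
        "x-gw-ims-org-id": org_id,
        "x-sandbox-name": sandbox_name,
        "Content-Type": "application/json",
    }
    return urllib.request.Request(
        CATALOG_DATASETS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

To send the prepared request, pass it to `urllib.request.urlopen`. Nothing is sent until you do, so the body can be inspected first.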
Common issues making this request and likely fixes:
400: There was a problem retrieving xdm schema: Make sure you have replaced the id in the sample above with the id of your own Luma CRM Schema
401: Not Authorized to PUT/POST/PATCH/DELETE for this path : /global/schemas/: Update the CONTAINER_ID environment variable from global to tenant
403: PALM Access Denied. POST access is denied for this resource from access control: Verify your user permissions in the Admin Console
Go back to the Datasets screen in the Platform user interface to verify the successful creation of all five datasets!
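The troubleshooting list and the final check could also be expressed in a small script. The sketch below is illustrative only: the helper names (`likely_fix`, `missing_datasets`) are hypothetical, and it assumes that a dataset listing is a JSON object keyed by dataset id, with each entry carrying a `name` field. Verify that shape against a real response before relying on it.

```python
# The five dataset names created in this lesson (four in the UI, one via the API).
EXPECTED_DATASETS = [
    "Luma Loyalty Dataset",
    "Luma Offline Purchase Events Dataset",
    "Luma Web Events Dataset",
    "Luma Product Catalog Dataset",
    "Luma CRM Dataset",
]

# Likely fixes from the lesson, keyed by HTTP status code.
LIKELY_FIXES = {
    400: "Replace the id in the request body with the $id of your own Luma CRM Schema.",
    401: "Update the CONTAINER_ID environment variable from global to tenant.",
    403: "Verify your user permissions in the Admin Console.",
}


def likely_fix(status_code):
    """Return the lesson's suggested fix for a status code, if one is listed."""
    return LIKELY_FIXES.get(status_code, "No known fix listed in this lesson.")


def missing_datasets(catalog_listing, expected=EXPECTED_DATASETS):
    """Return the expected dataset names that are absent from the listing.

    Assumes catalog_listing maps dataset ids to objects with a "name" key.
    """
    found = {entry.get("name") for entry in catalog_listing.values()}
    return [name for name in expected if name not in found]
```

An empty list from `missing_datasets` means all five datasets were found by name.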
Now that all of our schemas, identities, and datasets are in place, we can enable them for Real-Time Customer Profile.