Skip to content

Exercise 1 - Unify Data with Data Streams

The goal of this exercise is to learn how to use Data Streams to ingest data from multiple sources into Data Cloud and how those sources map into the Customer 360 model.

Different systems will have different data models. Welo, as a tech company, has Account and Contact data in Sales Cloud and product usage datasets in AWS S3. For this workshop we’re using S3 but most customers keep their usage data in cloud data warehouses like Snowflake, BigQuery, Databricks or Redshift, which Data Cloud accesses via our zero-copy partnerships.

Each system has different columns representing people and information related to people. For instance, the Welo Platform Users dataset contains an Account key but this refers to the way that Welo’s product understands an Account, not an account as it exists in Salesforce CRM.

Data Cloud abstracts this complexity by mapping each source system to a canonical model. This canonical model is called the Customer 360 Data Model and has many out-of-the-box tables based on industry best practices to meet many use cases. This schema is also normalized. An example is how Data Cloud has the concept of an Individual. User data from S3 and contact data from Sales Cloud are both mapped to the Individual object. If you’re coming from a Salesforce background, you’ll notice that the party data model has separate tables for things like Phone Numbers, Addresses and Emails.

Why Do I Need to Map Data Streams?

Every company has a different product architecture and stores usage data in a different way. That’s why you need to map your data sources to the canonical data model in Data Cloud.

To streamline today’s workshop, much of the data has already been ingested into Data Cloud. To give you a feel for the ease of ingesting data into Data Cloud, we’ll review Welo Platform Login information from S3. The point of this is to allow their Sales, Marketing, Success, and other customer-facing teams to have a unified view at the Account level and individual User level. This then enables every GTM team to have more personalized, 1:1 interactions with customers and users, resulting in greater customer retention, expansion and loyalty opportunities.

Step 1: Reviewing Data Streams and the Customer 360 Data Model

  1. From the Data Cloud Home Page, navigate to the Data Streams tab. If you see Recently Viewed at the top, switch the view to All Data Streams.
  2. From here, we can see information about the data being ingested into Data Cloud, where it’s coming from, the Total Records in the stream, and so on. This is really the first step in the ‘data journey.’ Let’s take a look at what’s already set-up for Welo.
  3. From CRM, data streams have been set up for the Account, Asset, Case and Contact objects.
  4. From the Welo platform (by way of Amazon s3), data streams have been set up for both Accounts and Users.
  5. From Salesforce Marketing Cloud, data streams have been set up for email profile and engagement data like Subscribers, Sends, Opens, Clicks and Bounces.
  6. And from Welo’s website, data streams have been set up for Visitors and Page Views. Image
  7. So let’s bring Welo Platform Logins into Data Cloud! Click New on the upper right. This New Data Stream wizard lists many different sources and this isn’t all of them, just a small number enabled in this workshop environment.
  8. Select Installed Data Kits & Packages as the source and click Next. Image
  9. On the left under Data Kits, select Welo Workshop and then scroll down and check WeloPlatformLogins_stream, then click Next. Image
  10. Click Next at the bottom, then click Next two more times, and then finally click Deploy. Image
  11. Once deployed, you’ll see an overview of all your Data Streams. Locate your new Welo Platform Logins data stream and click into it.
  12. Click the Review button in the Data Mapping box. Image
  13. The Source is on the left in a Data Stream mapping. In our case, this represents the fields in the Welo Platform Logins dataset from S3.
  14. The Destination is on the right. As you can see, this has been pre-configured and mapped to a custom Data Model Object called Welo Platform Logins.
  15. Click Close on the mapping screen to return to the Data Stream detail page.
  16. From the Data Stream detail page, click Refresh Now to execute the stream and ingest the login data into Data Cloud. On the next pop-up, select Refresh Only New Files. Image

Step 2: Customer 360 Data Model

  1. Next, navigate to the Data Model tab. By default, this tab lists the Mapped Data Model Objects (DMOs). Image

  2. Let’s look at the “Big Picture” first. In the Upper Right, change from List view to Graph view. Image

  3. This shows a data model visualization, including relationships between objects.
    Image

Observe the distinct colors for the Profile, Engagement and Other categories.

Choose Profile if your dataset aligns with:

  • A list of consumers with identifiers like consumer IDs, email addresses, or phone numbers
  • A list of businesses or accounts with account IDs
  • A list of employees or any other population for segmentation or as a segment’s starting population

Opt for the Engagement category if your dataset is time-series oriented, such as Web Engagement, Email Engagement, or Orders.

If your dataset is related to Profile and Engagement but doesn’t fit those categories, like product, store or vehicle info, go for the Other category.

You may want to leave the Data Model graph open to refer back to throughout the workshop.

Step 3: Unify Accounts & Profiles with Identity Resolution

Once data has been ingested into the Data Cloud, we need to unify the data. In this process we’ll see how Welo can unify Individual Profiles and Accounts. Why?

  • Individuals: Welo has Users in their Product who may or may not exist as Contacts in Sales Cloud. Now sales and success users can identify product users to drive expansion opportunities as part of a product-led sales playbook.
  • Accounts: Welo also has Account duplication, which has negative, downstream impacts to their GTM teams. With Account Unification, Welo’s sales teams will have better insights into overall account health and revenue opportunities while collaborating with marketing on ABM.

What’s the point of a unified identity?

A customer can be a CRM contact, a product user, or a contact from a data enrichment provider. Data also isn’t exact. Fuzzy matching lets you create a single profile for your customer.

In this step, we review the configuration since the unification was executed before the workshop.

  1. While still in Data Cloud, click the Identity Resolutions tab. You may need to change the List View filter to All Identity Resolutions.
    Like the Data Streams tab, this shows a list of Ruleset Names along with their attributes and results, including the number and percentage of matching records.

  2. Click the ruleset Unified Individual to see more details. This represents how our Salesforce Contacts, Welo Platform Users, SFMC Subscribers and Website Page Visit get connected. Image

  3. Under the Ruleset Properties tab, review the two primary parts of the ruleset:

  4. Match Rules: These define the attributes used to identify matching records. In this example, we use a fuzzy match for the Name and a normalized match for the email.

  5. Reconciliation Rules: Matched data sources may provide different values for a single field. Reconciliation rules specify the best value to save to the unified profile. Options include Most Frequent or Source Priority.

This concludes Exercise 1

Released under the MIT License.