<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Can data be unified based on client profile (unified profile) in databricks? in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62176#M6736</link>
    <description>&lt;P&gt;You want to identify actual persons based on one or more profiles (matched on e-mail address, etc.). That is not available out of the box in Databricks. The 'unified' in Databricks means you have a single platform for several data disciplines, such as engineering, analytics, and ML.&lt;BR /&gt;What you are looking for is in fact a Customer Data Platform (CDP), which can uniquely identify a natural person based on stored characteristics.&lt;BR /&gt;How Salesforce does it is probably a secret, but they very likely use a combination of name, address, e-mail address, etc. to check whether different profiles point to the same natural person.&lt;BR /&gt;Basically there are two approaches to identifying a person: deterministic and probabilistic.&lt;BR /&gt;The method I just described is deterministic (based on hard rules). A more advanced technique, which can lead to better or worse results than the deterministic method, is the probabilistic method. Here we try to identify a person based on probabilistic models, so this is a form of statistical learning/machine learning.&lt;BR /&gt;A combination of both is also possible.&lt;/P&gt;&lt;P&gt;As I mentioned, Databricks does not have its own CDP, but that does not mean you cannot build one using Databricks.&lt;BR /&gt;The methods I described can be applied in notebooks.
Also, there is an interesting blog post on the Databricks site about Arc, a probabilistic model:&lt;BR /&gt;&lt;A href="https://www.databricks.com/blog/linking-unlinkables-simple-automated-scalable-data-linking-databricks-arc" target="_blank" rel="noopener"&gt;https://www.databricks.com/blog/linking-unlinkables-simple-automated-scalable-data-linking-databricks-arc&lt;/A&gt;&lt;/P&gt;&lt;P&gt;There is a lot to be found online. But beware: Salesforce CDP does not only identify persons; it also provides the whole UI experience, flexible filtering, campaign creation, etc. That is something I do not see in Databricks.&lt;/P&gt;</description>
    <pubDate>Wed, 28 Feb 2024 07:40:46 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2024-02-28T07:40:46Z</dc:date>
    <item>
      <title>Can data be unified based on client profile (unified profile) in databricks?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62134#M6734</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;My question concerns how data in Salesforce Data Cloud gets unified based on client profiles. Can something similar be done on data in Databricks? I believe Unity Catalog only provides a unified layer for security and governance. Is there a way to unify the data as well, or will it involve writing business rules and transforming the data accordingly in code?&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2024 17:18:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62134#M6734</guid>
      <dc:creator>Ruby8376</dc:creator>
      <dc:date>2024-02-27T17:18:01Z</dc:date>
    </item>
    <item>
      <title>Re: Can data be unified based on client profile (unified profile) in databricks?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62135#M6735</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/14792"&gt;@-werners-&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;can you please help?&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2024 17:20:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62135#M6735</guid>
      <dc:creator>Ruby8376</dc:creator>
      <dc:date>2024-02-27T17:20:08Z</dc:date>
    </item>
    <item>
      <title>Re: Can data be unified based on client profile (unified profile) in databricks?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62176#M6736</link>
      <description>&lt;P&gt;You want to identify actual persons based on one or more profiles (matched on e-mail address, etc.). That is not available out of the box in Databricks. The 'unified' in Databricks means you have a single platform for several data disciplines, such as engineering, analytics, and ML.&lt;BR /&gt;What you are looking for is in fact a Customer Data Platform (CDP), which can uniquely identify a natural person based on stored characteristics.&lt;BR /&gt;How Salesforce does it is probably a secret, but they very likely use a combination of name, address, e-mail address, etc. to check whether different profiles point to the same natural person.&lt;BR /&gt;Basically there are two approaches to identifying a person: deterministic and probabilistic.&lt;BR /&gt;The method I just described is deterministic (based on hard rules). A more advanced technique, which can lead to better or worse results than the deterministic method, is the probabilistic method. Here we try to identify a person based on probabilistic models, so this is a form of statistical learning/machine learning.&lt;BR /&gt;A combination of both is also possible.&lt;/P&gt;&lt;P&gt;As I mentioned, Databricks does not have its own CDP, but that does not mean you cannot build one using Databricks.&lt;BR /&gt;The methods I described can be applied in notebooks.
Also, there is an interesting blog post on the Databricks site about Arc, a probabilistic model:&lt;BR /&gt;&lt;A href="https://www.databricks.com/blog/linking-unlinkables-simple-automated-scalable-data-linking-databricks-arc" target="_blank" rel="noopener"&gt;https://www.databricks.com/blog/linking-unlinkables-simple-automated-scalable-data-linking-databricks-arc&lt;/A&gt;&lt;/P&gt;&lt;P&gt;There is a lot to be found online. But beware: Salesforce CDP does not only identify persons; it also provides the whole UI experience, flexible filtering, campaign creation, etc. That is something I do not see in Databricks.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Feb 2024 07:40:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/62176#M6736</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-02-28T07:40:46Z</dc:date>
    </item>
    <item>
      <title>Re: Can data be unified based on client profile (unified profile) in databricks?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/63280#M6737</link>
      <description>&lt;P&gt;Thank you so much !!&lt;/P&gt;</description>
      <pubDate>Mon, 11 Mar 2024 19:45:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/can-data-be-unified-based-on-client-profile-unified-profile-in/m-p/63280#M6737</guid>
      <dc:creator>Ruby8376</dc:creator>
      <dc:date>2024-03-11T19:45:09Z</dc:date>
    </item>
  </channel>
</rss>

