<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Stream static join with aggregation in Warehousing &amp; Analytics</title>
    <link>https://community.databricks.com/t5/warehousing-analytics/stream-static-join-with-aggregation/m-p/65606#M1262</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to make Stream - Static join with aggregation with no luck. I have a streaming table where I am getting events with two nasted arrays&lt;/P&gt;&lt;P&gt;ID&amp;nbsp; &amp;nbsp;Array1&amp;nbsp; &amp;nbsp;Array2&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp;[1,2]&amp;nbsp; &amp;nbsp; &amp;nbsp;[3,4]&lt;/P&gt;&lt;P&gt;I need make two joins to static dictionary tables (without any timestamps) based on IDs contained in arrays, second join is based on first, once I will get descriptive values I need to assign and sum scores, so result should be like that:&lt;/P&gt;&lt;P&gt;ID&amp;nbsp; &amp;nbsp;Score1&amp;nbsp; &amp;nbsp;Score2&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; 10&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;12&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;so one record per id. I am using Databricks SQL. Any ideas how I can make it? Exploding arrays cause cartesian results, when I join to scoring table results will be incorrect. I was thinking to join by array and get an array in both cases, but I am getting errors that I need to apply watermark (even to static tables!). Appreciate any ideas....&lt;/P&gt;</description>
    <pubDate>Fri, 05 Apr 2024 12:27:55 GMT</pubDate>
    <dc:creator>rocky5</dc:creator>
    <dc:date>2024-04-05T12:27:55Z</dc:date>
    <item>
      <title>Stream static join with aggregation</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/stream-static-join-with-aggregation/m-p/65606#M1262</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to make Stream - Static join with aggregation with no luck. I have a streaming table where I am getting events with two nasted arrays&lt;/P&gt;&lt;P&gt;ID&amp;nbsp; &amp;nbsp;Array1&amp;nbsp; &amp;nbsp;Array2&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp;[1,2]&amp;nbsp; &amp;nbsp; &amp;nbsp;[3,4]&lt;/P&gt;&lt;P&gt;I need make two joins to static dictionary tables (without any timestamps) based on IDs contained in arrays, second join is based on first, once I will get descriptive values I need to assign and sum scores, so result should be like that:&lt;/P&gt;&lt;P&gt;ID&amp;nbsp; &amp;nbsp;Score1&amp;nbsp; &amp;nbsp;Score2&lt;/P&gt;&lt;P&gt;1&amp;nbsp; &amp;nbsp; &amp;nbsp; 10&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;12&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;/P&gt;&lt;P&gt;so one record per id. I am using Databricks SQL. Any ideas how I can make it? Exploding arrays cause cartesian results, when I join to scoring table results will be incorrect. I was thinking to join by array and get an array in both cases, but I am getting errors that I need to apply watermark (even to static tables!). Appreciate any ideas....&lt;/P&gt;</description>
      <pubDate>Fri, 05 Apr 2024 12:27:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/stream-static-join-with-aggregation/m-p/65606#M1262</guid>
      <dc:creator>rocky5</dc:creator>
      <dc:date>2024-04-05T12:27:55Z</dc:date>
    </item>
  </channel>
</rss>

