<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83746#M36997</link>
    <description>&lt;P&gt;I am creating a ETL pipeline where i am reading multiple stream table into temp tables and at the end am trying to join those tables to get the output feed into another live table. So for that am using below method where i am giving list of tables as parameter to a method. Inside method using for loop am streaming them one by one into temp tables. After that am trying to execute the sql to get the DF for my new live table.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My Python Method : -&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;def&lt;/SPAN&gt;&lt;SPAN&gt; generate_dlt_from_sources(sources,target_name, sql_query,schema,main_table):&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/97035"&gt;@Dlt&lt;/a&gt;.table(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; name=target_name&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; schema=schema,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; temporary=temporary&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; ) &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;def&lt;/SPAN&gt;&lt;SPAN&gt; create_table():&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; source &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; sources:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.readStream.table(source).createOrReplaceTempView(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"temp_&lt;/SPAN&gt;&lt;SPAN&gt;{source.split(&lt;/SPAN&gt;&lt;SPAN&gt;'.'&lt;/SPAN&gt;&lt;SPAN&gt;)[-&lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;]}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.read.table(main_table).createOrReplaceTempView(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"temp_&lt;/SPAN&gt;&lt;SPAN&gt;{main_table}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;return&lt;/SPAN&gt;&lt;SPAN&gt; (spark.sql(sql_query))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;STRONG&gt;Below are my inputs to that method&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;sources = ["table1","table2","table3","table4",------,"tablen"]&lt;BR /&gt;&lt;SPAN&gt;main_table = "parent_table"&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN&gt;target_name = "output_table"&lt;BR /&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN&gt;sql_query = "&lt;SPAN&gt;WITH&lt;SPAN&gt; CTE_IDVALS &lt;SPAN&gt;AS&lt;SPAN&gt; (&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table1&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table2&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table3&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table4&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;SELECT&lt;SPAN&gt; p.id,p.val&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;FROM&lt;SPAN&gt; temp_parent_table p &lt;SPAN&gt;LEFT JOIN&lt;SPAN&gt; CTE_IDVALS &lt;SPAN&gt;ON&lt;SPAN&gt; CTE_IDVALS.id = p.id&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;WHERE&lt;SPAN&gt; CTE_IDVALS.id &lt;SPAN&gt;IS &lt;SPAN&gt;NULL&lt;SPAN&gt;"&lt;BR /&gt;schema = "id long , val string"&lt;BR /&gt;&lt;BR /&gt;I am getting below error while running it in pipeline&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;ERROR:-&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;Failed to start stream XXXXX in either append mode or complete mode. Append mode error: Stream-stream LeftOuter join between two streaming DataFrame/Datasets is not supported without a watermark in the join keys, or a watermark on the nullable side and an appropriate range condition;&lt;BR /&gt;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;P&gt;&lt;SPAN&gt;Please help!&lt;/SPAN&gt;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 21 Aug 2024 10:00:54 GMT</pubDate>
    <dc:creator>BricksGuy</dc:creator>
    <dc:date>2024-08-21T10:00:54Z</dc:date>
    <item>
      <title>WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83746#M36997</link>
      <description>&lt;P&gt;I am creating a ETL pipeline where i am reading multiple stream table into temp tables and at the end am trying to join those tables to get the output feed into another live table. So for that am using below method where i am giving list of tables as parameter to a method. Inside method using for loop am streaming them one by one into temp tables. After that am trying to execute the sql to get the DF for my new live table.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My Python Method : -&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;def&lt;/SPAN&gt;&lt;SPAN&gt; generate_dlt_from_sources(sources,target_name, sql_query,schema,main_table):&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/97035"&gt;@Dlt&lt;/a&gt;.table(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; name=target_name&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; schema=schema,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; temporary=temporary&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; ) &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;def&lt;/SPAN&gt;&lt;SPAN&gt; create_table():&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;for&lt;/SPAN&gt;&lt;SPAN&gt; source &lt;/SPAN&gt;&lt;SPAN&gt;in&lt;/SPAN&gt;&lt;SPAN&gt; sources:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.readStream.table(source).createOrReplaceTempView(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"temp_&lt;/SPAN&gt;&lt;SPAN&gt;{source.split(&lt;/SPAN&gt;&lt;SPAN&gt;'.'&lt;/SPAN&gt;&lt;SPAN&gt;)[-&lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;]}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; spark.read.table(main_table).createOrReplaceTempView(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"temp_&lt;/SPAN&gt;&lt;SPAN&gt;{main_table}&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;return&lt;/SPAN&gt;&lt;SPAN&gt; (spark.sql(sql_query))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;STRONG&gt;Below are my inputs to that method&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;sources = ["table1","table2","table3","table4",------,"tablen"]&lt;BR /&gt;&lt;SPAN&gt;main_table = "parent_table"&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN&gt;target_name = "output_table"&lt;BR /&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN&gt;sql_query = "&lt;SPAN&gt;WITH&lt;SPAN&gt; CTE_IDVALS &lt;SPAN&gt;AS&lt;SPAN&gt; (&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table1&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table2&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table3&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;UNION&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;SPAN&gt;SELECT&lt;SPAN&gt; id,val &lt;SPAN&gt;FROM&lt;SPAN&gt; temp_table4&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;SELECT&lt;SPAN&gt; p.id,p.val&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;FROM&lt;SPAN&gt; temp_parent_table p &lt;SPAN&gt;LEFT JOIN&lt;SPAN&gt; CTE_IDVALS &lt;SPAN&gt;ON&lt;SPAN&gt; CTE_IDVALS.id = p.id&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;WHERE&lt;SPAN&gt; CTE_IDVALS.id &lt;SPAN&gt;IS &lt;SPAN&gt;NULL&lt;SPAN&gt;"&lt;BR /&gt;schema = "id long , val string"&lt;BR /&gt;&lt;BR /&gt;I am getting below error while running it in pipeline&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;ERROR:-&lt;BR /&gt;&lt;FONT color="#FF0000"&gt;Failed to start stream XXXXX in either append mode or complete mode. Append mode error: Stream-stream LeftOuter join between two streaming DataFrame/Datasets is not supported without a watermark in the join keys, or a watermark on the nullable side and an appropriate range condition;&lt;BR /&gt;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;P&gt;&lt;SPAN&gt;Please help!&lt;/SPAN&gt;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 10:00:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83746#M36997</guid>
      <dc:creator>BricksGuy</dc:creator>
      <dc:date>2024-08-21T10:00:54Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83798#M37004</link>
      <description>&lt;P&gt;A stream-stream left join needs a watermark.&lt;/P&gt;&lt;P&gt;f.e.:&lt;/P&gt;&lt;P&gt;stream_df = stream_df.withWatermark(&lt;SPAN class=""&gt;"timestamp_column"&lt;/SPAN&gt;, &lt;SPAN class=""&gt;"30 minutes"&lt;/SPAN&gt;)&lt;BR /&gt;joined_df = stream_df.join(other_stream_df, &lt;SPAN class=""&gt;"join_key"&lt;/SPAN&gt;)&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 13:10:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83798#M37004</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-08-21T13:10:17Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83799#M37005</link>
      <description>&lt;P&gt;&amp;nbsp;Thank you for the reply. But in my case am using sql query to read data from those temp tables. So how can i&amp;nbsp; handle the water mark issue in above scenario.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 13:17:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83799#M37005</guid>
      <dc:creator>BricksGuy</dc:creator>
      <dc:date>2024-08-21T13:17:40Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83801#M37006</link>
      <description>&lt;P&gt;try spark.read.table(main_table).withWatermark(&lt;SPAN class=""&gt;"timestamp_column"&lt;/SPAN&gt;, &lt;SPAN class=""&gt;"30 minutes"&lt;/SPAN&gt;).createOrReplaceTempView&lt;BR /&gt;In SQL it is also possible using the WATERMARK function.&lt;BR /&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/delta-live-tables/stateful-processing" target="_blank"&gt;https://learn.microsoft.com/en-us/azure/databricks/delta-live-tables/stateful-processing&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 13:25:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83801#M37006</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-08-21T13:25:04Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83829#M37015</link>
      <description>&lt;P&gt;Do i need to water mark it while writing to temp table and in sql statement too&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 15:53:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83829#M37015</guid>
      <dc:creator>BricksGuy</dc:creator>
      <dc:date>2024-08-21T15:53:43Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83889#M37055</link>
      <description>&lt;P&gt;it is necessary for the join so if the dataframe has a watermark that's enough.&lt;BR /&gt;No need to define it multiple times.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Aug 2024 07:13:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83889#M37055</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-08-22T07:13:56Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83970#M37087</link>
      <description>&lt;P&gt;I did put watermark on the data frame but still getting the same error while executing sql&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 22 Aug 2024 20:19:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83970#M37087</guid>
      <dc:creator>BricksGuy</dc:creator>
      <dc:date>2024-08-22T20:19:03Z</dc:date>
    </item>
    <item>
      <title>Re: WATER MARK ERROR WHILE JOINING WITH MULTIPLE STREAM TABLES</title>
      <link>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83991#M37093</link>
      <description>&lt;P&gt;Sorry, I put the watermark on the non-streaming table.&amp;nbsp; That is wrong of course, the watermark has to be set on the streaming table (source in this case).&lt;/P&gt;</description>
      <pubDate>Fri, 23 Aug 2024 06:16:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/water-mark-error-while-joining-with-multiple-stream-tables/m-p/83991#M37093</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2024-08-23T06:16:49Z</dc:date>
    </item>
  </channel>
</rss>

