<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data pro in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/81009#M36194</link>
    <description>&lt;P&gt;What about Temporary view vs Temporary function as both are lazy evaluations. Any preference of when to use view vs function. Usually UDFs are not recommended in where clause. Does this still holds true in Databricks ?&lt;/P&gt;</description>
    <pubDate>Mon, 29 Jul 2024 18:31:10 GMT</pubDate>
    <dc:creator>Abhot</dc:creator>
    <dc:date>2024-07-29T18:31:10Z</dc:date>
    <item>
      <title>Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data processing</title>
      <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/4087#M903</link>
      <description>&lt;P&gt;Hello , &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1 ) Which one is better during large data processing - Temp table vs Temporary view vs temp Table function . &lt;/P&gt;&lt;P&gt;2) How lazy evaluation better for processing ? and which one of the above helps in lazy evaluation &lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 May 2023 20:23:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/4087#M903</guid>
      <dc:creator>Abhot</dc:creator>
      <dc:date>2023-05-22T20:23:43Z</dc:date>
    </item>
    <item>
      <title>Re: Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data pro</title>
      <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/80980#M36183</link>
      <description>&lt;P&gt;Does anyone have any suggestions regarding the question above?&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 13:47:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/80980#M36183</guid>
      <dc:creator>Abhot</dc:creator>
      <dc:date>2024-07-29T13:47:11Z</dc:date>
    </item>
    <item>
      <title>Re: Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data pro</title>
      <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/80982#M36185</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/65869"&gt;@Abhot&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;It depends, if you're going to use it multiple times in your processing then creating temp view will be slower due to lazy evaluation (unless you persist that view (dataframe cache)).&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;SPAN&gt;Every time your temp view is called, it will execute the view definition/code. This also depends upon your cluster capacity, how complex the view definition is, how much data the temp view has.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;So every time you use view name it will be recomputed. In such scenarios it's better to use temporary table, especially if transformantion are complex.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 14:09:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/80982#M36185</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-07-29T14:09:09Z</dc:date>
    </item>
    <item>
      <title>Re: Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data pro</title>
      <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/81009#M36194</link>
      <description>&lt;P&gt;What about Temporary view vs Temporary function as both are lazy evaluations. Any preference of when to use view vs function. Usually UDFs are not recommended in where clause. Does this still holds true in Databricks ?&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 18:31:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/81009#M36194</guid>
      <dc:creator>Abhot</dc:creator>
      <dc:date>2024-07-29T18:31:10Z</dc:date>
    </item>
    <item>
      <title>Re: Temp Table Vs Temp View Vs temp table function- which one is better for large Databrick data pro</title>
      <link>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/81010#M36195</link>
      <description>&lt;P&gt;Most often it's better to use temp view over UDFS.&amp;nbsp;&lt;SPAN&gt;UDFs might introduce significant processing bottlenecks into code execution. Databricks uses a number of different optimizers automatically for code written with included Apache Spark, SQL, and Delta Lake syntax. When custom logic is introduced by UDFs, these optimizers do not have the ability to efficiently plan tasks around this custom logic. In addition, logic that executes outside the JVM has additional costs around data serialization.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 18:35:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/temp-table-vs-temp-view-vs-temp-table-function-which-one-is/m-p/81010#M36195</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-07-29T18:35:49Z</dc:date>
    </item>
  </channel>
</rss>

