<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How can I enable disk cache in this scenario? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62021#M31873</link>
    <description>&lt;P&gt;I have a notebook that reads multiple tables from Delta Lake (let's say the schema is db) and then applies transformations such as joins and filters across all of these tables (image enclosed). After the transformation, when I write the result to a Delta table, Databricks shows an insight recommending that I use the disk cache (image enclosed). How can I use the disk cache in this scenario? I used&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark.conf.get("spark.databricks.io.cache.enabled", "true") to enable the disk cache, but I still get the same insight.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Also, whenever I write the final DataFrame to any table in Delta Lake, I get the same insight recommending the disk cache.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;How can I fix this? Is there any other optimization technique I can adopt instead?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Please check the enclosed image.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Mon, 26 Feb 2024 18:50:59 GMT</pubDate>
    <dc:creator>anupam676</dc:creator>
    <dc:date>2024-02-26T18:50:59Z</dc:date>
    <item>
      <title>How can I enable disk cache in this scenario?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62021#M31873</link>
      <description>&lt;P&gt;I have a notebook that reads multiple tables from Delta Lake (let's say the schema is db) and then applies transformations such as joins and filters across all of these tables (image enclosed). After the transformation, when I write the result to a Delta table, Databricks shows an insight recommending that I use the disk cache (image enclosed). How can I use the disk cache in this scenario? I used&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark.conf.get("spark.databricks.io.cache.enabled", "true") to enable the disk cache, but I still get the same insight.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Also, whenever I write the final DataFrame to any table in Delta Lake, I get the same insight recommending the disk cache.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;How can I fix this? Is there any other optimization technique I can adopt instead?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Please check the enclosed image.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 26 Feb 2024 18:50:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62021#M31873</guid>
      <dc:creator>anupam676</dc:creator>
      <dc:date>2024-02-26T18:50:59Z</dc:date>
    </item>
    <item>
      <title>Re: How can I enable disk cache in this scenario?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62027#M31878</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/100909"&gt;@anupam676&lt;/a&gt;&amp;nbsp;- could you please use the set function instead of get? spark.conf.get only reads the current value; it does not change the setting.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;spark.conf.set("spark.databricks.io.cache.enabled", "true")&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 26 Feb 2024 19:54:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62027#M31878</guid>
      <dc:creator>shan_chandra</dc:creator>
      <dc:date>2024-02-26T19:54:06Z</dc:date>
    </item>
    <item>
      <title>Re: How can I enable disk cache in this scenario?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62066#M31892</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/616"&gt;@shan_chandra&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2024 08:11:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-can-i-enable-disk-cache-in-this-scenario/m-p/62066#M31892</guid>
      <dc:creator>anupam676</dc:creator>
      <dc:date>2024-02-27T08:11:46Z</dc:date>
    </item>
  </channel>
</rss>

