<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Stop Cache in free edition in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/135186#M50297</link>
    <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/188931"&gt;@Hritik_Moon&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. check if your data is cached, this you can see in sparkUI &amp;gt; storage tab.&lt;/P&gt;&lt;P&gt;2. if it is not cached, try to add a action statement after you cache. eg : df.count(). Data is cached with the first action statement it encounters. Now check in spark UI.&lt;/P&gt;&lt;P&gt;3. if you have only one action statement, you dont see any difference. But if you have multiple action statement, you tend to see the relevant transformations before your cached dataframe gets skipped. You can see these skips in your DAG.&lt;/P&gt;</description>
    <pubDate>Fri, 17 Oct 2025 04:14:34 GMT</pubDate>
    <dc:creator>Prajapathy_NKR</dc:creator>
    <dc:date>2025-10-17T04:14:34Z</dc:date>
    <item>
      <title>Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134734#M50178</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am using databricks free edition, is there a way to turn off IO caching.&lt;/P&gt;&lt;P&gt;I am trying to learn optimization and cant see any difference in query run time with caching enabled.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 11:51:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134734#M50178</guid>
      <dc:creator>Hritik_Moon</dc:creator>
      <dc:date>2025-10-13T11:51:04Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134741#M50181</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/188931"&gt;@Hritik_Moon&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I guess you cannot. To disable disk cache you need to have ability to run following command:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;spark.conf.set("spark.databricks.io.cache.enabled", "[true | false]")&lt;/LI-CODE&gt;&lt;P&gt;But serverless &lt;SPAN&gt;compute does not support setting most Spark properties for notebooks or jobs. The following are the properties you can configure:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="szymon_dybczak_0-1760357977345.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20700iE0734595DA2350B9/image-size/medium?v=v2&amp;amp;px=400" role="button" title="szymon_dybczak_0-1760357977345.png" alt="szymon_dybczak_0-1760357977345.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, if you want to have a proper envirionment to learn apache spark optimization use OSS Apache Spark docker container as an alternative&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:20:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134741#M50181</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-10-13T12:20:14Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134743#M50183</link>
      <description>&lt;P&gt;Thanks, I have no prior experience with docker and how to get spark but I guess youtube will help&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":beaming_face_with_smiling_eyes:"&gt;😁&lt;/span&gt;.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:26:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134743#M50183</guid>
      <dc:creator>Hritik_Moon</dc:creator>
      <dc:date>2025-10-13T12:26:45Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134744#M50184</link>
      <description>&lt;P&gt;Yep, it's really simple to setup. As an added benefit you will have a full control over your environment &lt;span class="lia-unicode-emoji" title=":grinning_face_with_smiling_eyes:"&gt;😄&lt;/span&gt; Here you have an yt video that shows how to setup it:&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.youtube.com/watch?v=FteThJ-YvXk" target="_blank"&gt;How to Run a Spark Cluster with Multiple Workers Locally Using Docker&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:29:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134744#M50184</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-10-13T12:29:32Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134745#M50185</link>
      <description>&lt;P&gt;Thanks, I will be back later with additional questions&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":smiling_face_with_smiling_eyes:"&gt;😊&lt;/span&gt;.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:31:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134745#M50185</guid>
      <dc:creator>Hritik_Moon</dc:creator>
      <dc:date>2025-10-13T12:31:54Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134746#M50186</link>
      <description>&lt;P&gt;Sure, one suggestion though. If your next question will be related to cache then ask it here. But if it will be something completely unrelated to this topic, please start new one.&amp;nbsp;&lt;BR /&gt;Usually, all questions and answers should be related to given thread. This way it's much easier for others to find what they're looking for. Also, if someone's answer solved your issue/help you try to pick that answer as a solution for a given thread.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Oct 2025 12:37:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/134746#M50186</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-10-13T12:37:34Z</dc:date>
    </item>
    <item>
      <title>Re: Stop Cache in free edition</title>
      <link>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/135186#M50297</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/188931"&gt;@Hritik_Moon&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. check if your data is cached, this you can see in sparkUI &amp;gt; storage tab.&lt;/P&gt;&lt;P&gt;2. if it is not cached, try to add a action statement after you cache. eg : df.count(). Data is cached with the first action statement it encounters. Now check in spark UI.&lt;/P&gt;&lt;P&gt;3. if you have only one action statement, you dont see any difference. But if you have multiple action statement, you tend to see the relevant transformations before your cached dataframe gets skipped. You can see these skips in your DAG.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Oct 2025 04:14:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stop-cache-in-free-edition/m-p/135186#M50297</guid>
      <dc:creator>Prajapathy_NKR</dc:creator>
      <dc:date>2025-10-17T04:14:34Z</dc:date>
    </item>
  </channel>
</rss>

